Gene EcSMS35_1720 details


Gene Information

Locus tag: EcSMS35_1720
Symbol:
ID: 6144855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1728170
End bp: 1730272
Gene Length: 2103 bp
Protein Length: 700 aa
Translation table: 11
GC content: 51%
IMG OID: 641616596
Product: TonB-dependent receptor
Protein accession: YP_001743774
Protein GI: 170680750
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
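
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 1728170
END_BP = 1730272
GENE_LENGTH_BP = 2103
PROTEIN_LENGTH_AA = 700


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon in the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2103
print(coding_codons(GENE_LENGTH_BP))   # 700
```

Under these assumptions both results match the Gene Length and Protein Length fields, which is consistent with the gene sequence ending in the stop codon TAA.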


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.000201404
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATTT TTTCCGTCCG ACAGACCGTT TTGCCCGCAC TGCTTGTCCT TTCCCCCGTT 
GTTTTTGCCG CTGATGAACA GACCATGATT GTCAGTGCCG CGCCGCAGGT GGTTTCAGAA
CTGGATACGC CAGCAGCAGT AAGCGTGGTG GATGGCGAGG AGATGCGCCT GGCAACACCG
CGCATTAACT TGTCTGAATC ACTGACTGGC GTGCCTGGTT TGCAGGTACA AAACCGGCAG
AACTATGCGC AAGATTTACA GCTGTCGATT CGCGGATTTG GCTCCCGCTC CACTTACGGA
ATTCGCGGTA TTCGCCTGTA TGTGGACGGT ATTCCCGCCA CCATGCCCGA CGGGCAAGGG
CAAACATCCA ACATCGATTT AAGCAGTGTG CAAAATGTGG AAGTGCTGCG TGGCCCCTTC
TCTGCTCTGT ATGGCAACGC GTCTGGCGGT GTAATGAATG TCATCACCCA GACCGGACAA
CAGCCACCAA CCATTGAAGC CAGTAGTTAC TACGGCAGTT TTGGCAGCTG GCGCTATGGG
CTGAAAGCAA CAGGCGCAAC GGGCGACGGC ACACAGCCTG GCGATGTCGA TTACACCGTC
TCAACCACGC GTTTTACGAC CCACGGCTAT CGTGACCATA GTGGCGCACA GAAAAATTTA
GCCAATGCCA AACTGGGCGT TCGCATTGAT GAAGCCAGCA AATTAAGCCT GATTTTCAAT
AGTGTGGATA TCAAAGCAGA TGACCCAGGT GGGCTAACCG AAGCAGAATG GAAGGCGAAT
CCGCAACAAG CGCCACGCGC TGAACAGTAC GACACGCGAA AAACCATCAA GCAAACTCAG
GCTGGGTTGC GCTATGAACG TAGCCTGAGC GCGCAAGATG ATATGAGTGT AATGATGTAT
GCCGGAGAGC GAGAAACGAC CCAGTACCAG TCAATCCCGA TGGCCCCTCA ACTTAATCCG
TCGCATGCGG GCGGCGTGAT TACTCTGCAA CGCCATTATC AGGGGATAGA CAGCCGCTGG
ACACACCATG GCGAGCTGGG TGTTCCGGTC ACGTTCACTA CTGGCCTGAA CTACGAAAAC
ATGAGTGAAA ACCGCAAGGG CTACAATAAC TTCCGCCTGA ATAGCGGCGT GCCGGAATAC
GGGCAAAAAG GTGAGTTACG TCGCGACGAA CGCAATCTGA TGTGGAATGT CGATCCCTAT
TTACAGACAC AGTGGCAGCT GAGCGAAAAA CTGTCGCTGG ATGCTGGCGT GCGCTACAGC
TCCGTATGGT TTGATTCCAA CGACCATTAC GTTACTCCGG GTAACGGCGA TGACAGCGGT
GATGCCAGTT ATCACAAATG GCTACCTGCC GGTTCGTTAA AATATGCAAT GAACGATGCC
TGGAATATCT ATCTGGCAGC CGGGCGTGGT TTTGAAACGC CGACGATTAA TGAACTGTCT
TATCGCGCTG ATGGGCAAAG CGGTATGAAC TTTGGTTTAA AACCATCTAC CAACGATACA
ATTGAGATCG GCAGTAAAAC GCGTATTGGT GATGGGCTGC TGAGTCTCGC ATTGTTCCAG
ACCGACACCG ATGATGAAAT TGTTGTCGAT AGCAGTAGCG GTGGGCGTAC CACATATAAA
AATGCCGGAA AGACCCGTCG TCAAGGCGCT GAACTGGCAT GGGATCAACG TTTCGCGGGA
GATTTTCGCG TAAAAGCGTC CTGGACCTGG CTTGATGCGA CCTATCGCAG CAATGTGTGC
AATGAACAGG ATTGTAACGG TAACCGGATG CCAGGGATCG CCCGTAATAT GGGCTTTGCA
TCGATAGGTT ATGTACCGGA AGAAGGCTGG TATGCAGGCA CGGAAGCGCG ATATATGGGC
GATATTATGG CAGATGATGA AAATACGGCC AAAGCGCCGT CTTATACTCT CGTCGGCTTA
TTCACCGGGT ATAAATACAA TTACCACAAT TTGACTGTGG ATTTATTTGG TCGTGTCGAT
AATTTATTCG ATAAAGGATA CGTTGGTTCT GTCATTGTCA ATGAGTCAAA CGGTCGATAT
TACGAACCTG CGCCCGGGCG AAATTATGGT GTCGGCATGA ATATTGCGTG GCGATTTGAG
TAA
 
Protein sequence
MKIFSVRQTV LPALLVLSPV VFAADEQTMI VSAAPQVVSE LDTPAAVSVV DGEEMRLATP 
RINLSESLTG VPGLQVQNRQ NYAQDLQLSI RGFGSRSTYG IRGIRLYVDG IPATMPDGQG
QTSNIDLSSV QNVEVLRGPF SALYGNASGG VMNVITQTGQ QPPTIEASSY YGSFGSWRYG
LKATGATGDG TQPGDVDYTV STTRFTTHGY RDHSGAQKNL ANAKLGVRID EASKLSLIFN
SVDIKADDPG GLTEAEWKAN PQQAPRAEQY DTRKTIKQTQ AGLRYERSLS AQDDMSVMMY
AGERETTQYQ SIPMAPQLNP SHAGGVITLQ RHYQGIDSRW THHGELGVPV TFTTGLNYEN
MSENRKGYNN FRLNSGVPEY GQKGELRRDE RNLMWNVDPY LQTQWQLSEK LSLDAGVRYS
SVWFDSNDHY VTPGNGDDSG DASYHKWLPA GSLKYAMNDA WNIYLAAGRG FETPTINELS
YRADGQSGMN FGLKPSTNDT IEIGSKTRIG DGLLSLALFQ TDTDDEIVVD SSSGGRTTYK
NAGKTRRQGA ELAWDQRFAG DFRVKASWTW LDATYRSNVC NEQDCNGNRM PGIARNMGFA
SIGYVPEEGW YAGTEARYMG DIMADDENTA KAPSYTLVGL FTGYKYNYHN LTVDLFGRVD
NLFDKGYVGS VIVNESNGRY YEPAPGRNYG VGMNIAWRFE