Gene EcSMS35_3178 details


Gene Information

Locus tag: EcSMS35_3178
Symbol: iutA
ID: 6143094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3257233
End bp: 3259434
Gene Length: 2202 bp
Protein Length: 733 aa
Translation table: 11
GC content: 55%
IMG OID: 641618018
Product: ferric aerobactin receptor IutA
Protein accession: YP_001745168
Protein GI: 170680936
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
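
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is only an illustration of that arithmetic: the function name `feature_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, which is consistent with the values in this record.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(feature_lengths(3257233, 3259434))  # (2202, 733)
```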


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.911837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATAA GCAAAAAGTA TACGCTTTGG GCTCTCAACC CACTGCTTCT TACCATGATG 
GCGCCAGCAG TCGCTCAACA AACCGATGAT GAAACGTTCG TGGTGTCTGC CAACCGCAGC
AATCGCACCG TAGCGGAGAT GGCGCAAACC ACCTGGGTTA TCGAAAACGC CGAACTGGAA
CAGCAGATTC AGGGCGGCAA AGAGCTTAAA GACGCACTGG CTCAGCTGAT CCCTGGCCTT
GACGTCAGCA GCCGGAGCCG CACCAACTAC GGTATGAATG TGCGTGGCCG CCCGCTGGTC
GTGCTGGTTG ACGGCGTGCG TCTCAACTCT TCACGTACCG ACAGCCGACA ACTGGACTCT
ATAGATCCTT TTAATATCGA CCATATTGAA GTGATCTCCG GTGCGACGTC CCTGTACGGC
GGCGGCAGTA CCGGTGGCCT GATCAACATC GTGACCAAAA AAGGCCAGCC GGAAACCATG
ATGGAGTTTG AGGCTGGCAC CAAAAGTGGC TTTAGCAGCA GTAAAGATCA CGATGAACGC
ATTGCCGGAG CTGTCTCCGG CGGAAATGAG CATATCTCCG GACGTCTTTC CGTGGCATAT
CAGAAATTTG GCGGCTGGTT TGACGGTAAC GGCGATGCCA CCTTGCTTGA TAACACCCAG
ACCGGCCTGC AGTACTCCGA TCGGCTGGAC ATCATGGGAA CTGGTACGCT GAACATCGAT
GAATCCCGGC AGCTTCAGTT GATCACACAG TACTATAAAA GCCAGGGCGA CGACGATTAC
GGGCTTAATC TCGGGAAAGG CTTCTCTGCC ATCAGAGGGA CCAGCACGCC ATTCGTCAGT
AACGGGCTGA ATTCCGACCG TATTCCCGGC ACTGAGCGGC ATTTGATCAG CCTGCAGTAC
TCTGACAGCG CTTTTCTGGG ACAGGAGCTG GTCGGTCAGG TTTACTACCG CGATGAGTCG
TTGCGATTCT ACCCGTTCCC GACGGTAAAT GCGAACAAAC AGGTGACGGC TTTCTCTTCG
TCACAGCAGG ACACCGACCA GTACGGCATG AAACTGACTC TGAACAGCAA ACCGATGGAC
GGCTGGCAAA TCACCTGGGG GCTGGATGCT GATCATGAGC GCTTTACCTC CAACCAGATG
TTCTTCGACC TGGCTCAGGC AAGCGCTTCC GGAGGGCTGA ACAACAAGAA GATTTACACC
ACCGGGCGCT ATCCGTCGTA TGACATCACC AACCTGGCGG CCTTCCTGCA ATCAGGCTAT
GACATCAATA ATCTCTTTAC CCTCAACGGT GGCGTACGCT ATCAGTACAC TGAAAACAAG
ATTGATGATT TCATCGGCTA CGCGCAGCAA CGGCAGATTG CCGCCGGGAA GGCTACATCC
GCCGACGCCA TTCCTGGCGG CTCAGTCGAT TACGACAACT TCCTGTTCAA CGCCGGTCTG
CTGATGCACA TCACCGAACG CCAGCAGGCA TGGCTCAACT TCTCCCAGGG CGTGGAGCTG
CCGGACCCGG GTAAATACTA TGGTCGCGGC ATCTATGGTG CTGCAGTGAA CGGCCATCTT
CCTCTAACAA AGAGTGTGAA CGTCAGCGAC AGCAAGCTGG AAGGCGTGAA AGTCGATTCT
TATGAGCTGG GCTGGCGCTT TACTGGCAAT AATCTGCGTA CCCAAATCGC GGCCTACTAT
TCGATTTCTG ATAAGAGCGT GGTGGCGAAT AAAGATCTGA CCATCAGCGT GGTGGACGAC
AAACGCCGTA TTTACGGCGT GGAAGGTGCG GTGGACTACC TGATTCCTGA TACTGACTGG
AGTACCGGAG TGAACTTCAA CGTGCTGAAA ACTGAGTCGA AAGTGAACGG TACCTGGCAG
AAATACGATG TGAAGACAGC AAGCCCATCA AAAGCGACAG CCTACATTGG CTGGGCACCG
GACCCGTGGA GTCTGCGCGT GCAGAGCACC ACCTCCTTTG ACGTGAGCGA CGCGCAGGGC
TACAAGGTCG ATGGCTATAC CACCGCGGAT CTGCTCGGCA GTTATCAGCT TCCGGTGGGT
ACACTCAGCT TCAGCATTGA AAACCTCTTC GACCGTGACT ACACCACTGT CTGGGGGCAG
CGTGCGCCAC TGTACTACAG CCCGGGTTAC GGCCCTGCTT CACTGTACGG CTACAAAGGC
AGGGGCCGCA CCTTTGGTCT GAGTTACTCA GTATTATTCT GA
 
Protein sequence
MMISKKYTLW ALNPLLLTMM APAVAQQTDD ETFVVSANRS NRTVAEMAQT TWVIENAELE 
QQIQGGKELK DALAQLIPGL DVSSRSRTNY GMNVRGRPLV VLVDGVRLNS SRTDSRQLDS
IDPFNIDHIE VISGATSLYG GGSTGGLINI VTKKGQPETM MEFEAGTKSG FSSSKDHDER
IAGAVSGGNE HISGRLSVAY QKFGGWFDGN GDATLLDNTQ TGLQYSDRLD IMGTGTLNID
ESRQLQLITQ YYKSQGDDDY GLNLGKGFSA IRGTSTPFVS NGLNSDRIPG TERHLISLQY
SDSAFLGQEL VGQVYYRDES LRFYPFPTVN ANKQVTAFSS SQQDTDQYGM KLTLNSKPMD
GWQITWGLDA DHERFTSNQM FFDLAQASAS GGLNNKKIYT TGRYPSYDIT NLAAFLQSGY
DINNLFTLNG GVRYQYTENK IDDFIGYAQQ RQIAAGKATS ADAIPGGSVD YDNFLFNAGL
LMHITERQQA WLNFSQGVEL PDPGKYYGRG IYGAAVNGHL PLTKSVNVSD SKLEGVKVDS
YELGWRFTGN NLRTQIAAYY SISDKSVVAN KDLTISVVDD KRRIYGVEGA VDYLIPDTDW
STGVNFNVLK TESKVNGTWQ KYDVKTASPS KATAYIGWAP DPWSLRVQST TSFDVSDAQG
YKVDGYTTAD LLGSYQLPVG TLSFSIENLF DRDYTTVWGQ RAPLYYSPGY GPASLYGYKG
RGRTFGLSYS VLF
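
The two sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of this database. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares with table 1; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments of
# translation table 11 are the same as those of the standard table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, truncated to its first 30 bases.
print(translate("ATGATGATAA GCAAAAAGTA TACGCTTTGG"))  # MMISKKYTLW
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean_sequence` would allow the two to be compared directly, and `gc_fraction` on the full gene sequence can be checked against the GC content listed in the gene information.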