Gene Information |
Locus tag | EcSMS35_3178 |
Symbol | iutA |
ID | 6143094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3257233 |
End bp | 3259434 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618018 |
Product | ferric aerobactin receptor IutA |
Protein accession | YP_001745168 |
Protein GI | 170680936 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
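
The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. The record does not state either convention, though both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3257233
END_BP = 3259434
REPORTED_GENE_LENGTH = 2202      # bp
REPORTED_PROTEIN_LENGTH = 733    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both checks agree with the reported values. The strand field does not affect the length calculation, only the orientation in which the sequence is read.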
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.911837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATAA GCAAAAAGTA TACGCTTTGG GCTCTCAACC CACTGCTTCT TACCATGATG GCGCCAGCAG TCGCTCAACA AACCGATGAT GAAACGTTCG TGGTGTCTGC CAACCGCAGC AATCGCACCG TAGCGGAGAT GGCGCAAACC ACCTGGGTTA TCGAAAACGC CGAACTGGAA CAGCAGATTC AGGGCGGCAA AGAGCTTAAA GACGCACTGG CTCAGCTGAT CCCTGGCCTT GACGTCAGCA GCCGGAGCCG CACCAACTAC GGTATGAATG TGCGTGGCCG CCCGCTGGTC GTGCTGGTTG ACGGCGTGCG TCTCAACTCT TCACGTACCG ACAGCCGACA ACTGGACTCT ATAGATCCTT TTAATATCGA CCATATTGAA GTGATCTCCG GTGCGACGTC CCTGTACGGC GGCGGCAGTA CCGGTGGCCT GATCAACATC GTGACCAAAA AAGGCCAGCC GGAAACCATG ATGGAGTTTG AGGCTGGCAC CAAAAGTGGC TTTAGCAGCA GTAAAGATCA CGATGAACGC ATTGCCGGAG CTGTCTCCGG CGGAAATGAG CATATCTCCG GACGTCTTTC CGTGGCATAT CAGAAATTTG GCGGCTGGTT TGACGGTAAC GGCGATGCCA CCTTGCTTGA TAACACCCAG ACCGGCCTGC AGTACTCCGA TCGGCTGGAC ATCATGGGAA CTGGTACGCT GAACATCGAT GAATCCCGGC AGCTTCAGTT GATCACACAG TACTATAAAA GCCAGGGCGA CGACGATTAC GGGCTTAATC TCGGGAAAGG CTTCTCTGCC ATCAGAGGGA CCAGCACGCC ATTCGTCAGT AACGGGCTGA ATTCCGACCG TATTCCCGGC ACTGAGCGGC ATTTGATCAG CCTGCAGTAC TCTGACAGCG CTTTTCTGGG ACAGGAGCTG GTCGGTCAGG TTTACTACCG CGATGAGTCG TTGCGATTCT ACCCGTTCCC GACGGTAAAT GCGAACAAAC AGGTGACGGC TTTCTCTTCG TCACAGCAGG ACACCGACCA GTACGGCATG AAACTGACTC TGAACAGCAA ACCGATGGAC GGCTGGCAAA TCACCTGGGG GCTGGATGCT GATCATGAGC GCTTTACCTC CAACCAGATG TTCTTCGACC TGGCTCAGGC AAGCGCTTCC GGAGGGCTGA ACAACAAGAA GATTTACACC ACCGGGCGCT ATCCGTCGTA TGACATCACC AACCTGGCGG CCTTCCTGCA ATCAGGCTAT GACATCAATA ATCTCTTTAC CCTCAACGGT GGCGTACGCT ATCAGTACAC TGAAAACAAG ATTGATGATT TCATCGGCTA CGCGCAGCAA CGGCAGATTG CCGCCGGGAA GGCTACATCC GCCGACGCCA TTCCTGGCGG CTCAGTCGAT TACGACAACT TCCTGTTCAA CGCCGGTCTG CTGATGCACA TCACCGAACG CCAGCAGGCA TGGCTCAACT TCTCCCAGGG CGTGGAGCTG CCGGACCCGG GTAAATACTA TGGTCGCGGC ATCTATGGTG CTGCAGTGAA CGGCCATCTT CCTCTAACAA AGAGTGTGAA CGTCAGCGAC AGCAAGCTGG AAGGCGTGAA AGTCGATTCT TATGAGCTGG GCTGGCGCTT TACTGGCAAT AATCTGCGTA CCCAAATCGC GGCCTACTAT TCGATTTCTG ATAAGAGCGT GGTGGCGAAT AAAGATCTGA CCATCAGCGT GGTGGACGAC AAACGCCGTA TTTACGGCGT GGAAGGTGCG GTGGACTACC TGATTCCTGA TACTGACTGG 
AGTACCGGAG TGAACTTCAA CGTGCTGAAA ACTGAGTCGA AAGTGAACGG TACCTGGCAG AAATACGATG TGAAGACAGC AAGCCCATCA AAAGCGACAG CCTACATTGG CTGGGCACCG GACCCGTGGA GTCTGCGCGT GCAGAGCACC ACCTCCTTTG ACGTGAGCGA CGCGCAGGGC TACAAGGTCG ATGGCTATAC CACCGCGGAT CTGCTCGGCA GTTATCAGCT TCCGGTGGGT ACACTCAGCT TCAGCATTGA AAACCTCTTC GACCGTGACT ACACCACTGT CTGGGGGCAG CGTGCGCCAC TGTACTACAG CCCGGGTTAC GGCCCTGCTT CACTGTACGG CTACAAAGGC AGGGGCCGCA CCTTTGGTCT GAGTTACTCA GTATTATTCT GA
|
Protein sequence | MMISKKYTLW ALNPLLLTMM APAVAQQTDD ETFVVSANRS NRTVAEMAQT TWVIENAELE QQIQGGKELK DALAQLIPGL DVSSRSRTNY GMNVRGRPLV VLVDGVRLNS SRTDSRQLDS IDPFNIDHIE VISGATSLYG GGSTGGLINI VTKKGQPETM MEFEAGTKSG FSSSKDHDER IAGAVSGGNE HISGRLSVAY QKFGGWFDGN GDATLLDNTQ TGLQYSDRLD IMGTGTLNID ESRQLQLITQ YYKSQGDDDY GLNLGKGFSA IRGTSTPFVS NGLNSDRIPG TERHLISLQY SDSAFLGQEL VGQVYYRDES LRFYPFPTVN ANKQVTAFSS SQQDTDQYGM KLTLNSKPMD GWQITWGLDA DHERFTSNQM FFDLAQASAS GGLNNKKIYT TGRYPSYDIT NLAAFLQSGY DINNLFTLNG GVRYQYTENK IDDFIGYAQQ RQIAAGKATS ADAIPGGSVD YDNFLFNAGL LMHITERQQA WLNFSQGVEL PDPGKYYGRG IYGAAVNGHL PLTKSVNVSD SKLEGVKVDS YELGWRFTGN NLRTQIAAYY SISDKSVVAN KDLTISVVDD KRRIYGVEGA VDYLIPDTDW STGVNFNVLK TESKVNGTWQ KYDVKTASPS KATAYIGWAP DPWSLRVQST TSFDVSDAQG YKVDGYTTAD LLGSYQLPVG TLSFSIENLF DRDYTTVWGQ RAPLYYSPGY GPASLYGYKG RGRTFGLSYS VLF