Gene EcSMS35_2025 details


Gene Information

Locus tag: EcSMS35_2025
Symbol: fhuE
ID: 6145621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2044976
End bp: 2047165
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 50%
IMG OID: 641616901
Product: ferric-rhodotorulic acid outer membrane transporter
Protein accession: YP_001744077
Protein GI: 170682970
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
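
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names gene_length and expected_protein_length are hypothetical and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Protein length in aa for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2044976, 2047165
length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 2190 729
```

Both results agree with the Gene Length and Protein Length fields of this record.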


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000237263
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0000045764
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTTTCGA CACAATTTAA CAGGGATAAT CAACATCAAG CTATCATCAA ATCGTCACTC 
CTTGCCGGTT GCATAGCACT GGCACTATTA CCTTCCGCCG CTTTTGCTGC ACCAGTCACT
GAAGAAACGG TGATTGTTGA GGGTTCAGCC ACAGCCCCAG ATGATGGCGA AAATGATTAC
AACGTAACGT CTACCTCTGC GGGTACAAAA ATGCAGATGA CTCAACGTGA TATTCCTCAG
TCGGTCACTA TTGTTAGCCA GCAGCGGATG GAAGATCAGC AGTTACAAAC GCTGGGCGAA
GTGATGGAAA ACACGCTGGG GATCAGCAAA AGTCAGGCGG ATTCCGATCG TGCTCTTTAT
TATTCCCGCG GATTCCAGAT CGATAACTAT ATGGTTGATG GTATCCCCAC CTATTTTGAA
TCGCGCTGGA ATCTGGGCGA CGCACTTTCT GATATGGCAC TGTTTGAACG CGTAGAAGTA
GTGCGTGGCG CGACAGGCCT CATGACCGGG ACGGGTAATC CATCTGCGGC AATTAATATG
GTTCGAAAAC ACGCGACCAG TCGTGAATTT AAAGGCGATG TCTCGGCGGA ATACGGTAGC
TGGAACAAAG AACGGTATGT GGCGGATTTA CAAAGCCCAC TCACCGAAGA TGGTAAAATC
CGCGCACGAA TTGTCGGCGG CTACCAGAAT AACGACTCAT GGCTGGACCG CTACAACAGT
GAAAAGACCT TCTTCTCCGG CATTGTCGAT GCTGATTTAG GCGATCTTAC GATGCTTTCA
GCCGGTTACG AATATCAGCG TATTGATGTT AATAGCCCAA CCTGGGGCGG TTTACCGCGC
TGGAATACTG ATGGCAGCAG CAACAGTTAC GATCGCGCAC GCAGTACCGC ACCTGACTGG
GCGTACAACG ATAAAGAGAT CAACAAGGTC TTTATGACCC TGAAGCAGCG GTTTGCTGAT
ACCTGGCAAG CGACGCTGAA TGCCACCCAC TCTGAAGTCG AATTTGACAG CAAAATGATG
TATGTCGATG CCTATGTAAA CAAAGCGGAT GGTATGCTGG TTGGGCCATA CAGTAATTAT
GGACCTGGTT TTGATTATGT CGGCGGCACC GGTTGGAACA GCGGCAAACG TAAAGTTGAT
GCGCTGGATT TGTTCGCTGA CGGTAGTTAT GAATTGTTTG GTCGTCAGCA CAATCTAATG
TTTGGTGGCA GTTACAGCAA ACAAAACAAT CGTTACTTCA GTTCATGGGC CAACATCTTC
CCGGATGAAA TTGGCAGTTT CTACAACTTT AATGGCAATT TCCCACAAAC CGACTGGTCA
CCACAGAGCC TGGCGCAGGA CGATACCACA CATATGAAAT CGTTATATGC TGCCACTCGT
GTCACCCTTG CCGATCCGCT GCATCTGATC CTCGGCGCTC GTTATACCAA CTGGCGGGTT
GATACGCTGA CTTACAGCAT GGAGAAAAAC CACACCACGC CTTATGCTGG TCTGGTGTTT
GACATCAATG ACAACTGGTC GACCTACGCC AGCTATACCT CTATTTTCCA GCCGCAAAAT
GATCGTGACA GTTCAGGTAA ATATCTGACT CCAATCACCG GTAACAACTA CGAGTTGGGT
CTGAAATCGG ACTGGATGAA TAGCCGTCTG ACCACCACGT TAGCCATCTT CCGTATTGAG
CAGGATAATG TCGCTCAGTC CACCGGTACA CCTATCCCCG GCAGCAACGG CGAAACCGCC
TATAAAGCGG TGGATGGGAC AGTCAGTAAA GGTGTGGAAT TTGAACTCAA CGGCGCAATT
ACCGACAACT GGCAGCTGAC GTTTGGCGCA ACGCGCTATA TTGCAGAGGA TAACGAAGGA
AACGCCGTTA ATCCTAATCT GCCACGCTCC ACGATTAAAA TGTTCACCAG CTATCGGTTG
CCTGTCATGC TAGAGTTGAC GGTCGGCGGT GGTGTTAACT GGCAAAATCG CGTGTATACC
GACACCGTGA CACCGTATGG CACCTTCCGC GCCGAGCAAG GTAGCTACGC GCTGGTGGAT
CTCTTCACCC GCTACCAGGT GACGAAAAAC TTCTCGTTAC AGGGGAACGT CAATAACCTG
TTCGACAAAA CCTACGATAC CAACGTGGAA GGTTCTATCG TCTACGGCGC ACCGCGTAAT
TTCAGCATTA CCGGCACGTA TCAATTCTGA
 
Protein sequence
MLSTQFNRDN QHQAIIKSSL LAGCIALALL PSAAFAAPVT EETVIVEGSA TAPDDGENDY 
NVTSTSAGTK MQMTQRDIPQ SVTIVSQQRM EDQQLQTLGE VMENTLGISK SQADSDRALY
YSRGFQIDNY MVDGIPTYFE SRWNLGDALS DMALFERVEV VRGATGLMTG TGNPSAAINM
VRKHATSREF KGDVSAEYGS WNKERYVADL QSPLTEDGKI RARIVGGYQN NDSWLDRYNS
EKTFFSGIVD ADLGDLTMLS AGYEYQRIDV NSPTWGGLPR WNTDGSSNSY DRARSTAPDW
AYNDKEINKV FMTLKQRFAD TWQATLNATH SEVEFDSKMM YVDAYVNKAD GMLVGPYSNY
GPGFDYVGGT GWNSGKRKVD ALDLFADGSY ELFGRQHNLM FGGSYSKQNN RYFSSWANIF
PDEIGSFYNF NGNFPQTDWS PQSLAQDDTT HMKSLYAATR VTLADPLHLI LGARYTNWRV
DTLTYSMEKN HTTPYAGLVF DINDNWSTYA SYTSIFQPQN DRDSSGKYLT PITGNNYELG
LKSDWMNSRL TTTLAIFRIE QDNVAQSTGT PIPGSNGETA YKAVDGTVSK GVEFELNGAI
TDNWQLTFGA TRYIAEDNEG NAVNPNLPRS TIKMFTSYRL PVMLELTVGG GVNWQNRVYT
DTVTPYGTFR AEQGSYALVD LFTRYQVTKN FSLQGNVNNL FDKTYDTNVE GSIVYGAPRN
FSITGTYQF
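
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is given 5' to 3' in the coding frame, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in which start codons it permits). The helper names clean, gc_content, and translate are hypothetical.

```python
# Standard genetic code laid out in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments for all 64 codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_block = "ATGCTTTCGA CACAATTTAA CAGGGATAAT CAACATCAAG CTATCATCAA ATCGTCACTC"
print(translate(first_block))  # MLSTQFNRDNQHQAIIKSSL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to gc_content would give a value comparable to the GC content field of the record.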