Gene EcSMS35_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1947 
Symbol 
ID6146065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1968414 
End bp1969436 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content54% 
IMG OID641616823 
Productiron(III) ABC transporter, periplasmic iron(III)-binding protein 
Protein accessionYP_001743999 
Protein GI170681965 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0791434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTAA CCAGACGGAG GTTTACACAG ATTCTTGCGT CGACATTGTT CCTGCATCAT 
CTGCCGTCCT TTGCACAATC AGTCAAATTC TGGTCCTCAC TGACGCTTCC CGAAGCCCAA
AACATTACAC GCATCGTCAG CGCAGGCGCG CCCGCCGATT TACTATTGCT GGCTGTCGCG
CCAGAAAAAA TGGTCGGTTT TTCCTCTTTT GATTTTGCCC GTCAGGCATT AATTCCATTG
CCAGAGCACA TTCGCCAGTT CCCCAGGCTG GGACGACTCG CCGGGCGCGC CAGCACACTC
TCGCTGGAAG GGCTTATGGC GTTACATCCC GATTTGGTCG TTGATTGCGG CAATACGGAT
GAAACCTGGA TCTCCCAGGC ACGGCAGGTT AGCGAACAGA CACAAATACC CTGGTTATTG
CTTAACGGGA AACTGGAACA ATCAGCAGAA CAGTTAACAA CGCTTGGCAA AACGTTAGGC
GAAGAGCACC GCGCCGCAGA ACAAGCCAAT CTCGCCAGCC GCTTCGTTGG TGAAGCTCAG
GCATTCGCCA CCTCACCCGC CGCTAACCTC AGCTTTTATG CTGCGCGCGG TCCTCGAGGG
CTGGAAACGG GCTTACAGGG TTCGTTGCAT ACCGAGGCGG CGGAATTATT AGGTTTGCAC
AACGTCGCGC AAATAGCCGA TCGCCACGGT CTGACACAGG TTTCCATGGA AAATCTCCTG
CGCTGGCAGC CGGATATTAT TCTGGTTCAG GAGGCCGTTA CTGCAGATTT TATTCGTCGT
GATCCTCTCT GGCAGGGCGT GAAAGCGGTT GCGGAACAAC GCATCTTATT TTTAAGTGGC
CTGCCCTTTG GCTGGCTGGA TGCCCCGCCG GGAATCAACC GTCTTCTGGG ATTACGCAGA
CTTCACGCCT GGCTGGATCC CGCCATCAAT CGCCAGTTTA AAAGTGACAT GCAGCATTAC
GCCCAACTGT TCTGGCATTG TTCACTCAGT GACGCCGACT ATCAAAAATT GGTGGCGAGC
TAA
 
Protein sequence
MSLTRRRFTQ ILASTLFLHH LPSFAQSVKF WSSLTLPEAQ NITRIVSAGA PADLLLLAVA 
PEKMVGFSSF DFARQALIPL PEHIRQFPRL GRLAGRASTL SLEGLMALHP DLVVDCGNTD
ETWISQARQV SEQTQIPWLL LNGKLEQSAE QLTTLGKTLG EEHRAAEQAN LASRFVGEAQ
AFATSPAANL SFYAARGPRG LETGLQGSLH TEAAELLGLH NVAQIADRHG LTQVSMENLL
RWQPDIILVQ EAVTADFIRR DPLWQGVKAV AEQRILFLSG LPFGWLDAPP GINRLLGLRR
LHAWLDPAIN RQFKSDMQHY AQLFWHCSLS DADYQKLVAS