Gene EcSMS35_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3808 
Symbol 
ID6143448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3874090 
End bp3875082 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID641618634 
Productiron chelate ABC transporter permease 
Protein accessionYP_001745774 
Protein GI170682306 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0609] ABC-type Fe3+-siderophore transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0360927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCAAGG ATTCACTCTC TTCCGCCAGG GTCTTTATGG GGCTATCACT ATTATTGCTC 
GCTCTGGTGC TGTTTGGTGC CAGTCAGGGA GCGTTAAAGA TCAGTTTTGA TGCCCTTTTT
GATGAGGAAT ACCGCGATAT CTGGCTCAAT ATTCGTCTAC CAAGGGTTTT GCTGGCGGTG
CTGGTAGGTG CAGCGTTGGC AACCGCAGGG GTAATTATGC AAGGGCTTTT TCGCAACCCA
ATGGCTGACC CTGGATTACT TGGCGTCAGT AGCGGTTCTG CATTAATGGT TGGCGTTGCT
ATTGTGCTGC CCTTCTCCTT CCCAGTGGTG CTGGTGCTCT ATGAGCAAAT GGTGTTTGCC
ATTGCCGGAA GTTTAGTGGT CTGCACCATC ATTTTTCTCA TCACGCAGCG CCATCGCGAT
GGCAGCATGA TGCAATTATT ACTCGCCGGT ATCGCCATCA ATGCCCTGTG CGGCGCAGCG
ATCGGCATCC TGAGCTATAT CGGCGATGAG CAGCAGCTAC GACAACTCAC ATTGTGGATG
ATGGGCAATC TTGGACAGGC GCAATGGCCG ACGTTATTGG TTGCCAGTTC ATTCATCCTA
CCGGCCATTA TCGCAACAAC TTGTCTCGCC GGAACGCTGA ATTTACTGCA GCTCGGTGAT
GAAGAAGCCC ACTACCTCGG CGTGAACGTT AAGCGTAAAC GCCAGCAATT ACTGTTAGTG
AGCTCACTGC TCGTTGGTGC CGCCGTATCG GTAAGCGGCA TTATCGGCTT TATTGGCCTG
GTGATCCCGC ATCTGATTCG CATGACTACC GGGGCAAATC ACCGCTGGCT AATCCCTTGT
TCCGCCCTCG CCGGAGCCTG TTTATTGCTG ATGGCAGACA CGCTTGCCCG CACGCTGGTA
CAGCCAGCAG AAATGCCCGT GGGATTATTA ACCAGCCTGC TTGGTGGCCC TTATTTTATG
TGGTTGATTC TGCGCAACCG GAGGATCACA TGA
 
Protein sequence
MLKDSLSSAR VFMGLSLLLL ALVLFGASQG ALKISFDALF DEEYRDIWLN IRLPRVLLAV 
LVGAALATAG VIMQGLFRNP MADPGLLGVS SGSALMVGVA IVLPFSFPVV LVLYEQMVFA
IAGSLVVCTI IFLITQRHRD GSMMQLLLAG IAINALCGAA IGILSYIGDE QQLRQLTLWM
MGNLGQAQWP TLLVASSFIL PAIIATTCLA GTLNLLQLGD EEAHYLGVNV KRKRQQLLLV
SSLLVGAAVS VSGIIGFIGL VIPHLIRMTT GANHRWLIPC SALAGACLLL MADTLARTLV
QPAEMPVGLL TSLLGGPYFM WLILRNRRIT