Gene EcSMS35_3060 details


Gene Information

Locus tag: EcSMS35_3060
Symbol:
ID: 6144830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3149691
End bp: 3150734
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 45%
IMG OID: 641617929
Product: solute-binding family 7 protein
Protein accession: YP_001745080
Protein GI: 170679959
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
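
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3149691
end_bp = 3150734
gene_length_bp = 1044
protein_length_aa = 347

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop codon.
codons = span_bp // 3
coded_residues = codons - 1

print(span_bp, codons, coded_residues)  # 1044 348 347
```

Under these assumptions the span matches the stated gene length, and 348 codons minus one stop codon gives the stated 347 residues.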


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.591808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.298604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTACC GACAATTACT CCCCTTTCTT TTATCAGGAT TTCTTCTGTT ATCTCCCTCG 
ATTTTTGCTG CTGAAAAAAT TCTTCGTTAT ACAGACCATG AACCCTACGG GGGAATGCGT
ACACAAATAA TTAAAGAAAT ATTTTTTGCA GAAATTGAAA AAGAGTCGCA GGGGCGTTTG
AAAATCGAAC CACACTGGAA CGGTGAAACC GCCATCAGCT ATGACGCTCT GACGACAATA
AGCGATGGCA GCAAAGCTGA TATGGGTATA GTGGTGCCGG AATACACCGC GAAACAATTG
CCGCTTCATC AAATCTTCAA GAGCTTTGCT ATTGGCCCGG ATCATGGAGC CAGTCAGGTA
GAATTCTTTC GTCGCGTATA TGCGGAAATT CCCGAATTTA ACGCTGAACT TGAGCGTAAC
AATATCGTGA ATTTACAGTT TTTCCTTGGC TACCCGGTAG GCTTTTTCTC TACCAGGCCC
ATTGATAAAT TGACTGCGCT TCAGGGAACC ACCTGGCGAA CAGCCAGTTT CTGGCATCGG
GCTTATTTAA CTCATACGGG GGCAAAAACC GTAACTTTAC CGTGGAATGA TCAAATAACT
AAAGCACTCA TGGATGGAAA ACTGGATGGT TTAATGGTCA ATCTCGATAG CGGATATGAC
ATCCATGCTG AACGTGCTGC GCCGAATGTG TTGCTCTCAC CTTCTCTCTG GCTTGGTCAT
GTTTATCTGT TGGTAATGAA TAAACAATCG TGGGAAAACC TTGATAACAG AGATCGTGAG
GCTATTCAAC GAGCTGCCAT TACAACCGAG AAAGCACTGG GCAAGGCATT AGATAACAAC
CTGATCAGCA TGGTAAAAAC GCTTGAGCAG GAAGGTGCAC AGGTTCGCTA TCTGAAAAAA
TCAGGGCTGG ACGCCTGGCA GAAAGCGATC GGTTATCAGC AAGAACAAGC ACAGTGGGTA
GAAAAGCAAA ATAAGGAAGG CGTGGAGAAA GCCGGGGAAG TCATGCAAAA AGTTGCCAAT
ATACTCGATG AAACAATGCG TTAA
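
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names `normalize` and `gc_percent` are hypothetical, and the database may compute or round its value differently.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGCACTACC GACAATTACT CCCCTTTCTT TTATCAGGAT TTCTTCTGTT ATCTCCCTCG"
print(len(normalize(first_line)))        # 60 bases on a full line
print(round(gc_percent(first_line), 1))  # GC percentage of this line only
```

Passing the full 1044 bp sequence to `gc_percent` gives a figure that can be compared with the listed GC content.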
 
Protein sequence
MHYRQLLPFL LSGFLLLSPS IFAAEKILRY TDHEPYGGMR TQIIKEIFFA EIEKESQGRL 
KIEPHWNGET AISYDALTTI SDGSKADMGI VVPEYTAKQL PLHQIFKSFA IGPDHGASQV
EFFRRVYAEI PEFNAELERN NIVNLQFFLG YPVGFFSTRP IDKLTALQGT TWRTASFWHR
AYLTHTGAKT VTLPWNDQIT KALMDGKLDG LMVNLDSGYD IHAERAAPNV LLSPSLWLGH
VYLLVMNKQS WENLDNRDRE AIQRAAITTE KALGKALDNN LISMVKTLEQ EGAQVRYLKK
SGLDAWQKAI GYQQEQAQWV EKQNKEGVEK AGEVMQKVAN ILDETMR
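
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position
# (the layout used in NCBI genetic code listings).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna_text.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        # Codons with unexpected characters (for example N) become "X".
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCACTACC GACAATTACT"))  # MHYRQL (start of the gene)
print(translate("GAAACAATGC GTTAA"))       # ETMR (end of the gene, before TAA)
```

Both fragments agree with the first and last residues of the protein sequence above. Translating the full gene sequence and comparing its length with the "Protein Length" field is a quick consistency check.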