Gene EcSMS35_4302 details


Gene Information

Locus tag: EcSMS35_4302
Symbol: kdgT
ID: 6146417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4404112
End bp: 4405101
Gene length: 990 bp
Protein length: 329 aa
Translation table: 11
GC content: 53%
IMG OID: 641619123
Product: 2-keto-3-deoxygluconate permease
Protein accession: YP_001746247
Protein GI: 170682314
COG category:
COG ID:
TIGRFAM ID: [TIGR00793] 2-keto-3-deoxygluconate transporter
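
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 4404112
END_BP = 4405101


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 990 bp
print(length_aa, "aa")  # 329 aa
```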


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.405227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.00796393
Fosmid hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAATGC AGATAAAACG CTCGATTGAG AAAATCCCGG GGGGGATGAT GCTCGTCCCG 
CTATTCCTTG GCGCACTGTG CCACACCTTC TCGCCGGGGG CGGGGAAATA TTTTGGATCA
TTCACCAACG GGATGATTAC CGGTACGGTG CCCATTCTGG CGGTGTGGTT TTTTTGCATG
GGGGCGTCAA TAAAATTAAG CGCGACGGGA ACGGTATTGC GTAAATCCGG CACGCTGGTG
GTAACAAAAA TTGCCGTCGC GTGGGTGGTT GCGGCGATTG CCTCGCGCAT TATTCCGGAA
CATGGTGTTG AAGTTGGATT CTTTGCCGGA CTTTCAACGC TGGCGCTGGT GGCGGCGATG
GATATGACCA ACGGCGGACT TTACGCTTCC ATCATGCAGC AGTACGGCAC AAAAGAAGAA
GCTGGGGCAT TTGTGTTGAT GTCATTGGAG TCCGGGCCGC TCATGACGAT GATTATTCTG
GGCACTGCCG GGATTGCCTC GTTTGAACCG CATGTTTTCG TCGGCGCAGT ATTACCGTTC
CTGGTGGGCT TTGCCCTTGG GAACCTTGAC CCTGAATTAC GAGAATTTTT CAGCAAAGCG
GTGCAGACGC TGATTCCATT CTTTGCCTTC GCGCTGGGCA ATACCATTGA TTTGACTGTA
ATTGCCCAGA CAGGTTTGCT GGGGATCCTG TTGGGTGTGG CAGTAATTAT CGTGACCGGT
ATTCCGTTGA TTATCGCTGA TAAATTGATT GGCGGTGGCG ATGGCACTGC CGGAATTGCC
GCTTCCAGTT CCGCAGGGGC CGCGGTAGCG ACACCTGTGC TGATTGCAGA AATGGTGCCT
GCGTTTAAAC CGATGGCTCC GGCAGCAACT TCGCTGGTAG CGACGGCGGT CATTGTGACT
TCGATTCTGG TGCCAATTCT TACCTCTATC TGGTCACGTA AAGTCAAAGC CAGAGCGGCG
AAAATCGAAA TTTTAGGTAC GGTGAAATAA
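
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before its length or GC content can be checked. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the listing is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGGAAATGC AGATAAAACG CTCGATTGAG AAAATCCCGG GGGGGATGAT GCTCGTCCCG"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```

Passing the whole listing to `clean_sequence` should give a string whose length matches the 990 bp stated in the record.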
 
Protein sequence
MEMQIKRSIE KIPGGMMLVP LFLGALCHTF SPGAGKYFGS FTNGMITGTV PILAVWFFCM 
GASIKLSATG TVLRKSGTLV VTKIAVAWVV AAIASRIIPE HGVEVGFFAG LSTLALVAAM
DMTNGGLYAS IMQQYGTKEE AGAFVLMSLE SGPLMTMIIL GTAGIASFEP HVFVGAVLPF
LVGFALGNLD PELREFFSKA VQTLIPFFAF ALGNTIDLTV IAQTGLLGIL LGVAVIIVTG
IPLIIADKLI GGGDGTAGIA ASSSAGAAVA TPVLIAEMVP AFKPMAPAAT SLVATAVIVT
SILVPILTSI WSRKVKARAA KIEILGTVK
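
The protein sequence can be reproduced from the gene sequence by reading it codon by codon. The sketch below is an illustration rather than the database's own method: it builds the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in which codons may act as starts), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing of the listing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 bases of the gene sequence, copied as printed.
print(translate("ATGGAAATGC AGATAAAACG CTCGATTGAG"))  # MEMQIKRSIE
print(translate("AAAATCGAAA TTTTAGGTAC GGTGAAATAA"))  # KIEILGTVK
```

Both outputs agree with the start and end of the protein sequence listed above.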