Gene EcSMS35_3302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3302 
Symbol 
ID6143607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3377563 
End bp3379035 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content48% 
IMG OID641618132 
Productmannitol dehydrogenase 
Protein accessionYP_001745282 
Protein GI170683628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0246] Mannitol-1-phosphate/altronate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.059304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGGA ACGAAAGTAT CGCTGCCGCC CACTATGATC AGGTAACCAC CTGGCCTCGG 
GACGGGTTAC AGGCAGATAT TGTCCATATT GGTTTTGGCG CTTTTCATCG CGGACACCAG
GCTGTCTACA CGGATCTCAC TAATCAACTT TCGGACACCC GCTGGGGGAT CTTTGAGATC
AACCTGTTTG GTGATGCTCA ACTGATCGAA AACTTAAATG CGCAAAATGG GCTGTTTTCG
GTTGTGGAAA CATCTGCATC GCAATCTACC TCACGCCTTG TGCGTTCGGT GGCTGGCGGT
ATTCATACCC CCAGAGATGG CATTGCCGCA GCAATCCATA AACTGGCTGA ACCTCAGGTA
AAAATCGTTT CATTAACCAT CACCGAGAAA GGTTATTGCC TCGATCCGCA AACACGTTCA
CTTGATCTCA CCAATGGATT AATCCAACAC GATCTACAAA ACCCGGATGC GCCTCTGTCA
GCTATTGGCG TGATCGTCTG TGCTTTGCAA CAACGTAAAG CGACAGGACT CGCCGCTTTT
AGTGTGCTCT CCTGTGACAA CTTGCCAGAC AATGGGCATC TGACGCGCAA TGCCGTTCTC
GGTTTTGCCC GACAACTGGA CCAGCCCCTG GCTCAGTGGA TCGAAGAAAA TGTCTCATTT
CCAGGCACGA TGGTAGATCG CATTGTTCCG GCAATGACTG AATCGCAATT CGCCTTACTG
GAAACGAAAA CCGGTTATGC CGATCCCTGC GGGATCGTCT GCGAATCATT TCGTCAGTGG
GTGATCGAAG ATAATTTTGT GCGCGGACGA CCACAATGGG ATAAAGCCGG CGCGATGTTT
GTCAGCAATG TTCAGCCTTA TGAAGAGATG AAGTTACGCA TGTTAAATGG TAGCCATTCA
TTTCTGGCTT ATAACGGCTC GCTGGCAGGC TATGAGTTTA TCTGGCAATG CATGGAAGAC
GCTAATTTTC GTTCCATTAC CCACCAACTG ATGATTAATG AACAAGCCCG AACACTTAAT
CCAGACTTAA ATATCAACAT CCAGGAATAC GCCGACCTGT TAATTGAACG CTTTAGCAAC
CGTAACGTTG CACACCGTAC CGGGCAAATA GCCATGGACG GTTCACAAAA GCTTCCCCAG
CGAGCGCTGA CGCCCTGGCT GAAATTGCAT CAGCAAAAGC AAAACAATGC TGTTCTGTCA
CTGCTCATTG CTGGTTGGTT GCATTATGTC ATTGATGCTG TTGAGAAAAG CCAGTCTGCC
GCTGATCCAA TGAATGACCA ATTTCAGGCG CTAATAAAGG AACAACAAGA CGCATGGCAA
CAGGCGCTCG CATTACTGCA CCTTAGCGCC ATATTTGGTG ATTTAAGCAA CCATCAGCCA
TTTATAAATG AAATAAAAAT CGCCTTTGCG AATATAAAAA ACAAAGGCAT CAAGGCCACC
ATCAGCCAAT TATTATCGGA TGAGCAGAAA TGA
 
Protein sequence
MSGNESIAAA HYDQVTTWPR DGLQADIVHI GFGAFHRGHQ AVYTDLTNQL SDTRWGIFEI 
NLFGDAQLIE NLNAQNGLFS VVETSASQST SRLVRSVAGG IHTPRDGIAA AIHKLAEPQV
KIVSLTITEK GYCLDPQTRS LDLTNGLIQH DLQNPDAPLS AIGVIVCALQ QRKATGLAAF
SVLSCDNLPD NGHLTRNAVL GFARQLDQPL AQWIEENVSF PGTMVDRIVP AMTESQFALL
ETKTGYADPC GIVCESFRQW VIEDNFVRGR PQWDKAGAMF VSNVQPYEEM KLRMLNGSHS
FLAYNGSLAG YEFIWQCMED ANFRSITHQL MINEQARTLN PDLNINIQEY ADLLIERFSN
RNVAHRTGQI AMDGSQKLPQ RALTPWLKLH QQKQNNAVLS LLIAGWLHYV IDAVEKSQSA
ADPMNDQFQA LIKEQQDAWQ QALALLHLSA IFGDLSNHQP FINEIKIAFA NIKNKGIKAT
ISQLLSDEQK