Gene EcSMS35_1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1623 
Symbol 
ID6145606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1612446 
End bp1613906 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content50% 
IMG OID641616499 
Productmannitol dehydrogenase family protein 
Protein accessionYP_001743677 
Protein GI170679858 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0246] Mannitol-1-phosphate/altronate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000610622 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACGCTCCCTG TTTATGATCG TAATAACCTG 
GCCCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGTGTGTAT
GCCGATATTC TTGCTACGGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAATTTAATC
GGCGGCGAAC AGCAAATTGC CGATTTACAA CAGCAAGATA ATCTTTATAC CGTTGCGGAA
ATGTCGGCCG ATGCGTGGAC GGCTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA
CAGATTGATG GCTTAGAAAC CGTGTTGGCT GCGATGTGTG AACCGCAAAT CGCGATTGTC
TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC
GATCACCCGA TGGTCGTTGC CGACGTACAA AATCCCCACC AGCCGAAAAC TGCAACAGGG
GTGATTGTCG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG
TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCG
CAAGCTGTTG ATGTAAAACT GGCACAATGG ATCGAAGAAA ACGTGACTTT CCCATCAACA
ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAACTT
ACCGGTGTGC GCGATCCTGC TGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATAGAA
GATAACTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCCGAACT GGTTAGTGAT
GTGCTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG
TATCTGGGTT ATCTTGCCGG ATATCAGCAC ATTAATGACT GCATGGAAGA TGAACATTAT
CGTCATGCAG CGTATGCCTT GATGTTGCAG GAACAAGCGC CGACGTTGAA AGTGCAGGGC
GTTGATTTGC AAGATTACGC TAACCGATTA ATTGAACGCT ATAGCAACCC GGCGCTACGT
CACCGAACCT GGCAGATTGC GATGGATGGC AGCCAGAAAT TGCCACAGCG GATGTTGGAT
TCTGTTCGCT GGCATCTGGC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG
GGTTGGATGC GCTATGTCGG TGGTGTTGAT GAACAGGGAA ATCCGATTGA AATCAGTGAT
CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC
CAGTCATTGC TGGCGATTAA GGCGATCTTT GGTGATGATT TGCCAGACAA TAGCTTGTTT
ACTGCAAAAG TGACGGAAGC GTACTTGTCT TTATTAGCGC ATGGTGCGAA AGCGACCGTG
GCGAAATATT CCGTGAAGTA A
 
Protein sequence
MGNNLLSAKA TLPVYDRNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI 
GGEQQIADLQ QQDNLYTVAE MSADAWTARV VGVVKKALHV QIDGLETVLA AMCEPQIAIV
SLTITEKGYF HSPATGQLML DHPMVVADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM
SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEENVTFPST MVDRIVPAVT EDTLAKIEQL
TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA
YLGYLAGYQH INDCMEDEHY RHAAYALMLQ EQAPTLKVQG VDLQDYANRL IERYSNPALR
HRTWQIAMDG SQKLPQRMLD SVRWHLAHDS KFDLLALGVA GWMRYVGGVD EQGNPIEISD
PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GDDLPDNSLF TAKVTEAYLS LLAHGAKATV
AKYSVK