Gene EcolC_2116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2116 
Symbol 
ID6067095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2312954 
End bp2314414 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content50% 
IMG OID641601524 
Productmannitol dehydrogenase domain-containing protein 
Protein accessionYP_001725083 
Protein GI170020129 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0246] Mannitol-1-phosphate/altronate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000373303 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACACTCCCTG TTTATGATCG TAATAACCTG 
GCCCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGCGTGTAT
GCCGATATTC TTGCTACAGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAACTTAATC
GGCGGCGAAC AGCAAATTGC CGATTTACAG CAGCAAGATA ATCTTTATAC CGTTGCGGAA
ATGTCGGCCG ATGCGTGGAC GGTTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA
CAGATGGATG GCTTAGAAAC CGTGTTGGCT GCGATGTGTG AACCGCAAAT CGCGATTGTC
TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC
GATCACCCGA TGGTCGCTGC CGACGTGCAA AATCCCCACC AGCCGAAAAC AGCAACAGGG
GTGATTGTTG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG
TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCG
CAAGCCGTTG ATGTAAAACT GGCACAATGG ATCGAAGATA ACGTGACTTT CCCATCAACA
ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAACTT
ACCGGTGTGC GCGATCCTGC TGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATAGAA
GATAATTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCAGAACT GGTTAGCGAT
GTACTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG
TATTTGGGTT ATCTTGCCGG ATATCAGCAC ATTAATGACT GTATGGAAGA TGAACATTAT
CGTCATGCGG CGTATGGCTT GATGTTGCAG GAACAAGCGC CGACGCTGAA AGTGCAGGGC
GTTGATTTGC AGGATTACGC TAACCGATTA ATTGCACGCT ATAGCAACCC GGCGTTACGT
CATCGAACCT GGCAGATTGC GATGGATGGC AGCCAGAAAT TGCCACAGCG GATGTTGGAT
TCTGTTCGCT GGCATCTGAC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG
GGTTGGATGC GTTATGTCGG TGGTGTTGAT GAACAGGAAA ATCCGATAGA AATCAGTGAC
CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC
CAGTCATTGC TGGCGATTAA GGCGATCTTT GGTGATGATT TGCCAGCCAA TAGCTTGTTT
ACTGCAAAAG TGACGGAAGC GTACTTGTCT TTATTAGCGC ATGGCGCGAA AGCGACCGTG
GCAAAATATT CCGTGAAGTA A
 
Protein sequence
MGNNLLSAKA TLPVYDRNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI 
GGEQQIADLQ QQDNLYTVAE MSADAWTVRV VGVVKKALHV QMDGLETVLA AMCEPQIAIV
SLTITEKGYF HSPATGQLML DHPMVAADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM
SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEDNVTFPST MVDRIVPAVT EDTLAKIEQL
TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA
YLGYLAGYQH INDCMEDEHY RHAAYGLMLQ EQAPTLKVQG VDLQDYANRL IARYSNPALR
HRTWQIAMDG SQKLPQRMLD SVRWHLTHDS KFDLLALGVA GWMRYVGGVD EQENPIEISD
PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GDDLPANSLF TAKVTEAYLS LLAHGAKATV
AKYSVK