Gene Information |
Locus tag | ECH74115_2158 |
Symbol | |
ID | 6968327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2069950 |
End bp | 2071410 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643386053 |
Product | mannitol dehydrogenase family protein |
Protein accession | YP_002270542 |
Protein GI | 209399925 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0246] Mannitol-1-phosphate/altronate dehydrogenases |
TIGRFAM ID | |
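The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2069950
END_BP = 2071410
GENE_LENGTH_BP = 1461
PROTEIN_LENGTH_AA = 486


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon (not counted in the protein).
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2071410 - 2069950 + 1 gives 1461 bp, and 1461 / 3 gives 487 codons, one of which is the stop codon, leaving 486 aa. The strand field does not affect either calculation.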
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000417107 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACACTCCCTG TTTATGATCG TAATAACCTG GCCCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGCGTGTAT GCCGATATTC TTGCTACAGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAACTTAATC GGCGGCGAAC AGCAAATTGC CGATTTACAT CAGCAAGATA ATCTTTATAC CGTTGCGGAA ATGTCGGCCG ATGCGTGGAC GGCTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA CAGATAGATG GCTTAGAAAC CGTGTTGGCT GCGATGTGTG AACCGCAAAT CGCGATTGTC TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC GATCACCCGA TGGTCGCTGC CGACGTGCAA AATCCCCACC AGCCGAAAAC AGCAACAGGG GTGATTGTTG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCA CAAGCCGTTG ATGTAAAACT AGCACAATGG ATCGAAGATA ACGTGACTTT CCCATCAACA ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAGCTT ACCGGTGTGC GCGATCCTGC TGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATCGAA GATAACTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCCGAACT GGTTAGTGAT GTGCTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG TATCTGGGTT ATCTTGCCGG ATATCAGCAC ATTAATGACT GTATGGAAGA TGAACATTAT CGTCATGCGG CGTATGCCTT GATGTTGCAG GAACAAGCGC CGACGCTGAA AGTGCAGGGC GTTGATTTGC AAGATTACGC TAACCGATTA ATTGCACGCT ATAGCAACCC GGCGTTACGT CATCGAACCT GGCAGATTGC GATGGATGGC AGCCAGAAAT TGCCACAGCG GATGTTGGAT TCTGTTCGCT GGCATCTGGC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG GGTTGGATGC GTTATGTCGG TGGTGTTGAT GAACAGGGAA ATCCGATAGA AATCAGTGAC CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC CAGTCATTGC TGGCGATTAA GGCGATTTTT GGTGGTGATT TGCCAGACAA TAGCTTGTTT ACTGCAAAAG TGACGGAAGC GTACTTGTCT TTATTAGCGC ATGGTGCGAA AGCGACCGTG GCGAAATATT CCGTGAAGTA A
|
Protein sequence | MGNNLLSAKA TLPVYDRNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI GGEQQIADLH QQDNLYTVAE MSADAWTARV VGVVKKALHV QIDGLETVLA AMCEPQIAIV SLTITEKGYF HSPATGQLML DHPMVAADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEDNVTFPST MVDRIVPAVT EDTLAKIEQL TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA YLGYLAGYQH INDCMEDEHY RHAAYALMLQ EQAPTLKVQG VDLQDYANRL IARYSNPALR HRTWQIAMDG SQKLPQRMLD SVRWHLAHDS KFDLLALGVA GWMRYVGGVD EQGNPIEISD PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GGDLPDNSLF TAKVTEAYLS LLAHGAKATV AKYSVK
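The relationship between the two sequence fields can be illustrated by translating the gene sequence codon by codon. The sketch below is an assumption-laden illustration, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). It does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering for each position.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so that sequences printed in 10-base blocks,
    as in this record, can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGGGAAATA ATTTGTTATC AGCAAAAGCG"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the protein sequence listed above. The sequence is given 5' to 3' in the coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.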