Gene Information |
Locus tag | EcSMS35_1623 |
Symbol | |
ID | 6145606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1612446 |
End bp | 1613906 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616499 |
Product | mannitol dehydrogenase family protein |
Protein accession | YP_001743677 |
Protein GI | 170679858 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0246] Mannitol-1-phosphate/altronate dehydrogenases |
TIGRFAM ID | |
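The coordinate and length fields above are mutually consistent, and a short check makes the relationship explicit. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the arithmetic matches) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag EcSMS35_1623.
length = gene_length_bp(1612446, 1613906)
print(length)                              # 1461
print(expected_protein_length_aa(length))  # 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.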
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000610622 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACGCTCCCTG TTTATGATCG TAATAACCTG GCCCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGTGTGTAT GCCGATATTC TTGCTACGGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAATTTAATC GGCGGCGAAC AGCAAATTGC CGATTTACAA CAGCAAGATA ATCTTTATAC CGTTGCGGAA ATGTCGGCCG ATGCGTGGAC GGCTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA CAGATTGATG GCTTAGAAAC CGTGTTGGCT GCGATGTGTG AACCGCAAAT CGCGATTGTC TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC GATCACCCGA TGGTCGTTGC CGACGTACAA AATCCCCACC AGCCGAAAAC TGCAACAGGG GTGATTGTCG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCG CAAGCTGTTG ATGTAAAACT GGCACAATGG ATCGAAGAAA ACGTGACTTT CCCATCAACA ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAACTT ACCGGTGTGC GCGATCCTGC TGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATAGAA GATAACTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCCGAACT GGTTAGTGAT GTGCTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG TATCTGGGTT ATCTTGCCGG ATATCAGCAC ATTAATGACT GCATGGAAGA TGAACATTAT CGTCATGCAG CGTATGCCTT GATGTTGCAG GAACAAGCGC CGACGTTGAA AGTGCAGGGC GTTGATTTGC AAGATTACGC TAACCGATTA ATTGAACGCT ATAGCAACCC GGCGCTACGT CACCGAACCT GGCAGATTGC GATGGATGGC AGCCAGAAAT TGCCACAGCG GATGTTGGAT TCTGTTCGCT GGCATCTGGC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG GGTTGGATGC GCTATGTCGG TGGTGTTGAT GAACAGGGAA ATCCGATTGA AATCAGTGAT CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC CAGTCATTGC TGGCGATTAA GGCGATCTTT GGTGATGATT TGCCAGACAA TAGCTTGTTT ACTGCAAAAG TGACGGAAGC GTACTTGTCT TTATTAGCGC ATGGTGCGAA AGCGACCGTG GCGAAATATT CCGTGAAGTA A
|
Protein sequence | MGNNLLSAKA TLPVYDRNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI GGEQQIADLQ QQDNLYTVAE MSADAWTARV VGVVKKALHV QIDGLETVLA AMCEPQIAIV SLTITEKGYF HSPATGQLML DHPMVVADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEENVTFPST MVDRIVPAVT EDTLAKIEQL TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA YLGYLAGYQH INDCMEDEHY RHAAYALMLQ EQAPTLKVQG VDLQDYANRL IERYSNPALR HRTWQIAMDG SQKLPQRMLD SVRWHLAHDS KFDLLALGVA GWMRYVGGVD EQGNPIEISD PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GDDLPDNSLF TAKVTEAYLS LLAHGAKATV AKYSVK
|
| |