Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0973 |
Symbol | gatD |
ID | 6142996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 983209 |
End bp | 984249 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615860 |
Product | galactitol-1-phosphate dehydrogenase |
Protein accession | YP_001743052 |
Protein GI | 170680364 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.465963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAG TGGTGAATGA TACTGATGGT ATCGTGCGCG TAGCAGAAAG CGTCATTCCT GAAATTAAAC ATCAGGATGA GGTGCGGGTA AAAATTGCCA GCTCGGGCTT ATGTGGTTCC GATTTACCCA GGATATTTAA AAATGGTGCA CATTATTATC CAATAACGTT AGGCCATGAA TTTAGCGGCT ATATTGATGC GGTGGGATCC GGTATTGATG ATTTACATCC CGGCGATGCG GTTGCCTGTG TGCCGTTATT ACCCTGTTTT ACTTGTCCAG AGTGTCTGAA AGGGTTTTAT TCCCAGTGCG CAAAATATGA TTTTATTGGC TCACGGCGTG ATGGTGGATT TGCCGAATAT ATTGTCGTTA AGCGAAACAA TGTCTTTGCA CTACCTGCGG ATATGCCTAT TGAGGATGGC GCTTTTATTG AACCGATTAC CGTTGGCCTG CATGCGTTCC ATTTAGCGCA AGGATGTGAG AATAAAAACG TTATTATTAT TGGTGCCGGA ACCATTGGCC TGCTGGCAAT TCAGTGCGCT GTCGCGCTGG GAGCAAAGAG TGTGACGGCT ATCGACATTA GCTCAGAAAA ACTGGCACTG GCAAAATCTT TCGGTGCGAT GCAAACATTT AACAGCCGTG AAATGAGCGC GCCGCAAATA CAGGGCGTTT TACGCGAGCT GCGCTTTAAT CAGCTTATCC TCGAGACGGC TGGCGTCCCG CAAACCGTCG AACTGGCGGT AGAGATTGCC GGTCCTCATG CCCAACTGGC GCTGGTGGGC ACGTTGCATC AGGATCTGCA TTTAACATCG GCAACGTTTG GCAAAATATT GCGTAAAGAG CTGACGGTGA TTGGCAGTTG GATGAACTAC TCCAGCCCTT GGCCGGGGCA GGAGTGGGAA ACGGCGAGCC GGTTGCTGAC AGAACGTAAG TTAAGCCTGG AGCCATTAAT CGCTCACCGT GGAAGCTTTG AAAGCTTCGC CCAGGCGGTG CGTGACATCG CTCGTAATGC TATGCCGGGC AAAGTGTTGC TCATTCCCTG A
|
Protein sequence | MKSVVNDTDG IVRVAESVIP EIKHQDEVRV KIASSGLCGS DLPRIFKNGA HYYPITLGHE FSGYIDAVGS GIDDLHPGDA VACVPLLPCF TCPECLKGFY SQCAKYDFIG SRRDGGFAEY IVVKRNNVFA LPADMPIEDG AFIEPITVGL HAFHLAQGCE NKNVIIIGAG TIGLLAIQCA VALGAKSVTA IDISSEKLAL AKSFGAMQTF NSREMSAPQI QGVLRELRFN QLILETAGVP QTVELAVEIA GPHAQLALVG TLHQDLHLTS ATFGKILRKE LTVIGSWMNY SSPWPGQEWE TASRLLTERK LSLEPLIAHR GSFESFAQAV RDIARNAMPG KVLLIP
|
| |