Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1147 |
Symbol | gatD |
ID | 6269023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1038734 |
End bp | 1039774 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641725276 |
Product | galactitol-1-phosphate dehydrogenase |
Protein accession | YP_001879794 |
Protein GI | 187733813 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAG TGGTGAATGA TACTGATGGT ATCGTGCGCG TTGCAGAAAG CGTCATTCCT GAAATTAAAC ATCAGGATGA GGTGCGGGTA AAAATTGCCA GCTCGGGCTT ATGTGGTTCC GATTTACCCA GGATATTTAA AAATGGTGCA CATTATTATC CAATAACGTT AGGCCATGAA TTTAGCGGCT ATATTGATGC GGTGGGATCC GGTGTTGATG ATTTACACCC TGGCGATGCG GTTGCCTGTG TGCCGTTATT ACCCTGTTTT ACTTGTCCAG AGTGTCTGAA AGGGTTTTAT TCCCAGTGCG CAAAATATGA TTTTATTGGC TCGCGGCGTG ATGGTGGATT TGCTGAATAT ATTGTCGTTA AGCGAAAAAA TGTCTTTGCT CTACCCACGG ATATGCCTAT TGAGGATGGG GCTTTTATTG AGCCGATTAC CGTTGGTCTG CATGCTTTTC ATTTAGCGCA AGGTTGTGAG AATAAAAACG TTATTATTAT TGGTGCCGGA ACCATTGGCC TGCTGGCTAT TCAGTGCGCT GTCGCGCTGG GAGCAAAGAG TGTGACGGCT ATCGACATTA GCTCAGAAAA ACTGGCACTG GCAAAATTTT TCGGTGCGAT GCAAACATTT AACAGTAGCG AAATGAGCGC GCCGCAAATG CAGAGCGTTT TACGCGAACT GCGTTTTAAT CAGCTTATCC TCGAGACGGC TGGCGTACCG CAAACTGTCG AACTGGCGGT AGAGATTGCC GGTCCTCATG CCCAACTGGC GCTGGTGGGC ACGTTGCATC AGGATCTGCA TTTAACATCG GCAACGTTTG GCAAAATATT GCGTAAAGAG CTGACGGTTA TCGGCAGTTG GATGAACTAT TCCAGCCCTT GGCCGGGGCA GGAGTGGGAA ACGGCGAGCC GGTTGCTGAC AGAACGTAAG TTAAGCCTGG AGCCATTAAT CGCTCACCGT GGAAGCTTTG AAAGCTTCAC CCAGGCGGTG CGTGACATCG CTCGTAATGC CATGCCGGGC AAAGTGTTGC TCATTCCATG A
|
Protein sequence | MKSVVNDTDG IVRVAESVIP EIKHQDEVRV KIASSGLCGS DLPRIFKNGA HYYPITLGHE FSGYIDAVGS GVDDLHPGDA VACVPLLPCF TCPECLKGFY SQCAKYDFIG SRRDGGFAEY IVVKRKNVFA LPTDMPIEDG AFIEPITVGL HAFHLAQGCE NKNVIIIGAG TIGLLAIQCA VALGAKSVTA IDISSEKLAL AKFFGAMQTF NSSEMSAPQM QSVLRELRFN QLILETAGVP QTVELAVEIA GPHAQLALVG TLHQDLHLTS ATFGKILRKE LTVIGSWMNY SSPWPGQEWE TASRLLTERK LSLEPLIAHR GSFESFTQAV RDIARNAMPG KVLLIP
|
| |