Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4352 |
Symbol | |
ID | 6270716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4068325 |
End bp | 4069335 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728160 |
Product | oxidoreductase, zinc-binding dehydrogenase family |
Protein accession | YP_001882573 |
Protein GI | 187732291 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAA TTACCCAGGT TTTATTTTCA GATATTGGGA AAGTCACCAC TCAATATGTT GAAGTACCAC ACCAGGAACT TAAACCGCAC GAAGTGCGGA TTGCGCCTGT GTTCTATGGG ATATGCGGTT CGGATCTGCA TGTTCTGAAA GGCGGTCATC CGTTTGCCAA ACCACCTGTC GTCCCCGGTC ATGAAATTGC AGCGCGCGTT ACGGAAGTTG GCAGCGACGT TAAAAATGTA CAGCCGGGCG ATCATGTTGT GGTCGATCCC ATCATGGCTT GCATGGAATG CCGAGCCTGC AAAGCAGGAC GTTTTAATCT TTGTGAACCA CCGCAGGTTG CTAGTTTTCG CGCACCGGGC TTTGCTCGCT CACAACACAT TGTTCCTGCG CGTAATTGCC ATGTCGCACC AGCCTCTTTA CCGCTAAAAG TGTTGGCCTT TGCCGAACCG GCGGCTTGTG CCCGTCACTG CGTTAACCGA ATGCCGAAAG CTTCTCTGGA AAGCGTACTG GTAATTGGTG CCGGAACGAT AGGCTTATCC ATCGTGCAGG CACTGCGCAT TATGGGGGCA GGTAAGATTA CCGTGATTGA ACCTGACGCT GCCAAACGCG CGCTGGCGTT AAAGCTGGGC GCAGCAGAAG TTTGGGCACT AGGTGAGCTG GCCGCAGATG TGCGATTTAC GGGGGCGATT GATGTCGTTG CAGCGCAGGC CACGCTTAAC GATGCATGTA CCCGTGTATA TGCCGGAGGC ACCGTCGTGT GCATGGGCGT ACCAAGTGGG CCGCGTGAAA TACCATTACC GATGATGCAA CGTTTCGAGC GTGACTTGCT CAACTCTGGC ATGTACATCC CTGAAGATTT CGATGCTGTT ATCGAATGGC TGGCGGATGG GCGGTTTGAT ACCAGTGAAC TGGTTACCGA TTTATTTGCC ATTGAGGATG CAGCGGCGGC ATTTGAACGC GCGCAGCAAA ATGACTCCAT AAAGGTCATG CTGCAATTTG CGCCGGAATG A
|
Protein sequence | MDKITQVLFS DIGKVTTQYV EVPHQELKPH EVRIAPVFYG ICGSDLHVLK GGHPFAKPPV VPGHEIAARV TEVGSDVKNV QPGDHVVVDP IMACMECRAC KAGRFNLCEP PQVASFRAPG FARSQHIVPA RNCHVAPASL PLKVLAFAEP AACARHCVNR MPKASLESVL VIGAGTIGLS IVQALRIMGA GKITVIEPDA AKRALALKLG AAEVWALGEL AADVRFTGAI DVVAAQATLN DACTRVYAGG TVVCMGVPSG PREIPLPMMQ RFERDLLNSG MYIPEDFDAV IEWLADGRFD TSELVTDLFA IEDAAAAFER AQQNDSIKVM LQFAPE
|
| |