Gene SbBS512_E4352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4352 
Symbol 
ID6270716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4068325 
End bp4069335 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content53% 
IMG OID641728160 
Productoxidoreductase, zinc-binding dehydrogenase family 
Protein accessionYP_001882573 
Protein GI187732291 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAA TTACCCAGGT TTTATTTTCA GATATTGGGA AAGTCACCAC TCAATATGTT 
GAAGTACCAC ACCAGGAACT TAAACCGCAC GAAGTGCGGA TTGCGCCTGT GTTCTATGGG
ATATGCGGTT CGGATCTGCA TGTTCTGAAA GGCGGTCATC CGTTTGCCAA ACCACCTGTC
GTCCCCGGTC ATGAAATTGC AGCGCGCGTT ACGGAAGTTG GCAGCGACGT TAAAAATGTA
CAGCCGGGCG ATCATGTTGT GGTCGATCCC ATCATGGCTT GCATGGAATG CCGAGCCTGC
AAAGCAGGAC GTTTTAATCT TTGTGAACCA CCGCAGGTTG CTAGTTTTCG CGCACCGGGC
TTTGCTCGCT CACAACACAT TGTTCCTGCG CGTAATTGCC ATGTCGCACC AGCCTCTTTA
CCGCTAAAAG TGTTGGCCTT TGCCGAACCG GCGGCTTGTG CCCGTCACTG CGTTAACCGA
ATGCCGAAAG CTTCTCTGGA AAGCGTACTG GTAATTGGTG CCGGAACGAT AGGCTTATCC
ATCGTGCAGG CACTGCGCAT TATGGGGGCA GGTAAGATTA CCGTGATTGA ACCTGACGCT
GCCAAACGCG CGCTGGCGTT AAAGCTGGGC GCAGCAGAAG TTTGGGCACT AGGTGAGCTG
GCCGCAGATG TGCGATTTAC GGGGGCGATT GATGTCGTTG CAGCGCAGGC CACGCTTAAC
GATGCATGTA CCCGTGTATA TGCCGGAGGC ACCGTCGTGT GCATGGGCGT ACCAAGTGGG
CCGCGTGAAA TACCATTACC GATGATGCAA CGTTTCGAGC GTGACTTGCT CAACTCTGGC
ATGTACATCC CTGAAGATTT CGATGCTGTT ATCGAATGGC TGGCGGATGG GCGGTTTGAT
ACCAGTGAAC TGGTTACCGA TTTATTTGCC ATTGAGGATG CAGCGGCGGC ATTTGAACGC
GCGCAGCAAA ATGACTCCAT AAAGGTCATG CTGCAATTTG CGCCGGAATG A
 
Protein sequence
MDKITQVLFS DIGKVTTQYV EVPHQELKPH EVRIAPVFYG ICGSDLHVLK GGHPFAKPPV 
VPGHEIAARV TEVGSDVKNV QPGDHVVVDP IMACMECRAC KAGRFNLCEP PQVASFRAPG
FARSQHIVPA RNCHVAPASL PLKVLAFAEP AACARHCVNR MPKASLESVL VIGAGTIGLS
IVQALRIMGA GKITVIEPDA AKRALALKLG AAEVWALGEL AADVRFTGAI DVVAAQATLN
DACTRVYAGG TVVCMGVPSG PREIPLPMMQ RFERDLLNSG MYIPEDFDAV IEWLADGRFD
TSELVTDLFA IEDAAAAFER AQQNDSIKVM LQFAPE