Gene BAS0641 details


Gene Information

Locus tag: BAS0641
Symbol:
ID: 2849808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 693271
End bp: 694323
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 38%
IMG OID: 637503882
Product: alcohol dehydrogenase, zinc-containing
Protein accession: YP_026918
Protein GI: 49183666
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
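
The start, end, gene length, and protein length reported above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both readings are assumptions, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 693271
end_bp = 694323

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and adds no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, leftover_bases, protein_length_aa)
```

Under these assumptions the computed values match the "Gene Length" and "Protein Length" fields.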


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCAT TACTTTGGCA TAATCAACGT GATGTAAGAG TAGAAGAAGT ACCAGAACCA 
ACTGTAAAAC CAGGAGCAGT TAAGATTAAA GTTAAATGGT GTGGTATCTG CGGGACAGAC
TTGCATGAAT ATTTAGCAGG ACCTATTTTT ATTCCGACAG AAGAGCATCC ATTAACACAT
GTAAAAGCAC CGGTTATTTT AGGTCATGAG TTTAGCGGTG AGGTAGTTGA AATCGGTGAA
GGCGTTACAT CTCATAAAGT GGGAGACCGC GTCGTTGTAG AACCAATTTA TTCTTGTGGT
AAATGTGAAG CTTGTAAACA TGGACATTAC AATGTTTGTG AACAACTTGT TTTCCACGGT
CTTGGCGGAG AAGGCGGCGG TTTCTCTGAA TATACAGTAG TACCAGAAGA TATGGTTCAC
CATATTCCAG ATGAAATGAC GTATGAACAA GGTGCTCTTG TAGAACCAGC AGCAGTAGCG
GTTCATGCAG TACGTCAAAG TAAATTAAAA GAAGGGGAAG CAGTAGCAGT CTTTGGTTGT
GGTCCAATTG GACTTCTTGT TATCCAAGCA GCTAAAGCAG CAGGAGCAAC TCCTGTTATT
GCAGTTGAAC TTTCTAAAGA ACGTCAAGAG TTAGCAAAAT TAGCAGGTGC GGATTACGTA
TTGAATCCAG CGACTCAAGA TGTATTAGCT GAAATTCGCA ACTTAACAAA TAGTTTAGGT
GTAAATGTAA GCTTTGAAGT AACTGGTGTT GAAGTAGTAC TTCGTCAAGC AATTGAAAGC
ACAAGCTTTG AAGGACAAAC TGTAATTGTT AGTGTATGGG AAAAAGACGC AACAATTACT
CCAAATAATC TTGTACTAAA AGAAAAAGAA GTGGTTGGTA TTTTAGGATA CCGTCATATC
TTCCCAGCTG TTATTAAATT AATTAGCTCT GGTCAAATTC AAGCAGAGAA ATTAATTACG
AAAAAAATCA CAGTAGATCA AGTTGTTGAA GAAGGATTTG AAGCACTTGT AAAAGATAAA
ACACAAGTGA AAATTCTTGT TTCACCTAAA TAA
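
The record does not say how its GC content figure was derived. A common definition, assumed here, is the share of G and C bases among all bases in the gene sequence. The helper below is a hypothetical sketch of that definition; it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_content_percent("ATGAAAGCAT"))
```

Passing the full gene sequence as a single string gives the figure for the whole gene under this definition.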
 
Protein sequence
MKALLWHNQR DVRVEEVPEP TVKPGAVKIK VKWCGICGTD LHEYLAGPIF IPTEEHPLTH 
VKAPVILGHE FSGEVVEIGE GVTSHKVGDR VVVEPIYSCG KCEACKHGHY NVCEQLVFHG
LGGEGGGFSE YTVVPEDMVH HIPDEMTYEQ GALVEPAAVA VHAVRQSKLK EGEAVAVFGC
GPIGLLVIQA AKAAGATPVI AVELSKERQE LAKLAGADYV LNPATQDVLA EIRNLTNSLG
VNVSFEVTGV EVVLRQAIES TSFEGQTVIV SVWEKDATIT PNNLVLKEKE VVGILGYRHI
FPAVIKLISS GQIQAEKLIT KKITVDQVVE EGFEALVKDK TQVKILVSPK
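
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds a codon table in the conventional TCAG ordering; the amino acid assignments of table 11 are the same as those of the standard code, and the tables differ in which codons may act as start codons. That difference does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the whitespace used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAAAGCAT TACTTTGGCA TAATCAACGT"))
# Last 33 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACACAAGTGA AAATTCTTGT TTCACCTAAA TAA"))
```

The two outputs can be compared with the first and last ten residues of the protein sequence above.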