Gene BAS4389 details


Gene Information

Locus tag: BAS4389
Symbol:
ID: 2851685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4300936
End bp: 4302186
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 38%
IMG OID: 637507626
Product: putative deaminase
Protein accession: YP_030636
Protein GI: 49187384
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
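
The coordinate and length fields above agree with one another, and a short calculation makes that explicit. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4300936
end_bp = 4302186

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1251
print(protein_length)  # 416
```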


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGTGA AATCACAATA TTGGTTAACG AATGTAAAAC TTGAATGTGG GTACGTATAT 
GAGGAATCAC GAATTACAAG TACAGAAACG GAAATTTGTA GTTTATTTAT TGAAGATGGA
AAGATTACGA ACATATTACC TGGAATTGTA TCAGAGCAAG ATGGTGAAGT AGTAAATGCA
AATGGTTTAC TGGCATTACC AGCATTTGAA GAAATGCATA TCCATATTGA TAAGACGTAT
TATGGTGGAC CATGGAAAGC GTGTACACCA GTAAAGAGTA TATTCACTCG TATTCATGAG
GAGCAAACTA TTCTTCCGAA GCAATTAGAA ACTGCAAAGA GTAGAGCGGA GAAAATGTTG
CAATTACTTC TTGAAAATGG AGCGACAAAT ATTCGAACAC ATTGCAATAT TGATCCTGTA
ATTGGTCTTG GGAATTTAGA AGCAACGATT GCGGCACTGG AAACATATAA AGATAAGTTA
TCAGCAAAAA TTGTAGCATT CCCGCAACAC GGTTTGTTAC GAAGTAATTC CGTAGGGCTT
GTAAAAGACG CTATGCGTAT GGGAGCTCAT TTAGTAGGTG GAGTAGACCC AGCTACAGTA
GATGGTAATA TTGAAAAATC ATTGAATACA ATTATGGATA TTGCAGTAGA GTTTGATTCA
GACATTGATA TTCATTTACA TGACGCGGAT CAACTCGGAA CATTTACAAT GAAGAGATTA
GCTGCATTAA CAGAAGAAGC AGGGTGGCAA GGAAGAGTTA CGATTAGTCA TGCTCTTGGA
CTTGGAGATG TATCTGTAGA AGAAGCGGGG GAAATGGCTG AACGACTTGC GGCATTAGGT
ATTGATATAA CGTCAACAGT TCCAGTTAGT AGACATGTAA TTCCAGTTCC GTTATTAAAT
CGTAAAGGTG TAAAAGTTTC GCTAGGAAAC GATAGTATAA CTGATCATTG GTCTCCGTTT
GGGACAGGAG ATATGTTGCA AAAGGCAAAT TGTCTAGCGG AGAGATTTAG ATGGATTGAT
GAGCGCTCTT TAGGGAAAGC ACTTCAGTTT ATTACGGGTG GGAAATCTAT ATTAGATGAT
CAAGGAAATC GTCAATGGCC AAAAATTGGG GATGAAGCAA ATATCGTCTT TACAGAAGCA
TCATGTTCAG CCGAGGTAGT AGCAAGACAA ACAGAGCGCT GTGCGGTTTT ATATAAAGGA
AATGTTGTTG CTGGTAGTTT GGAAAAAACG GCTATAAATA ATCTTGTTTG A
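
The GC content field in Gene Information could in principle be recomputed from a sequence printed in blocks of ten, as above. The following is a minimal Python sketch of that calculation, not the method the database itself uses. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence shown in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "GTGAGTGTGA AATCACAATA TTGGTTAACG AATGTAAAAC TTGAATGTGG GTACGTATAT"
print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the rounded figure in the record.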
 
Protein sequence
MSVKSQYWLT NVKLECGYVY EESRITSTET EICSLFIEDG KITNILPGIV SEQDGEVVNA 
NGLLALPAFE EMHIHIDKTY YGGPWKACTP VKSIFTRIHE EQTILPKQLE TAKSRAEKML
QLLLENGATN IRTHCNIDPV IGLGNLEATI AALETYKDKL SAKIVAFPQH GLLRSNSVGL
VKDAMRMGAH LVGGVDPATV DGNIEKSLNT IMDIAVEFDS DIDIHLHDAD QLGTFTMKRL
AALTEEAGWQ GRVTISHALG LGDVSVEEAG EMAERLAALG IDITSTVPVS RHVIPVPLLN
RKGVKVSLGN DSITDHWSPF GTGDMLQKAN CLAERFRWID ERSLGKALQF ITGGKSILDD
QGNRQWPKIG DEANIVFTEA SCSAEVVARQ TERCAVLYKG NVVAGSLEKT AINNLV
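
The protein sequence above begins with M although the gene sequence begins with GTG. This is consistent with translation table 11, in which GTG is one of the alternative start codons. The Python sketch below illustrates this with a hand-built codon table. It assumes the NCBI table 11 amino acid assignments (identical to the standard code) and its list of start codons. The function name `translate` is hypothetical, and a library such as Biopython would normally be used instead.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAGTGTGA AATCACAATA TTGGTTAACG"))  # MSVKSQYWLT
```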