Gene BAS4388 details

Gene Information

Locus tag: BAS4388
Symbol:
ID: 2851689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4299678
End bp: 4300943
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 38%
IMG OID: 637507625
Product: putative deaminase
Protein accession: YP_030635
Protein GI: 49187383
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
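
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4299678, 4300943)
print(length, protein_length(length))  # prints "1266 421"
```

Both values match the Gene Length and Protein Length fields of the record.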


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTGAGA GGATGATGGA AATGCAAAAT GCGTATTGGT TAACGAATGT ACGATTAGAA 
ACAGGTTACA AGTTTAATAA TGAAGTAGTT ACAGGTACAG AAACAGCTTT GCATCATTTA
CTTATACAAG ATGGAAAGAT TGAAAAGATT GTACTTGCGG ATGTACCGCT TCAAACAGAA
TATGAAACGA AAGATGCGAA AGAATTGCTT GTGCTTCCGT CGTTTGTGGA AAATCATTTT
CACTTGGATA AGACAAAACT TGGTGGTCCA TGGGAAGCAT GTACACCAGT AAAAAATATT
ATTGAGAGAT TAGAGTTAGA ACAGCAGGAG TTACCGATTT TAGCTCAAAC AACTGGAGAG
AGAGCAGAGT TATTACTAAG AAACATTTTA AATGCTGGTT CGACTCATAT TCGAACGCAT
GTAAATATTG ATCCGTATAT CGGTCTTAAA AACTTAGAAT CTGTACGTCA AACTTTAGAG
AATATGAAAG ATGCGTTTAC ATATGAAATC GTAGCATTCC CGCAGCATGG TTTACTTCGT
ACAGAAGCGC ATTCTCTTAT GAGAGAAGCG ATGAAAATGG GCGCAACTTT AGTAGGTGGT
GTAGATCCAG CTACTGTAGA TAATAATATT GAAAAATCAC TTTTTGATAT GATGGAAATC
GCTGTAGAGG CAAATGCTGA TGTTGATTTG CATTTACATG ATGCAGGGCA TTTAGGTATT
TATACAATTA AAAAGTTAGC TCAGTATACA GAGGAGGCTA GTTGGGACGG ACGTGTTGCA
GTCAGTCATG CGTTTAGCCT AGGGGATGTA TCTAAAGAAG AAGGAGCAGA TATGGCAGAC
TTATTAGCTG AAAGAGGAAT GTCTATTATT ACGACAGTAC CAATTAACAG AAATATGCCG
CCAGTACCAT TACTGACAGA AAAGGGTGTT CCCATTTCTT TAGGTTGCGA TAGTATGTTT
GATTCATGGG GGCCATTTGG AAATGCTGAT ATTTTTGAAA GAGTAGGGCG TTTAGCAGAA
AAGTATCGCT GGATGGATGA GAAGTCTTTA GCTTCTTCTT TAGCGTATAT TACAGGTGGA
AAAACGCCAT TGGATCAAGA AGGAAATCAA GTTTGGCCTA AAGTAGGAGA TAAAGCTGAT
TTCGTCTTCT TACAAGCTAC TTGTTCAGCA GAAGCGATTG CTAGACGAGC AAAGCGACCA
GCTGTAATGA GAGACGGAAA AATAGTAGCA GGTTCCTTGC AACATGTTCA AGGAGTATTG
ATTTAA
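
The GC content field of the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing shown above; `gc_content` is a hypothetical helper name, and the database may round or compute its figure differently.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full sequence to compare
# the result with the GC content field of the record.
first_line = "TTGTTTGAGA GGATGATGGA AATGCAAAAT GCGTATTGGT TAACGAATGT ACGATTAGAA"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```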
 
Protein sequence
MFERMMEMQN AYWLTNVRLE TGYKFNNEVV TGTETALHHL LIQDGKIEKI VLADVPLQTE 
YETKDAKELL VLPSFVENHF HLDKTKLGGP WEACTPVKNI IERLELEQQE LPILAQTTGE
RAELLLRNIL NAGSTHIRTH VNIDPYIGLK NLESVRQTLE NMKDAFTYEI VAFPQHGLLR
TEAHSLMREA MKMGATLVGG VDPATVDNNI EKSLFDMMEI AVEANADVDL HLHDAGHLGI
YTIKKLAQYT EEASWDGRVA VSHAFSLGDV SKEEGADMAD LLAERGMSII TTVPINRNMP
PVPLLTEKGV PISLGCDSMF DSWGPFGNAD IFERVGRLAE KYRWMDEKSL ASSLAYITGG
KTPLDQEGNQ VWPKVGDKAD FVFLQATCSA EAIARRAKRP AVMRDGKIVA GSLQHVQGVL
I