Gene BAS4740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4740 
Symbol 
ID2851474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4622474 
End bp4623574 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content40% 
IMG OID637507974 
Productdihydroorotase 
Protein accessionYP_030984 
Protein GI49187731 
COG category[R] General function prediction only 
COG ID[COG3964] Predicted amidohydrolase 
TIGRFAM ID[TIGR03583] probable amidohydrolase EF_0837/AHA_3915 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.185439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAC GATTCGTACT ACGTAATGTG AAACGTGTGA ACGGGGAAGA GATTGACATT 
GTAATTGAAA ATAATAAAAT CGCACAGGTG ACGAAAGCTG GTGCTGGCGA GGGTGGAAAG
GTTCTTGATT ACTCAGGTAC TTACGTATCG AGTGGTTGGA TTGATTTGCA CGTTCATGCT
TTTCCAGAGT TTGATCCGTA TGGCGATGAG GTGGACGAAA TTGGCGTTAA GCAAGGGGTA
ACGACAATTG TTGATGCAGG TAGCTGCGGT GCTGATCGCA TTGCAGATTT AGTAAAAAGT
AGAGAACAGG CAAAGACGAA TTTATTTGCT TTTTTAAATA TTTCTCGCAT CGGTTTGAAA
CGAATTGATG AATTATCCAA TATGGAATGG ATCGATAAAG AGAAAGTAAT ACAAGCAGTA
GAAAAGTATA AAGATGTAAT CGTTGGGTTA AAGGCGAGAA TGAGTAAAAG TGTCGTTTGT
GATAGTGGAA TTGAACCGCT TCATATAGCG CGTGATTTAT CCCGTGAAAC ATCATTACCG
ATTATGGTAC ATATCGGTTC AGCGCCCCCT CGCATTGAGG AAGTTGTACC TCTTTTAGAA
AAAGATGATG TTATTACACA TTACTTAAAC GGGAAAGAAA ATAATTTATT TGATGAAGAA
GGCAAACCGC TACCTGTGTT ACTAGATGCA GTGAATCGCG GTGTGCATTT AGATGTTGGG
CATGGTAATG CTAGTTTTTC TTTTAAAGTA GCAGAGGCAG CAAAGCGTCA CGATATTGCC
TTTCATACAA TTAGTACAGA TATTTACCGG AAGAATCGCG TGCACGGTCC AGTGTATAGT
ATGGCTCACG TTCTTTCGAA ATTCCTTTAC TTAGGTTATC CGCTAGAAGA AGTGATTGAT
GCGGTTACGA AACATGCGGC AGAATGGCTT AAGAAACCTG AGCTTGGCCG CATTCAAGAA
GGAGATATTG CAAACTTAAC TTTATTTACG GTGAAAGATG AGAAGGTTAA GTTAATAGAT
TCAGAAGGGG ATCAGCGCAT TGCTGAAAGA AGAATTGATA CGAAAGGGGT TGTAGTCAAT
GGGTCATTCA TTGAATGCTA A
 
Protein sequence
MTERFVLRNV KRVNGEEIDI VIENNKIAQV TKAGAGEGGK VLDYSGTYVS SGWIDLHVHA 
FPEFDPYGDE VDEIGVKQGV TTIVDAGSCG ADRIADLVKS REQAKTNLFA FLNISRIGLK
RIDELSNMEW IDKEKVIQAV EKYKDVIVGL KARMSKSVVC DSGIEPLHIA RDLSRETSLP
IMVHIGSAPP RIEEVVPLLE KDDVITHYLN GKENNLFDEE GKPLPVLLDA VNRGVHLDVG
HGNASFSFKV AEAAKRHDIA FHTISTDIYR KNRVHGPVYS MAHVLSKFLY LGYPLEEVID
AVTKHAAEWL KKPELGRIQE GDIANLTLFT VKDEKVKLID SEGDQRIAER RIDTKGVVVN
GSFIEC