Gene BAS3739 details

Gene Information

Locus tag: BAS3739
Symbol: pyrC
ID: 2852141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3707491
End bp: 3708777
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 40%
IMG OID: 637506977
Product: dihydroorotase
Protein accession: YP_029990
Protein GI: 49186738
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
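
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3707491
end_bp = 3708777
gene_length_bp = 1287
protein_length_aa = 428


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))  # 1287
print(coding_codons(gene_length_bp))  # 428
```

Under those assumptions, both results agree with the Gene Length and Protein Length values listed in the record.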


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.170477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTATT TGTTTAAAAA TGGTCGTTAT ATGAATGAAG AAGGAAAAAT CGTAGCAACG 
GATCTTCTAG TACAAGACGG TAAAATCGCT AAAGTAGCAG AAAATATTAC GGCAGATAAT
GCTGAAGTGA TCGATGTGAA CGGAAAGTTA ATCGCACCTG GATTAGTAGA TGTACACGTA
CACCTTCGTG AACCAGGTGG TGAACATAAA GAAACAATTG AAACAGGTAC ATTAGCAGCG
GCAAAAGGTG GATTCACTAC AATTTGCGCA ATGCCAAATA CACGCCCAGT ACCAGATTGC
AGAGAACATA TGGAAGACTT GCAAAATCGT ATTAAAGAAA AAGCACATGT TAACGTACTA
CCATATGGAG CAATTACAGT ACGTCAAGCC GGTTCTGAAA TGACAGATTT CGAAACATTA
AAAGAGCTTG GAGCATTTGC TTTCACTGAT GACGGTGTAG GCGTACAAGA TGCTAGCATG
ATGTTAGCTG CTATGAAGCG TGCAGCGAAA TTAAATATGG CAGTAGTTGC GCACTGTGAA
GAGAATACTC TTATTAATAA AGGTTGTGTA CATGAAGGGA AGTTTTCTGA GAAACACGGA
TTAAACGGTA TCCCATCAGT ATGTGAATCT GTACATATTG CAAGGGATAT ACTGCTTGCT
GAAGCAGCAG ATTGTCACTA TCACGTATGT CACGTAAGTA CGAAAGGCTC TGTACGCGTA
ATTCGTGATG CAAAGCGCGC TGGAATTAAA GTAACAGCAG AGGTAACGCC TCATCACTTA
GTGTTATGTG AAGATGATAT CCCATCAGCT GATCCTAATT TTAAAATGAA CCCACCGCTT
CGTGGAAAAG AAGACCACGA AGCATTAATT GAAGGTTTAT TAGATGGAAC AATCGATATG
ATCGCAACTG ACCATGCACC GCATACAGCA GAAGAGAAAG CGCAAGGAAT TGAAAGAGCA
CCATTCGGGA TTACTGGTTT TGAAACTGCA TTCCCACTTC TATACACAAA CCTTGTGAAA
AAAGGAATTA TTACACTAGA GCAGTTAATT CAATTCTTAA CAGAAAAGCC AGCTGATACA
TTCGGCTTAG AAGCAGGTCG CCTGAAAGAA GGTAGAACAG CTGATATTAC AATCATTGAT
TTAGAACAAG AAGAAGAGAT TGACCCAACA ACATTCTTAT CAAAAGGAAA AAATACACCA
TTCGCAGGTT GGAAATGCCA AGGATGGCCG GTAATGACAA TCGTTGGTGG TAAGATCGCA
TGGCAAAAGG AGAGTGCATT AGTATGA
 
Protein sequence
MNYLFKNGRY MNEEGKIVAT DLLVQDGKIA KVAENITADN AEVIDVNGKL IAPGLVDVHV 
HLREPGGEHK ETIETGTLAA AKGGFTTICA MPNTRPVPDC REHMEDLQNR IKEKAHVNVL
PYGAITVRQA GSEMTDFETL KELGAFAFTD DGVGVQDASM MLAAMKRAAK LNMAVVAHCE
ENTLINKGCV HEGKFSEKHG LNGIPSVCES VHIARDILLA EAADCHYHVC HVSTKGSVRV
IRDAKRAGIK VTAEVTPHHL VLCEDDIPSA DPNFKMNPPL RGKEDHEALI EGLLDGTIDM
IATDHAPHTA EEKAQGIERA PFGITGFETA FPLLYTNLVK KGIITLEQLI QFLTEKPADT
FGLEAGRLKE GRTADITIID LEQEEEIDPT TFLSKGKNTP FAGWKCQGWP VMTIVGGKIA
WQKESALV
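
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAATTATT TGTTTAAAAA TGGTCGTTAT"
print(translate(first_blocks))  # MNYLFKNGRY
```

Passing the full gene sequence to `translate` should give the listed protein sequence, and `gc_content` can be compared against the GC content field in the same way.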