Gene GBAA_5100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_5100 
Symbol 
ID2815328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4621378 
End bp4622478 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content40% 
IMG OID637791761 
Productdihydroorotase 
Protein accessionYP_021750 
Protein GI47530401 
COG category[R] General function prediction only 
COG ID[COG3964] Predicted amidohydrolase 
TIGRFAM ID[TIGR03583] probable amidohydrolase EF_0837/AHA_3915 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAC GATTCGTACT ACGTAATGTG AAACGTGTGA ACGGGGAAGA GATTGACATT 
GTAATTGAAA ATAATAAAAT CGCACAGGTG ACGAAAGCTG GTGCTGGCGA GGGTGGAAAG
GTTCTTGATT ACTCAGGTAC TTACGTATCG AGTGGTTGGA TTGATTTGCA CGTTCATGCT
TTTCCAGAGT TTGATCCGTA TGGCGATGAG GTGGACGAAA TTGGCGTTAA GCAAGGGGTA
ACGACAATTG TTGATGCAGG TAGCTGCGGT GCTGATCGCA TTGCAGATTT AGTAAAAAGT
AGAGAACAGG CAAAGACGAA TTTATTTGCT TTTTTAAATA TTTCTCGCAT CGGTTTGAAA
CGAATTGATG AATTATCCAA TATGGAATGG ATCGATAAAG AGAAAGTAAT ACAAGCAGTA
GAAAAGTATA AAGATGTAAT CGTTGGGTTA AAGGCGAGAA TGAGTAAAAG TGTCGTTTGT
GATAGTGGAA TTGAACCGCT TCATATAGCG CGTGATTTAT CCCGTGAAAC ATCATTACCG
ATTATGGTAC ATATCGGTTC AGCGCCCCCT CGCATTGAGG AAGTTGTACC TCTTTTAGAA
AAAGATGATG TTATTACACA TTACTTAAAC GGGAAAGAAA ATAATTTATT TGATGAAGAA
GGCAAACCGC TACCTGTGTT ACTAGATGCA GTGAATCGCG GTGTGCATTT AGATGTTGGG
CATGGTAATG CTAGTTTTTC TTTTAAAGTA GCAGAGGCAG CAAAGCGTCA CGATATTGCC
TTTCATACAA TTAGTACAGA TATTTACCGG AAGAATCGCG TGCACGGTCC AGTGTATAGT
ATGGCTCACG TTCTTTCGAA ATTCCTTTAC TTAGGTTATC CGCTAGAAGA AGTGATTGAT
GCGGTTACGA AACATGCGGC AGAATGGCTT AAGAAACCTG AGCTTGGCCG CATTCAAGAA
GGAGATATTG CAAACTTAAC TTTATTTACG GTGAAAGATG AGAAGGTTAA GTTAATAGAT
TCAGAAGGGG ATCAGCGCAT TGCTGAAAGA AGAATTGATA CGAAAGGGGT TGTAGTCAAT
GGGTCATTCA TTGAATGCTA A
 
Protein sequence
MTERFVLRNV KRVNGEEIDI VIENNKIAQV TKAGAGEGGK VLDYSGTYVS SGWIDLHVHA 
FPEFDPYGDE VDEIGVKQGV TTIVDAGSCG ADRIADLVKS REQAKTNLFA FLNISRIGLK
RIDELSNMEW IDKEKVIQAV EKYKDVIVGL KARMSKSVVC DSGIEPLHIA RDLSRETSLP
IMVHIGSAPP RIEEVVPLLE KDDVITHYLN GKENNLFDEE GKPLPVLLDA VNRGVHLDVG
HGNASFSFKV AEAAKRHDIA FHTISTDIYR KNRVHGPVYS MAHVLSKFLY LGYPLEEVID
AVTKHAAEWL KKPELGRIQE GDIANLTLFT VKDEKVKLID SEGDQRIAER RIDTKGVVVN
GSFIEC