Gene BAS0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0226 
Symbol 
ID2852981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp226839 
End bp227957 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content36% 
IMG OID637503431 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_026511 
Protein GI49183259 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAA AATCTATGGA TACGCTAGCT GCACAAATGG AGGACTTTTT TCCAGTACGT 
GATGTAGATC ATTTGGAATT TTACGTAGGA AATGCAAAGC AATCAAGTTA TTATCTTGCG
AGAGCATTCG GATTCAAAAT TGTGGCTTAC TCTGGATTAG AAACTGGTAA TCGTGAAAAA
GTATCTTATG TTCTTGTGCA AAAAAACATG CGTTTTGTTG TGTCTGGGGC TTTAAGTAGT
GACAATCGTA TTGCAGAGTT TGTAAAGACT CATGGTGATG GCGTGAAAGA TGTAGCATTA
CTTGTTGACG ATGTTGATAA AGCATACTCA GAAGCAGTGA AACGTGGTGC CGTCGCAATT
GCTCCGCCTG TAGAGTTAAC AGATGAGAAC GGTACATTGA AAAAAGCAGT TATTGGTACG
TATGGTGATA CAATTCATAC GCTTGTAGAG CGTAAAAATT ATAAAGGGAC ATTTATGCCA
GGATTCCAAA AGGCTGAGTT TGATATTCCA TTTGAAGAGT CAGGTTTAAT TGCTGTAGAC
CATGTAGTTG GTAATGTTGA AAAGATGGAA GAGTGGGTTA GTTATTACGA GAACGTTATG
GGCTTTAAAC AAATGATCCA TTTTGATGAT GATGATATTA GTACAGAGTA TTCAGCATTA
ATGTCGAAGG TTATGACAAA TGGAAGTCGT ATTAAGTTCC CTATTAACGA GCCAGCAGAT
GGAAAGAGAA AATCACAAAT TCAAGAATAT CTAGAGTTCT ATAATGGAGC AGGTGTACAG
CATCTTGCTT TACTAACAAA TGACATTGTT AAAACAGTAG AAGCGCTACG TGCAAATGGT
GTGGAGTTTT TAGATACACC AGATACTTAT TATGATGAGT TAACTGCACG AGTTGGAAAA
ATTGATGAGG AAATTGATAA GTTGAAAGAA TTAAAGATTT TAGTAGATCG CGATGATGAA
GGATACTTAC TACAAATCTT TACGAAACCA ATTGTAGATC GTCCAACTTT ATTTATTGAA
ATCATTCAGC GTAAAGGTTC TCGTGGATTT GGAGAAGGAA ACTTTAAAGC GTTATTCGAA
TCAATTGAAA GAGAACAAGA GCGTCGCGGG AATTTATAA
 
Protein sequence
MKQKSMDTLA AQMEDFFPVR DVDHLEFYVG NAKQSSYYLA RAFGFKIVAY SGLETGNREK 
VSYVLVQKNM RFVVSGALSS DNRIAEFVKT HGDGVKDVAL LVDDVDKAYS EAVKRGAVAI
APPVELTDEN GTLKKAVIGT YGDTIHTLVE RKNYKGTFMP GFQKAEFDIP FEESGLIAVD
HVVGNVEKME EWVSYYENVM GFKQMIHFDD DDISTEYSAL MSKVMTNGSR IKFPINEPAD
GKRKSQIQEY LEFYNGAGVQ HLALLTNDIV KTVEALRANG VEFLDTPDTY YDELTARVGK
IDEEIDKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE
SIEREQERRG NL