Gene GBAA_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_1072 
SymbolhemY-1 
ID2815558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp1053241 
End bp1054611 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content42% 
IMG OID637788038 
Productprotoporphyrinogen oxidase 
Protein accessionYP_017697 
Protein GI47526348 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCACT TACAAAAAGA TATTCGTGAC AAGAACTTGC CGATCGATAC ATTACTGATA 
GAAGCATCGG GTAAACTTGG CGGGAAAATT CAAACCGTTC GAAAAGATGG ATTTACAATT
GAACGCGGAC CGGATTCTTT CTTAGCACGA AAAGAAAGTG CAGCTAGATT AGTGAAAGAA
TTAGGTCTTG GCGATGAGCT TGTAAATAAT CAGGCCGGTC AATCATTTAT CCTCGTAAAC
AATCGGTTAC ATAAAATGCC GAGCGGATCA ATGATGGGAA TTCCAACGCA AATTACGCCG
TTTCTATTTT CTGGGCTGTT CTCCCCAATT GGGAAACTAA GAGCTGGTTT TGATCTATTA
ATGCCAAGAT CAAAACCAGT ATCTGACCAA TCACTCGGGC ACTTTTTCAG ACATCGCCTC
GGAAATGAAG TGGTTGAAAA TTTAATAGAA CCATTACTAT CTGGTATTTA TGCAGGGGAT
ATTGATGAAA TGAGCTTAAT GTCAACATTC CCGCAAATGT ATCAAATTGA GCAGAAACAT
CGCAGTATTT CACTCGGTAT GCGTACGCTC GCCCCGAAAG CAGAGAAAGC TGAACCGAAA
AAGGGAATCT TCCAAACAGT GAAAACCGGT TTAGAATCTA TCGTAGAATC TCTCGAATTA
AAGATGCATG AAGGTACGAT AATAAAGGGA ACTCGCATAG AAAAAGTTGC AAAACAGGGT
GATGGCTATG CGATTACTCT TAGTAACGGA AAAGAAATAG AAGCGGACGC GGTCGTAGTG
GCAAGCTCAC ATAAAGTATT GCCATCTATG TTTGCGCAGT ACAAGCAATT TCGTTTCTTC
CGCAACATTC CATCCACATC AGTTGCGAAT GTGGCAATGG CTTTCCCGAA ATCAGCCATT
CAGCGGGATA TTGATGGTAC AGGATTTGTT GTCTCTCGAA ATAGTGATTA CACAATTACA
GCATGTACGT GGACGCATAA AAAGTGGCCA CATACAACGC CAGAAGGAAA AACGCTTCTT
CGATGTTACG TTGGACGACC TGGTGATGAA GCGGTTGTAG AACAAACAGA AGAGGAACTC
GTTCAGCTCG TACTAGAAGA CTTACGAAAG ACGATGGATA TTACAGAGGA TCCAGAGTTT
ACAGTCGTAA GTCGCTGGAA AGAAGCAATG CCCCAATATA CAGTAGGCCA TAACGAGCGA
ATGAAGAAAC TCACAACATT TATGGAGAAA GAGTTGCCAG GTATATACTT GGCAGGTAGT
TCTTACGCTG GTTCTGGTCT TCCGGACTGT ATTGATCAAG GTGAGAAGGC TGCAAAACGT
GTACTCTCTC ATTTGGAGAA AGTAATGAAT ACGGAATTAA TCGCACAATA A
 
Protein sequence
MYHLQKDIRD KNLPIDTLLI EASGKLGGKI QTVRKDGFTI ERGPDSFLAR KESAARLVKE 
LGLGDELVNN QAGQSFILVN NRLHKMPSGS MMGIPTQITP FLFSGLFSPI GKLRAGFDLL
MPRSKPVSDQ SLGHFFRHRL GNEVVENLIE PLLSGIYAGD IDEMSLMSTF PQMYQIEQKH
RSISLGMRTL APKAEKAEPK KGIFQTVKTG LESIVESLEL KMHEGTIIKG TRIEKVAKQG
DGYAITLSNG KEIEADAVVV ASSHKVLPSM FAQYKQFRFF RNIPSTSVAN VAMAFPKSAI
QRDIDGTGFV VSRNSDYTIT ACTWTHKKWP HTTPEGKTLL RCYVGRPGDE AVVEQTEEEL
VQLVLEDLRK TMDITEDPEF TVVSRWKEAM PQYTVGHNER MKKLTTFMEK ELPGIYLAGS
SYAGSGLPDC IDQGEKAAKR VLSHLEKVMN TELIAQ