Gene BAS1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1001 
Symbol 
ID2849110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1053083 
End bp1054504 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content42% 
IMG OID637504260 
Productprotoporphyrinogen oxidase 
Protein accessionYP_027274 
Protein GI49184022 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000915854 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGAAAA AAGTTGTAAT CATCGGCGGT GGCATCACAG GATTAACAAC AATGTATCAC 
TTACAAAAAG ATATTCGTGA CAAGAACTTG CCGATCGATA CATTACTGAT AGAAGCATCG
GGTAAACTTG GCGGGAAAAT TCAAACCGTT CGAAAAGATG GATTTACAAT TGAACGCGGA
CCGGATTCTT TCTTAGCACG AAAAGAAAGT GCAGCTAGAT TAGTGAAAGA ATTAGGTCTT
GGCGATGAGC TTGTAAATAA TCAGGCCGGT CAATCATTTA TCCTCGTAAA CAATCGGTTA
CATAAAATGC CGAGCGGATC AATGATGGGA ATTCCAACGC AAATTACGCC GTTTCTATTT
TCTGGGCTGT TCTCCCCAAT TGGGAAACTA AGAGCTGGTT TTGATCTATT AATGCCAAGA
TCAAAACCAG TATCTGACCA ATCACTCGGG CACTTTTTCA GACATCGCCT CGGAAATGAA
GTGGTTGAAA ATTTAATAGA ACCATTACTA TCTGGTATTT ATGCAGGGGA TATTGATGAA
ATGAGCTTAA TGTCAACATT CCCGCAAATG TATCAAATTG AGCAGAAACA TCGCAGTATT
TCACTCGGTA TGCGTACGCT CGCCCCGAAA GCAGAGAAAG CTGAACCGAA AAAGGGAATC
TTCCAAACAG TGAAAACCGG TTTAGAATCT ATCGTAGAAT CTCTCGAATT AAAGATGCAT
GAAGGTACGA TAATAAAGGG AACTCGCATA GAAAAAGTTG CAAAACAGGG TGATGGCTAT
GCGATTACTC TTAGTAACGG AAAAGAAATA GAAGCGGACG CGGTCGTAGT GGCAAGCTCA
CATAAAGTAT TGCCATCTAT GTTTGCGCAG TACAAGCAAT TTCGTTTCTT CCGCAACATT
CCATCCACAT CAGTTGCGAA TGTGGCAATG GCTTTCCCGA AATCAGCCAT TCAGCGGGAT
ATTGATGGTA CAGGATTTGT TGTCTCTCGA AATAGTGATT ACACAATTAC AGCATGTACG
TGGACGCATA AAAAGTGGCC ACATACAACG CCAGAAGGAA AAACGCTTCT TCGATGTTAC
GTTGGACGAC CTGGTGATGA AGCGGTTGTA GAACAAACAG AAGAGGAACT CGTTCAGCTC
GTACTAGAAG ACTTACGAAA GACGATGGAT ATTACAGAGG ATCCAGAGTT TACAGTCGTA
AGTCGCTGGA AAGAAGCAAT GCCCCAATAT ACAGTAGGCC ATAACGAGCG AATGAAGAAA
CTCACAACAT TTATGGAGAA AGAGTTGCCA GGTATATACT TGGCAGGTAG TTCTTACGCT
GGTTCTGGTC TTCCGGACTG TATTGATCAA GGTGAGAAGG CTGCAAAACG TGTACTCTCT
CATTTGGAGA AAGTAATGAA TACGGAATTA ATCGCACAAT AA
 
Protein sequence
MRKKVVIIGG GITGLTTMYH LQKDIRDKNL PIDTLLIEAS GKLGGKIQTV RKDGFTIERG 
PDSFLARKES AARLVKELGL GDELVNNQAG QSFILVNNRL HKMPSGSMMG IPTQITPFLF
SGLFSPIGKL RAGFDLLMPR SKPVSDQSLG HFFRHRLGNE VVENLIEPLL SGIYAGDIDE
MSLMSTFPQM YQIEQKHRSI SLGMRTLAPK AEKAEPKKGI FQTVKTGLES IVESLELKMH
EGTIIKGTRI EKVAKQGDGY AITLSNGKEI EADAVVVASS HKVLPSMFAQ YKQFRFFRNI
PSTSVANVAM AFPKSAIQRD IDGTGFVVSR NSDYTITACT WTHKKWPHTT PEGKTLLRCY
VGRPGDEAVV EQTEEELVQL VLEDLRKTMD ITEDPEFTVV SRWKEAMPQY TVGHNERMKK
LTTFMEKELP GIYLAGSSYA GSGLPDCIDQ GEKAAKRVLS HLEKVMNTEL IAQ