Gene BAS1156 details

Gene Information

Locus tag: BAS1156
Symbol:
ID: 2851769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 1203111
End bp: 1204538
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 39%
IMG OID: 637504413
Product: anthranilate synthase component I
Protein accession: YP_027427
Protein GI: 49184175
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
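
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the values in this record). The variable names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1203111
end_bp = 1204538

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1428 0 475
```

The result agrees with the listed Gene Length (1428 bp) and Protein Length (475 aa).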


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.306407
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGGCA TGATGACAAA AGAGGAATTT ATAAAACAGA AAGAACAAAG AAAAACATTT 
TTGGTAATCG CTGAAGAAGA AGGAGATAGC ATTACGCCAA TTTCTTTATA TAGACGTATG
AAAGGTAAAA AGAAATTTTT ATTAGAAAGC TCACAGCTTC ATCAAGATAA AGGGCGTTAT
TCTTACTTAG GTTGTAATCC ATATGGTGAA GTGAAAAGCG TTGGTACGGA AGTGGAACGA
ACGATTTACG GCAGGGCAGA AAAGTTGCAA AGTAACGTAC TACAAGTGTT AGAAGAAATA
ATCGCACCAT CACAAGTAGA CAGTCCATTT CCATTTTGCG GAGGAGCAGT TGGATACATT
GGCTATGACG TCATTCGGCA ATATGAAAAC ATTGGAGCAG ATTTACATGA TCCATTGAAT
ATTCCAGAAG TACACCTTTT ACTGTACCGT GAGTTTATCG TGTATGACCA CTTACGCCAA
AAGTTGTCGT TTGTATATGT ATGCAGGGAA GATGATTCAG CAGATTATGA AGAAGTATAC
GAAAGGCTGC GAGTATACAA AGAGGAAGTG CTACAGGGAG AAGAAGCGGA AGTAACTGAA
ATAAGATCAA CATTATCATT CACTTCTTCT ATAACGGAAA GAGAATTTTG CGTGATGGTA
GAAACGGCGA AAGAACACAT CGGGGCCGGG GACATATTTC AAGTTGTATT ATCTCAGCGT
TTGCAAAGCG AATGTATTGG AGATCCATTC GCGTTATATC GAAAACTTCG AATTGCCAAT
CCATCACCAT ATATGTTTTA TATCGATTTT CAAGATTATG TTGTACTCGG TTCTTCGCCA
GAAAGTTTGT TATCAGTAAG GGAGGATAAA GTGATGACGA ATCCAATTGC TGGTACGAGG
CCGAGAGGGA AAACGAAGGA GGAAGATACG GAGATTGAAA AAGAACTGTT AGAAAATGAG
AAAGAGCGAG CGGAGCATAT GATGCTTGTA GATCTTGGGC GAAATGATAT TGGTAGAGTG
AGTGAAATCG GCTCAGTTAC GATAGATAAA TATATGAAAG TAGAAAAATA TTCTCACGTT
ATGCACATTG TATCTGAAGT TTACGGAACA TTGCGAAAAC AAATGAGCGG ATTTGATGCA
TTAGCGTACT GTTTACCAGC GGGGACGGTA TCAGGTGCTC CGAAAATTAG AGCGATGGAA
ATTATAAATG AGCTAGAGAA TGAAAAAAGA AATGTGTACG CCGGTGCAGT TGGATACGTT
AGTTTTTCAG GGAATCTTGA TATGGCGCTC GCCATTCGAA CAATGGTTGT AAAGGATGAA
AAAGCATACG TTCAGGCAGG AGCGGGTATT GTTTACGATT CAGATCCAGT AGCTGAATAT
GAAGAAACAT TAAATAAAGC GAGAGCGCTA TTGGAGGTAA TGAAATGA
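
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the Gene Information fields. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here as a small sample.
sample = clean_sequence("GTGAAAGGCA TGATGACAAA")
print(len(sample), gc_percent(sample))
```

Applying the same two functions to the full sequence text gives values that can be compared with the Gene Length and GC content fields of the record.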
 
Protein sequence
MKGMMTKEEF IKQKEQRKTF LVIAEEEGDS ITPISLYRRM KGKKKFLLES SQLHQDKGRY 
SYLGCNPYGE VKSVGTEVER TIYGRAEKLQ SNVLQVLEEI IAPSQVDSPF PFCGGAVGYI
GYDVIRQYEN IGADLHDPLN IPEVHLLLYR EFIVYDHLRQ KLSFVYVCRE DDSADYEEVY
ERLRVYKEEV LQGEEAEVTE IRSTLSFTSS ITEREFCVMV ETAKEHIGAG DIFQVVLSQR
LQSECIGDPF ALYRKLRIAN PSPYMFYIDF QDYVVLGSSP ESLLSVREDK VMTNPIAGTR
PRGKTKEEDT EIEKELLENE KERAEHMMLV DLGRNDIGRV SEIGSVTIDK YMKVEKYSHV
MHIVSEVYGT LRKQMSGFDA LAYCLPAGTV SGAPKIRAME IINELENEKR NVYAGAVGYV
SFSGNLDMAL AIRTMVVKDE KAYVQAGAGI VYDSDPVAEY EETLNKARAL LEVMK
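
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several permitted start codons, and an initiating codon is translated as methionine. The following is a minimal sketch of that translation step, not the method used by the database. The function name `translate_cds` is hypothetical, and the codon table is built from the standard 64-letter amino-acid string with bases ordered T, C, A, G.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading an initial start codon as M and
    stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate_cds("GTGAAAGGCA TGATGACAAA AGAGGAATTT"))  # prints: MKGMMTKEEF
```

The output matches the first block of the protein sequence above. The sketch does not handle ambiguous bases such as N, which would raise a `KeyError`.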