Gene BAS1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1003 
Symbol 
ID2849333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1056137 
End bp1058503 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content38% 
IMG OID637504262 
Productendonuclease/exonuclease/phosphatase family protein 
Protein accessionYP_027276 
Protein GI49184024 
COG category[R] General function prediction only 
COG ID[COG2374] Predicted extracellular nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0487379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTGA AGAAATTGTT AAGTGTCTTT TTATCATTCA TACTATTGCT TTCATTTACT 
GGAACTTTAG TACAAGCAGA AGAAACTACT TCTATGTCAG TAGAAAAAGC AATTCAAGTA
TTTAAGCAGC AAGGGAAAAC GAAGGGGACA GTGGAAGGAT ATATCGTTGG ATATACGCAA
AGTCCTTCTA AGTACACGAA GGATCCGGCT AAATTTGATG ATACAAATGT AGCAATTGCC
GATTCACCAA ACGAAACAAA TCCAGACAAG ATCATGCCTG TTCAGTTGCC AAAAGGTGAC
GTGAGAGCGG CAGTGAATGT AAAAGATCAT CCTGAAAATA TCGGGAAGAA AGTTAGTTTA
ACAGGGACTC TTGAATTATA TTTTAGTAGC CCAGGTTTAA AATCAGTAAC AGCTCATAAG
TTTCAAGGGG AAGAACAAAA CCGTGTTAGT GATGTAGTGG CTTCACCGAA TGGAGGAGAA
GTTGCAAAAG GAACAGCAGT AACGTTAACA ACGAACACAG AAGGTGCAAC AATCTACTAT
ACGTTAGATG GCTCAAACCC TACAAATAAA AGTGTTCGTT ATAACGGACA AATCATCGTG
AATGAAAATA GTGTAATGAA AGCAATTGCA GAGAAAGAAG GGCTTACTTC TTCACCAATT
TCAACTTTTT CATTTGTTAT CGTAAACAAT GAACCGGTTC GTATTCATGA TATTCAAGGG
AAATCACATC TTTCCCCTTA CAACGGGAAG AAAGTATACA ATGTGGAAGG CGTTGTAACA
GCACTTGATA AAAATGGTTT TTATATAGAA GATAATCAGC CAGATAATGA TCCAGCTACT
TCAGAAGGTA TGTATGTGTA CAAAAAAGAT GCGAATGTAG CAGTAGGAGA TCTTATTCAA
GTTGATGGAG TAGTAGAAGA ATATGTTGGG CCTGGATATG CAGAAAGGTT TGAAACAGAT
TTAACGACGA CTGAAATTAA AGCGAGCCGC GTTGCTGTAA TCGCAAAAGA TCAACCTTTA
CCAGCACCGA TTGTACTTGG AGAAAACGGT GTGAAAATTC CTGATCAAAT TATCGATAAT
GATGCATTCG GTTTATTTGA TCCAAACGAA GATGCAATCG ACTTTTATGA AAGTGTAGAA
GGTATGCGCG TGACGATGCC AACGCCAAAA ATTATCGCAC CTCAGAAAAA TGGGAATTTA
TATGTAACAG TGAAAAATGG CGGAAATAAA GTCGTAACGA AATATGGCAC ACCTGTATTA
GATGAAAATC AATTAAACCC AGAACGTCTT TCTGTAAAAG TACCTCGTGA TTACGTAGCG
AAAGTAGGGG ATACTTTCAC TGGAGATATA ATAGGGGTAG TCGGATATGA TTACGGTTCA
TTCCGCATTG CGCCAATAAC GGAATTACCA TCTGTAGTAG ACGGTGGATT TAAGCAAATA
GGCGCAAATA TTCAGCCGCG TCTTGATAAG TTAACGGTTG CTACATATAA TATTGAAAAC
TTCTCGGCGA ATAAAAAAGA AACAACAGAT GAAAAAGTAA AAGCGTTAGC GTATTCTATT
AAATACAATT TAAAAATGCC AGATATTATC GGTGTAGAGG AAATGCAAGA TAATAACGGA
ACGGTTAATG ACGGTACAAC AGATGCTTCG TTAAGCGCAA AACGTATTAT TGATGCAGTG
CTAGAAATTC GCGGGCCAAA GTATGAGTAT GTAGAAATTG CTCCAAACAA CAATTTAGAC
GGAGGAGCAC CAGGAGCGAA TATTCGCGTC GGTTTCTTCT ATAACCCATC TCGTGTGAAA
TTAGCGACAG TACCGAAGTT GCTTGATAAA AACGTTGTCC GTATTGGAGA CGAAAATCCA
TTGTTTGAAA GTACACGTAA ACCGCTTGCG GCAGAATTTA CGTTCCAAGG GCAAAACTTA
GTTGTCGTTG CAAATCACTT AAACTCAAAA ATAGGAGATG CAACGCCATT TGGAAAAGTG
CAGCCGCTCG TATTAAAGAG TGAAGAAAAA CGAGTTCAAT TAGCACAAGA AGTAAACAAT
TTCGTACAAG GTATTCAGAA AAAGAACACG AATGCACCAG TTGTTGTGTT AGGAGATATG
AACGATTTTG AGTTTGCTAA ACCACTAAAA ACACTAGAAG GAACAATCTT GAAAAATATG
TTAAACACAG TGCCGAAAGA AAATCGCTAC ACGTATATTC ATGAAGGAAA TGCACAAGTG
CTAGATCATA TTTTAGTAAC AAACAACATC GCACCGCACA CAATCGTAGA TCCAGTACAC
TTAAACACAA ACATTATGAA AGAGCACGGA CGTGTAAGTG ACCACGACCC AGTACTTGCT
CAAATTGATT TGAAGAAGGC ATCTTAA
 
Protein sequence
MAVKKLLSVF LSFILLLSFT GTLVQAEETT SMSVEKAIQV FKQQGKTKGT VEGYIVGYTQ 
SPSKYTKDPA KFDDTNVAIA DSPNETNPDK IMPVQLPKGD VRAAVNVKDH PENIGKKVSL
TGTLELYFSS PGLKSVTAHK FQGEEQNRVS DVVASPNGGE VAKGTAVTLT TNTEGATIYY
TLDGSNPTNK SVRYNGQIIV NENSVMKAIA EKEGLTSSPI STFSFVIVNN EPVRIHDIQG
KSHLSPYNGK KVYNVEGVVT ALDKNGFYIE DNQPDNDPAT SEGMYVYKKD ANVAVGDLIQ
VDGVVEEYVG PGYAERFETD LTTTEIKASR VAVIAKDQPL PAPIVLGENG VKIPDQIIDN
DAFGLFDPNE DAIDFYESVE GMRVTMPTPK IIAPQKNGNL YVTVKNGGNK VVTKYGTPVL
DENQLNPERL SVKVPRDYVA KVGDTFTGDI IGVVGYDYGS FRIAPITELP SVVDGGFKQI
GANIQPRLDK LTVATYNIEN FSANKKETTD EKVKALAYSI KYNLKMPDII GVEEMQDNNG
TVNDGTTDAS LSAKRIIDAV LEIRGPKYEY VEIAPNNNLD GGAPGANIRV GFFYNPSRVK
LATVPKLLDK NVVRIGDENP LFESTRKPLA AEFTFQGQNL VVVANHLNSK IGDATPFGKV
QPLVLKSEEK RVQLAQEVNN FVQGIQKKNT NAPVVVLGDM NDFEFAKPLK TLEGTILKNM
LNTVPKENRY TYIHEGNAQV LDHILVTNNI APHTIVDPVH LNTNIMKEHG RVSDHDPVLA
QIDLKKAS