Gene GBAA_3299 details


Gene Information

Locus tag: GBAA_3299
Symbol:
ID: 2817361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3035928
End bp: 3038810
Gene Length: 2883 bp
Protein Length: 960 aa
Translation table: 11
GC content: 33%
IMG OID: 637790061
Product: collagenase
Protein accession: YP_019933
Protein GI: 47528584
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
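
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3035928, 3038810)
print(length_bp)                  # 2883
print(protein_length(length_bp))  # 960
```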


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.675823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA ACTTAAAATT TACTCAAATG ATGATTGGGA TTAGTACGAT GGCATTATCA 
TTTGGAAGTA TTCAAACACA CGTATCAGCA GAAGAAACAG CATCCTATAA TATCTTACAG
ATGAAACCAA TAGGGACAGA AACTTCAAAA GATGAAATTG TACATGCTAC AAAAGCGGAC
GAAACTTTGA CTTTTGAAGA GCGTTTAAAA GTAGGAGATT TTTCACAACG TCCTACCCTG
GTTATGGAAC GTGATGAAAT TCAATTAAAG CAAAGCTACA CTCTGGCAGA ACTGAATAAA
ATGCCTAATA GCGAACTCAT TGATACATTA TCAAAAATTT CTTGGAATCA AATTACGGAT
TTATTTCAAT TCAATCAAGA TACGAAAGCA TTTTATCAGA ATAAAGAGCG CATGAACGTT
ATCATTAATG AATTAGGACA AAGAGGAAGG ACGTTTACGA AAGAAAACTC AAAGGGCATT
GAAACATTTG TTGAAGTATT ACGTTCTGCT TTTTATGTGG GATATTATAA TAATGAATTA
AGTTACTTAA AAGAGAGAAG CTTCCATGAA AAATGTTTAC CAGCATTGAA AGCGATTGCG
AAAAATCCAA ACTTTACATT AGGTACAGCT GAGCAAGATA GAGTAGTAGC TGCGTATGGA
AAATTAATTG GTAATGCCTC TAGTGATACT GAAACAGTAC AGTATGCGGT AAATGTTTTA
AAACAATATA ATGATAATCT TACTACGTAT GTAAGCGATT ATGCGAAAGG ACAAGCTGTA
TATGAAATTG TAAAAGGAAT TGATTATGAT ATACAGTCTT ATTTGCAAGA TACGAATAAG
CAACCTAATG AAACAATGTG GTATGGAAAG ATTGATAACT TTATAAATGA GGTTAATAGA
ATTGCTCTCG TGGGGAATAT AACAAATGAA AATAGTTGGC TAATTAATAA TGGCATTTAT
TATGCAGGTC GTTTAGGGAA ATTTCATAGT AATCCATACA AAGGATTGGA AGTTATTACA
CAAGCAATGA GCTTGTATCC TCGTCTAAGT GGACCTTATT TTGTAGCAGT AGAACAAATT
AAAACAAACT ATGGTGGAAA AGATTATAGT GGAAAGGCAG TAGATCTACA GAAAATACGT
GAAGAAGGGA AACGACAATA CTTACCTAAA ACATATACAT TTGATGACGG ATCAATTGTC
TTCAAGACGG GAGATAAAGT AACAGAAGAA AAAATTAAGA GATTATATTG GGCAGCCAAA
GAAGTAAAAG CGCAATATCA CCGTGTAATT GGTAATGATA AAGCACTAGA ACCGGGTAAC
GCTGATGATG TACTAACAAT AGTAATTTAT AATAATCCAG ATGAATATCA ATTAAATAGA
CAATTATATG GATATGAAAC AAACAACGGT GGAATTTATA TTGAAGAGAA GAGGACCTTC
TTTACATATG AGCGTACGCC AAAGCAGAGT ATTTATAGTT TAGAAGAGTT ATTCCGTCAT
GAATTCACTC ATTATTTACA AGGAAGGTAT GAGGTTCCTG GTTTATTTGG AAGCGGAGAA
ATGTATCAAA ATGAACGATT AACTTGGTTC CAAGAAGGGA ATGCAGAATT TTTTGCAGGA
TCTACACGTA CAAATAATGT TGTTCCACGT AAAAGTATGA TAAGTGGCTT ATCATCTGAT
CCAGCAAGCC GTTATACAGC AAAGCAAACT TTGTTCTCAA AATATGGATC ATGGGACTTT
TATAAGTATT CCTTTGCACT ACAGTCATAT TTGTATAATC ATCAATTTGA CACATTTGAT
AAGCTTCAAG ATTTAATCCG TGTAAACGAT GTGAAAAATT ATGACTCATA TCGTGAATCA
TTGAGCAACA ATACACAATT GAATGCAGAA TATCAAGCGT ATATGCAGCA GTTGATTGAT
AATCAAGATA AATATAATGT ACCGCAAGTA ACAAATGATT ATTTAATTCA ACACGCACCA
AAGCCGTTAG CTGAAGTGAA AAACGAAATT GTGGATGTAG CAAATATAAA AGATGCAAAA
ATTACAAAAT ATGAGTCGCA ATTCTTTAAT ACATTTACCG TGGAAGGCAA GTACACAGGT
GGTACATCAA AAGGTGAGTC TGAAGATTGG AAAACGATGA GTAAACAAGT AAATCGAACT
TTGGAGCAGT TATCCCAAAA AGGTTGGAGT GGTTATAAAA CAGTTACAGC CTATTTTGTA
AACTATCGTG TGAATGCAGC TAACCAGTTT GAATATGATA TTGTTTTTCA TGGTGTTGCA
ACAGAGGAAA AGGAAAAAAC AAATACTATA GTAAATATGA ATGGACCATA CAGCGGGATA
GTAAATGAAG AGATTCAATT TCATAGCGAT GGTACAAAAA GTGAAAATGG AAAAGTTACT
TCTTATCTAT GGAACTTTGG AGATGGTACA ACAAGTACAG AAGCAAATCC TACCCATGTA
TATGAAGAAA AAGGAACATA CACTGTGGAA CTAACTGTGA AAGATCGTAG AGGAAAAGAA
AGCAAAGAAC AAACAAAAGT TACTGTAAAA CAAGATCCGC AAACAGGTGA ATTCCATGAA
GAGGAGAAGG TACTCCTGTT TAATACGCTT GTAAAAGGAA ATCTGGTTAC TCCTGATCAA
ACAGATGTTT ATACGTTTGA TGTTACAGAT ACAAAAGAAG TAGATATTTC TGTGGTAAAT
GAACAAAATA TTGGGATGAC ATGGGTACTT TATCATGAAT CAGACATGCA AAATTATGTA
GCTTGTGGTG AAGATGAAGG AAATGTTATA AAGGGGAAAT TCGAAGCGAA ACCAGGCAAA
TATTATTTGA ATGTTTATAA ATTCGATGAT AAAAATGGTG AATATTCATT ATTGGTAAAA
TAA
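
The gene sequence above is laid out in blocks of 10 bases, 6 blocks per row. A minimal sketch of how such a listing could be cleaned and its GC fraction computed is shown below. The helper names are hypothetical, and only the first row of the listing is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listing, copied from the record.
first_row = "ATGAACAAGA ACTTAAAATT TACTCAAATG ATGATTGGGA TTAGTACGAT GGCATTATCA"
print(len(clean_sequence(first_row)))  # 60
print(gc_fraction(first_row))
```

Passing the full listing to `gc_fraction` instead of a single row gives the value to compare against the GC content field of the record.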
 
Protein sequence
MNKNLKFTQM MIGISTMALS FGSIQTHVSA EETASYNILQ MKPIGTETSK DEIVHATKAD 
ETLTFEERLK VGDFSQRPTL VMERDEIQLK QSYTLAELNK MPNSELIDTL SKISWNQITD
LFQFNQDTKA FYQNKERMNV IINELGQRGR TFTKENSKGI ETFVEVLRSA FYVGYYNNEL
SYLKERSFHE KCLPALKAIA KNPNFTLGTA EQDRVVAAYG KLIGNASSDT ETVQYAVNVL
KQYNDNLTTY VSDYAKGQAV YEIVKGIDYD IQSYLQDTNK QPNETMWYGK IDNFINEVNR
IALVGNITNE NSWLINNGIY YAGRLGKFHS NPYKGLEVIT QAMSLYPRLS GPYFVAVEQI
KTNYGGKDYS GKAVDLQKIR EEGKRQYLPK TYTFDDGSIV FKTGDKVTEE KIKRLYWAAK
EVKAQYHRVI GNDKALEPGN ADDVLTIVIY NNPDEYQLNR QLYGYETNNG GIYIEEKRTF
FTYERTPKQS IYSLEELFRH EFTHYLQGRY EVPGLFGSGE MYQNERLTWF QEGNAEFFAG
STRTNNVVPR KSMISGLSSD PASRYTAKQT LFSKYGSWDF YKYSFALQSY LYNHQFDTFD
KLQDLIRVND VKNYDSYRES LSNNTQLNAE YQAYMQQLID NQDKYNVPQV TNDYLIQHAP
KPLAEVKNEI VDVANIKDAK ITKYESQFFN TFTVEGKYTG GTSKGESEDW KTMSKQVNRT
LEQLSQKGWS GYKTVTAYFV NYRVNAANQF EYDIVFHGVA TEEKEKTNTI VNMNGPYSGI
VNEEIQFHSD GTKSENGKVT SYLWNFGDGT TSTEANPTHV YEEKGTYTVE LTVKDRRGKE
SKEQTKVTVK QDPQTGEFHE EEKVLLFNTL VKGNLVTPDQ TDVYTFDVTD TKEVDISVVN
EQNIGMTWVL YHESDMQNYV ACGEDEGNVI KGKFEAKPGK YYLNVYKFDD KNGEYSLLVK
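
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code for codon-to-amino-acid mapping (the two tables differ in which start codons they permit). The function name is hypothetical, and the inputs are the first row and the final bases of the gene listing.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA listing, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listing.
print(translate("ATGAACAAGA ACTTAAAATT TACTCAAATG ATGATTGGGA TTAGTACGAT GGCATTATCA"))
# MNKNLKFTQMMIGISTMALS

# Last three codons of the gene, ending in the TAA stop.
print(translate("GTAAAATAA"))  # VK
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final two residues.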