Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3299 |
Symbol | |
ID | 2817361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 3035928 |
End bp | 3038810 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637790061 |
Product | collagenase |
Protein accession | YP_019933 |
Protein GI | 47528584 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.675823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA ACTTAAAATT TACTCAAATG ATGATTGGGA TTAGTACGAT GGCATTATCA TTTGGAAGTA TTCAAACACA CGTATCAGCA GAAGAAACAG CATCCTATAA TATCTTACAG ATGAAACCAA TAGGGACAGA AACTTCAAAA GATGAAATTG TACATGCTAC AAAAGCGGAC GAAACTTTGA CTTTTGAAGA GCGTTTAAAA GTAGGAGATT TTTCACAACG TCCTACCCTG GTTATGGAAC GTGATGAAAT TCAATTAAAG CAAAGCTACA CTCTGGCAGA ACTGAATAAA ATGCCTAATA GCGAACTCAT TGATACATTA TCAAAAATTT CTTGGAATCA AATTACGGAT TTATTTCAAT TCAATCAAGA TACGAAAGCA TTTTATCAGA ATAAAGAGCG CATGAACGTT ATCATTAATG AATTAGGACA AAGAGGAAGG ACGTTTACGA AAGAAAACTC AAAGGGCATT GAAACATTTG TTGAAGTATT ACGTTCTGCT TTTTATGTGG GATATTATAA TAATGAATTA AGTTACTTAA AAGAGAGAAG CTTCCATGAA AAATGTTTAC CAGCATTGAA AGCGATTGCG AAAAATCCAA ACTTTACATT AGGTACAGCT GAGCAAGATA GAGTAGTAGC TGCGTATGGA AAATTAATTG GTAATGCCTC TAGTGATACT GAAACAGTAC AGTATGCGGT AAATGTTTTA AAACAATATA ATGATAATCT TACTACGTAT GTAAGCGATT ATGCGAAAGG ACAAGCTGTA TATGAAATTG TAAAAGGAAT TGATTATGAT ATACAGTCTT ATTTGCAAGA TACGAATAAG CAACCTAATG AAACAATGTG GTATGGAAAG ATTGATAACT TTATAAATGA GGTTAATAGA ATTGCTCTCG TGGGGAATAT AACAAATGAA AATAGTTGGC TAATTAATAA TGGCATTTAT TATGCAGGTC GTTTAGGGAA ATTTCATAGT AATCCATACA AAGGATTGGA AGTTATTACA CAAGCAATGA GCTTGTATCC TCGTCTAAGT GGACCTTATT TTGTAGCAGT AGAACAAATT AAAACAAACT ATGGTGGAAA AGATTATAGT GGAAAGGCAG TAGATCTACA GAAAATACGT GAAGAAGGGA AACGACAATA CTTACCTAAA ACATATACAT TTGATGACGG ATCAATTGTC TTCAAGACGG GAGATAAAGT AACAGAAGAA AAAATTAAGA GATTATATTG GGCAGCCAAA GAAGTAAAAG CGCAATATCA CCGTGTAATT GGTAATGATA AAGCACTAGA ACCGGGTAAC GCTGATGATG TACTAACAAT AGTAATTTAT AATAATCCAG ATGAATATCA ATTAAATAGA CAATTATATG GATATGAAAC AAACAACGGT GGAATTTATA TTGAAGAGAA GAGGACCTTC TTTACATATG AGCGTACGCC AAAGCAGAGT ATTTATAGTT TAGAAGAGTT ATTCCGTCAT GAATTCACTC ATTATTTACA AGGAAGGTAT GAGGTTCCTG GTTTATTTGG AAGCGGAGAA ATGTATCAAA ATGAACGATT AACTTGGTTC CAAGAAGGGA ATGCAGAATT TTTTGCAGGA TCTACACGTA CAAATAATGT TGTTCCACGT AAAAGTATGA TAAGTGGCTT ATCATCTGAT CCAGCAAGCC GTTATACAGC AAAGCAAACT TTGTTCTCAA AATATGGATC ATGGGACTTT TATAAGTATT CCTTTGCACT ACAGTCATAT TTGTATAATC ATCAATTTGA CACATTTGAT AAGCTTCAAG ATTTAATCCG TGTAAACGAT GTGAAAAATT ATGACTCATA TCGTGAATCA TTGAGCAACA ATACACAATT GAATGCAGAA TATCAAGCGT ATATGCAGCA GTTGATTGAT AATCAAGATA AATATAATGT ACCGCAAGTA ACAAATGATT ATTTAATTCA ACACGCACCA AAGCCGTTAG CTGAAGTGAA AAACGAAATT GTGGATGTAG CAAATATAAA AGATGCAAAA ATTACAAAAT ATGAGTCGCA ATTCTTTAAT ACATTTACCG TGGAAGGCAA GTACACAGGT GGTACATCAA AAGGTGAGTC TGAAGATTGG AAAACGATGA GTAAACAAGT AAATCGAACT TTGGAGCAGT TATCCCAAAA AGGTTGGAGT GGTTATAAAA CAGTTACAGC CTATTTTGTA AACTATCGTG TGAATGCAGC TAACCAGTTT GAATATGATA TTGTTTTTCA TGGTGTTGCA ACAGAGGAAA AGGAAAAAAC AAATACTATA GTAAATATGA ATGGACCATA CAGCGGGATA GTAAATGAAG AGATTCAATT TCATAGCGAT GGTACAAAAA GTGAAAATGG AAAAGTTACT TCTTATCTAT GGAACTTTGG AGATGGTACA ACAAGTACAG AAGCAAATCC TACCCATGTA TATGAAGAAA AAGGAACATA CACTGTGGAA CTAACTGTGA AAGATCGTAG AGGAAAAGAA AGCAAAGAAC AAACAAAAGT TACTGTAAAA CAAGATCCGC AAACAGGTGA ATTCCATGAA GAGGAGAAGG TACTCCTGTT TAATACGCTT GTAAAAGGAA ATCTGGTTAC TCCTGATCAA ACAGATGTTT ATACGTTTGA TGTTACAGAT ACAAAAGAAG TAGATATTTC TGTGGTAAAT GAACAAAATA TTGGGATGAC ATGGGTACTT TATCATGAAT CAGACATGCA AAATTATGTA GCTTGTGGTG AAGATGAAGG AAATGTTATA AAGGGGAAAT TCGAAGCGAA ACCAGGCAAA TATTATTTGA ATGTTTATAA ATTCGATGAT AAAAATGGTG AATATTCATT ATTGGTAAAA TAA
|
Protein sequence | MNKNLKFTQM MIGISTMALS FGSIQTHVSA EETASYNILQ MKPIGTETSK DEIVHATKAD ETLTFEERLK VGDFSQRPTL VMERDEIQLK QSYTLAELNK MPNSELIDTL SKISWNQITD LFQFNQDTKA FYQNKERMNV IINELGQRGR TFTKENSKGI ETFVEVLRSA FYVGYYNNEL SYLKERSFHE KCLPALKAIA KNPNFTLGTA EQDRVVAAYG KLIGNASSDT ETVQYAVNVL KQYNDNLTTY VSDYAKGQAV YEIVKGIDYD IQSYLQDTNK QPNETMWYGK IDNFINEVNR IALVGNITNE NSWLINNGIY YAGRLGKFHS NPYKGLEVIT QAMSLYPRLS GPYFVAVEQI KTNYGGKDYS GKAVDLQKIR EEGKRQYLPK TYTFDDGSIV FKTGDKVTEE KIKRLYWAAK EVKAQYHRVI GNDKALEPGN ADDVLTIVIY NNPDEYQLNR QLYGYETNNG GIYIEEKRTF FTYERTPKQS IYSLEELFRH EFTHYLQGRY EVPGLFGSGE MYQNERLTWF QEGNAEFFAG STRTNNVVPR KSMISGLSSD PASRYTAKQT LFSKYGSWDF YKYSFALQSY LYNHQFDTFD KLQDLIRVND VKNYDSYRES LSNNTQLNAE YQAYMQQLID NQDKYNVPQV TNDYLIQHAP KPLAEVKNEI VDVANIKDAK ITKYESQFFN TFTVEGKYTG GTSKGESEDW KTMSKQVNRT LEQLSQKGWS GYKTVTAYFV NYRVNAANQF EYDIVFHGVA TEEKEKTNTI VNMNGPYSGI VNEEIQFHSD GTKSENGKVT SYLWNFGDGT TSTEANPTHV YEEKGTYTVE LTVKDRRGKE SKEQTKVTVK QDPQTGEFHE EEKVLLFNTL VKGNLVTPDQ TDVYTFDVTD TKEVDISVVN EQNIGMTWVL YHESDMQNYV ACGEDEGNVI KGKFEAKPGK YYLNVYKFDD KNGEYSLLVK
|
| |