Gene Information |
Locus tag | GBAA_0555 |
Symbol | |
ID | 2817276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 558387 |
End bp | 561284 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637787540 |
Product | collagenase, putative |
Protein accession | YP_017177 |
Protein GI | 47525828 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
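The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `check_cds` and its return layout are illustrative and not part of the record.

```python
def check_cds(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Compare the reported lengths with values derived from the coordinates.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    Returns (span, codons, lengths_consistent).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = span // 3                    # whole codons, including the stop
    consistent = (
        span == gene_length_bp
        and span % 3 == 0                 # an unspliced CDS should be a multiple of 3
        and codons - 1 == protein_length_aa
    )
    return span, codons, consistent


# Values copied from the Gene Information fields above.
span, codons, consistent = check_cds(558387, 561284, 2898, 965)
print(span, codons, consistent)  # 2898 966 True
```

Under these assumptions, 561284 - 558387 + 1 gives 2898 bp, which is 966 codons, and dropping the stop codon leaves the 965 aa reported as the protein length.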
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAGAT CAATAAAGTG ATGCTTAGCA TTAGTACAAT GGCTCTATCA CTGGGGGCAA TTCAAACTCA TGTATCAGCA GAAGAAAAGG TGCCATATAA TGTATTACAT TCGAAACCAG TTGGAATTGA AAAACCAGTA GATGAGATTG GACACGTTTC TAAAGCGGAG GAAACATTAT CGTTTCAAGA ACGGCTAAAA GTAGGAGACT TTTCACAGCG ACCAGCATCT ATTACGAACA AAGTGACAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC AAAATGAATA ATCAAGAATT AGTTGAAACG TTAGGCAGTA TTAAGTGGCA TCAAATTACA GACTTATTCC AGTTTAACGA AGATGCAAAA GCCTTTTATA AAGATAAAGG GAAAATGCAA GTCGTTATAG ATGAATTAGC TCATCGGGGT AGTACATTTA CAAAGGATGA TTCAAAAGGA ATTCAAACGT TTACGGAAGT GTTACGTTCA GCTTTTTATC TGGCATTTTA TAATAACGAG TTAAGTGAAT TAAATGAAAG AAGCTTCCAG GACAAATGTT TACCTGCTTT AAAAGCAATC GCAAAAAATC CAAACTTTAA GCTTGGTACA ACTGAACAAG ATACAGTCGT ATCTGCATAC GGTAAATTAA TTAGTAATGC GTCAAGCGAT GTTGAAACAG TTCAATACGC ATCGAATATT TTAAAGCAAT ACAATGATAA TTTTACTACG TATGTAAATG ATCGAATGAA GGGACAAGCA ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CTTACTTAAT TGAAGCTCGT AAAGAAGCAA ATGAAACGAT GTGGTACGGG AAAGTAGATG GGTTTATTAA TGAAATAAAT CGTATCGCTC TTCTAAATGA AGTAACACAA GAAAATAAGT GGCTCGTTAA TAATGGAATT TATTTTGCAA GTCGTTTAGG GAAGTTTCAC AGTAATCCAA ATAAAGGTTT AGAAGTTGTT ACACAAGCAA TGCATATGTA TCCGCGCTTA AGTGAGCCGT ACTTTGTCGC AGTAGAACAA ATTACAACAA ATTATAATGG AAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTTGATGA TGGATCTATC GTATTTAAAA CAGGAGATAA AGTATCGGAA GAAAAAATTA AGAGACTATA CTGGGCTGCA AAGGAAGTAA AAGCACAGTA TCATCGTGTA ATTGGGAATG ACAAAGCGTT AGAGCCGGGC AATGCAGATG ATATATTAAC GATAGTAATT TATAACAGTC CAGAAGAGTA CCAGTTAAAT AGACAACTGT ATGGATACGA AACAAATAAC GGTGGAATTT ATATTGAAGA AACAGGAACA TTCTTTACAT ATGAGCGCAC GCCAGAACAA AGCATTTATA GTTTAGAAGA GTTATTCCGT CATGAGTTTA CTCATTATCT TCAAGGAAGA TATGAAGTGC CAGGTTTATT CGGAAGAGGA GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GCAATGCAGA GTTTTTCGCA GGGTCTACTC GAACAAATAA TGTAGTGCCG AGAAAGAGCA TCATTAGTGG ATTATCATCT GATCCTGCAA GCCGTTATAC AGCAGAGCGC ACATTATTTG CTAAATATGG TTCTTGGGAT TTCTATAATT ACTCGTTCGC ATTGCAATCT TACTTATATA CCCATCAATT TGAAACGTTT 
GATAAAATTC AAGATTTAAT TCGTGCAAAT GACGTAAAAA ATTATGATGC CTATCGTGAG AATTTAAGTA AAGACCTTAA ACTAAACGAA GAGTATCAAG AGTATATGCA GCACCTAATC GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATACC CCAAAATCAT TAACGGCAGT AGAGAAAGAA ATTACTGAAA CGTTGCCGAT GAAAGATGCA AAAATGACAA AACATAGCTC CCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA GGTAGTGTAA CAAAAGGTGA TTCAGAAGAT TGGAACGCAA TGAGTAAGAA AGTAAATGAA GCTTTGGAAC AACTTGCGCA AAAAGAATGG AGTGGCTATA AAACTGTTAC AGCATATTTT GTCAATTACG GTGTAAATAG CTCAAATCAA TTTGAATATG ATGTAGTCTT CCACGGTATC GCAAAAGATG ACGAAGAAAA TAAAGCTCCA ACGGTTAATA TAAATGGCCC TTATAATGGA CTTGTAAAAG AAGGTATTCA ATTTAAAAGT GACGGCTCAA AAGATGAAGA TGGAAAAATC GTTTCTTATT TATGGGACTT TGGAGATGGA AGCACAAGTG CAGAAGTAAA TCCGGTACAT GTATATGAAA GCGAAGGTTC ATATAAAGTA GCGTTAATAG TAAAAGATGA TAAAGGAAAA GAGAGCAAAA GCGAAATAAC GGTTACGGTT AAAGGTGGAA GTTTAACAGA ATCAGAACCA AATAATCGCC CAGAGGAAGC AAATCGTATT GGACTAAACA CTACTATAAA AGGTAGTCTT ATCGGCGGGG ATCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AAATATTGAT ATTTCCGTTT TAAATGAATA TGGAATCGGG ATGACATGGG TACTTCACCA TGAATCAGAT ATGCAAAATT ATGCTGCTTA CGGTCAAGCA AATGGAAATC ATATAGAGGC AAACTTTAAT GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGTGA TGGAACATAC GAATTATCAG TAAAATAA
Protein sequence | MNKKSKINKV MLSISTMALS LGAIQTHVSA EEKVPYNVLH SKPVGIEKPV DEIGHVSKAE ETLSFQERLK VGDFSQRPAS ITNKVTVKQV KESYSMADLN KMNNQELVET LGSIKWHQIT DLFQFNEDAK AFYKDKGKMQ VVIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE LSELNERSFQ DKCLPALKAI AKNPNFKLGT TEQDTVVSAY GKLISNASSD VETVQYASNI LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLIEAR KEANETMWYG KVDGFINEIN RIALLNEVTQ ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA KEVKAQYHRV IGNDKALEPG NADDILTIVI YNSPEEYQLN RQLYGYETNN GGIYIEETGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF DKIQDLIRAN DVKNYDAYRE NLSKDLKLNE EYQEYMQHLI DNQDKYNVPE VADDYLAEHT PKSLTAVEKE ITETLPMKDA KMTKHSSQFF NTFTLEGTYT GSVTKGDSED WNAMSKKVNE ALEQLAQKEW SGYKTVTAYF VNYGVNSSNQ FEYDVVFHGI AKDDEENKAP TVNINGPYNG LVKEGIQFKS DGSKDEDGKI VSYLWDFGDG STSAEVNPVH VYESEGSYKV ALIVKDDKGK ESKSEITVTV KGGSLTESEP NNRPEEANRI GLNTTIKGSL IGGDHTDVYT FNVASAKNID ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEANFN AKPGKYYLYV YKYDNGDGTY ELSVK
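As an illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last blocks of the gene sequence shown above. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons, which this sketch does not handle; the gene here begins with ATG). The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, not part of any database tool.

```python
# Codon table in TCAG order; amino acid assignments of the standard code,
# assumed here to match translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]    # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_30 = "ATGAACAAGA AATCAAAGAT CAATAAAGTG"   # start of the gene sequence
last_18 = "GAATTATCAG TAAAATAA"                # end of the gene sequence
print(translate(first_30))  # MNKKSKINKV
print(translate(last_18))   # ELSVK
```

The two outputs match the first ten and last five residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare with the reported GC content.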