Gene GBAA_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_0555 
Symbol 
ID2817276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp558387 
End bp561284 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content34% 
IMG OID637787540 
Productcollagenase,putative 
Protein accessionYP_017177 
Protein GI47525828 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AATCAAAGAT CAATAAAGTG ATGCTTAGCA TTAGTACAAT GGCTCTATCA 
CTGGGGGCAA TTCAAACTCA TGTATCAGCA GAAGAAAAGG TGCCATATAA TGTATTACAT
TCGAAACCAG TTGGAATTGA AAAACCAGTA GATGAGATTG GACACGTTTC TAAAGCGGAG
GAAACATTAT CGTTTCAAGA ACGGCTAAAA GTAGGAGACT TTTCACAGCG ACCAGCATCT
ATTACGAACA AAGTGACAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC
AAAATGAATA ATCAAGAATT AGTTGAAACG TTAGGCAGTA TTAAGTGGCA TCAAATTACA
GACTTATTCC AGTTTAACGA AGATGCAAAA GCCTTTTATA AAGATAAAGG GAAAATGCAA
GTCGTTATAG ATGAATTAGC TCATCGGGGT AGTACATTTA CAAAGGATGA TTCAAAAGGA
ATTCAAACGT TTACGGAAGT GTTACGTTCA GCTTTTTATC TGGCATTTTA TAATAACGAG
TTAAGTGAAT TAAATGAAAG AAGCTTCCAG GACAAATGTT TACCTGCTTT AAAAGCAATC
GCAAAAAATC CAAACTTTAA GCTTGGTACA ACTGAACAAG ATACAGTCGT ATCTGCATAC
GGTAAATTAA TTAGTAATGC GTCAAGCGAT GTTGAAACAG TTCAATACGC ATCGAATATT
TTAAAGCAAT ACAATGATAA TTTTACTACG TATGTAAATG ATCGAATGAA GGGACAAGCA
ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CTTACTTAAT TGAAGCTCGT
AAAGAAGCAA ATGAAACGAT GTGGTACGGG AAAGTAGATG GGTTTATTAA TGAAATAAAT
CGTATCGCTC TTCTAAATGA AGTAACACAA GAAAATAAGT GGCTCGTTAA TAATGGAATT
TATTTTGCAA GTCGTTTAGG GAAGTTTCAC AGTAATCCAA ATAAAGGTTT AGAAGTTGTT
ACACAAGCAA TGCATATGTA TCCGCGCTTA AGTGAGCCGT ACTTTGTCGC AGTAGAACAA
ATTACAACAA ATTATAATGG AAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA
CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTTGATGA TGGATCTATC
GTATTTAAAA CAGGAGATAA AGTATCGGAA GAAAAAATTA AGAGACTATA CTGGGCTGCA
AAGGAAGTAA AAGCACAGTA TCATCGTGTA ATTGGGAATG ACAAAGCGTT AGAGCCGGGC
AATGCAGATG ATATATTAAC GATAGTAATT TATAACAGTC CAGAAGAGTA CCAGTTAAAT
AGACAACTGT ATGGATACGA AACAAATAAC GGTGGAATTT ATATTGAAGA AACAGGAACA
TTCTTTACAT ATGAGCGCAC GCCAGAACAA AGCATTTATA GTTTAGAAGA GTTATTCCGT
CATGAGTTTA CTCATTATCT TCAAGGAAGA TATGAAGTGC CAGGTTTATT CGGAAGAGGA
GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GCAATGCAGA GTTTTTCGCA
GGGTCTACTC GAACAAATAA TGTAGTGCCG AGAAAGAGCA TCATTAGTGG ATTATCATCT
GATCCTGCAA GCCGTTATAC AGCAGAGCGC ACATTATTTG CTAAATATGG TTCTTGGGAT
TTCTATAATT ACTCGTTCGC ATTGCAATCT TACTTATATA CCCATCAATT TGAAACGTTT
GATAAAATTC AAGATTTAAT TCGTGCAAAT GACGTAAAAA ATTATGATGC CTATCGTGAG
AATTTAAGTA AAGACCTTAA ACTAAACGAA GAGTATCAAG AGTATATGCA GCACCTAATC
GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATACC
CCAAAATCAT TAACGGCAGT AGAGAAAGAA ATTACTGAAA CGTTGCCGAT GAAAGATGCA
AAAATGACAA AACATAGCTC CCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA
GGTAGTGTAA CAAAAGGTGA TTCAGAAGAT TGGAACGCAA TGAGTAAGAA AGTAAATGAA
GCTTTGGAAC AACTTGCGCA AAAAGAATGG AGTGGCTATA AAACTGTTAC AGCATATTTT
GTCAATTACG GTGTAAATAG CTCAAATCAA TTTGAATATG ATGTAGTCTT CCACGGTATC
GCAAAAGATG ACGAAGAAAA TAAAGCTCCA ACGGTTAATA TAAATGGCCC TTATAATGGA
CTTGTAAAAG AAGGTATTCA ATTTAAAAGT GACGGCTCAA AAGATGAAGA TGGAAAAATC
GTTTCTTATT TATGGGACTT TGGAGATGGA AGCACAAGTG CAGAAGTAAA TCCGGTACAT
GTATATGAAA GCGAAGGTTC ATATAAAGTA GCGTTAATAG TAAAAGATGA TAAAGGAAAA
GAGAGCAAAA GCGAAATAAC GGTTACGGTT AAAGGTGGAA GTTTAACAGA ATCAGAACCA
AATAATCGCC CAGAGGAAGC AAATCGTATT GGACTAAACA CTACTATAAA AGGTAGTCTT
ATCGGCGGGG ATCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AAATATTGAT
ATTTCCGTTT TAAATGAATA TGGAATCGGG ATGACATGGG TACTTCACCA TGAATCAGAT
ATGCAAAATT ATGCTGCTTA CGGTCAAGCA AATGGAAATC ATATAGAGGC AAACTTTAAT
GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGTGA TGGAACATAC
GAATTATCAG TAAAATAA
 
Protein sequence
MNKKSKINKV MLSISTMALS LGAIQTHVSA EEKVPYNVLH SKPVGIEKPV DEIGHVSKAE 
ETLSFQERLK VGDFSQRPAS ITNKVTVKQV KESYSMADLN KMNNQELVET LGSIKWHQIT
DLFQFNEDAK AFYKDKGKMQ VVIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE
LSELNERSFQ DKCLPALKAI AKNPNFKLGT TEQDTVVSAY GKLISNASSD VETVQYASNI
LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLIEAR KEANETMWYG KVDGFINEIN
RIALLNEVTQ ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ
ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA
KEVKAQYHRV IGNDKALEPG NADDILTIVI YNSPEEYQLN RQLYGYETNN GGIYIEETGT
FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA
GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF
DKIQDLIRAN DVKNYDAYRE NLSKDLKLNE EYQEYMQHLI DNQDKYNVPE VADDYLAEHT
PKSLTAVEKE ITETLPMKDA KMTKHSSQFF NTFTLEGTYT GSVTKGDSED WNAMSKKVNE
ALEQLAQKEW SGYKTVTAYF VNYGVNSSNQ FEYDVVFHGI AKDDEENKAP TVNINGPYNG
LVKEGIQFKS DGSKDEDGKI VSYLWDFGDG STSAEVNPVH VYESEGSYKV ALIVKDDKGK
ESKSEITVTV KGGSLTESEP NNRPEEANRI GLNTTIKGSL IGGDHTDVYT FNVASAKNID
ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEANFN AKPGKYYLYV YKYDNGDGTY
ELSVK