Gene GBAA_3584 details


Gene Information

Locus tag: GBAA_3584
Symbol:
ID: 2814484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3288707
End bp: 3291622
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 36%
IMG OID: 637790325
Product: collagenase
Protein accession: YP_020218
Protein GI: 47528869
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
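
The coordinates and lengths in this record agree with each other, which can be checked with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3288707
END_BP = 3291622


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the 2916 bp gene length and 971 aa protein length listed in the record.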


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.629747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG 
AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGCTTC ATATCGGAAT
GTGCTCAAAA TGGAACCAGT TGGTGTACAA TTACCAGTAC AAGAATTAGC TCATTCATCA
AAAGTACTTG AAAATAAGTC TTTTGAGAAG AGGTTACAAT TTGCTGATTT GTCGCAAAGA
CCGCCTGAAT TGAAAAAGGA GAGTAAGCAA TTAGCTACAG CAAAAACTTA TACAATTGCT
GAGTTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTGTAACAAT CGATTGGGAG
CAAATTACTG GGCTATTTCA GTTTAACAAG GATAGTCTTG CCTTCTATCA AAATGATAGT
AGGATACAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGATTATAC GAAAGATGAT
TCCAAAGGGA TTGAAACTTT AGTAGAGGTA TTACGATCAG GATTTTATTT AGGGTTTTAT
CATACAGAAT TAAGTAAACT AAATGAGCGG AGCTATCATG ATAAATGCTT ACCTGCATTA
AAAACGATTG CGAATAACGC GAATTTCAAA CTCGGTACGT TAGAACAAAA TAGAGTTGTA
TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT
GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT GGGTAGATAA TCTTTCAGCT
GGAAATGCGA TTTACGATAT TATGCAAGGC GTTGACTACG ATATTCAATC GTATTTGTAC
GATACGAGAA AAGCACCGAA AGATACAGTA TGGTATCAAA AAATTGATAG CTATATTAAT
GAATTAAGTC GTTTTGCTTT AATTGGAACG GTGACAGAGA AGAATGGTTG GCTTATTAAT
AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGGAC GAAAGGGTTG
CAAGTTGTAA CAGATGCCAT GAAAATGTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCG
GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCAA ATGGAAACGT TGTGAATTTA
GATCAAATAC GAGAAGATGG TAAGAAGAAA TATTTACCGA AAACGTATAC ATTTGACGAT
GGGACAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTCTATAT
TGGGCGGCAA AAGAAGTGAA GGCTCAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA
GAAAAAGGGA ATGCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT
CAATTTAACC GTCAATTGTA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA
ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAGGAA
TTGTTCCGGC ACGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG
GGACAAGGTA AGATTTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG CAATGCAGAG
TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA
ATATCTTCAA ATCCGGCAGA ACGTTATACG GCAGAGAGAA CGTTAAATGC AAAGTACGGA
ACATGGGATT TTTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT
GATATGTTTG ACAAAGTTCA TGATCTTATT AGAAAAAATG ATGTAACAGC ATATGATGCA
TATCGCTCTG CATTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA
ATGTTAGTCG ACAATCGTGA TAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCA
ACTCACGCAC CGAAACCAGT CTCAGATATT GTGGCAGAAA TTACGGCAGA AGCGAAATTA
AGTAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTC ATACATTTAC ACTGCAAGGA
ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAATCAAT TACACAAAAC
GTAAATGATA CGTTAAAACG TTTAAGTGCA AAAGAATGGA CAGGCTATAA AACAGTAACA
GCTTATTTCG TAAATTACCG TGTGAATGCA TCAGGACAAT TTGAATATGA CGTTGTATTC
CATGGTATGA ATACAGAAGA AGGCGCTGTG AATAAAGCAC CAGTTGCGGT TATAAATGGT
CCCTATAGTG GGAATGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA
GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG CAATGAACAA
AATCCAACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAGATTAAC AGTAACAGAT
GATAAAGGGT TAACGAATAC TGTTACAACG AATGTAACAG TTCAAAAGAA AGAAGATAAC
AGTGTAGAAA AAGAACCAAA CAATTCATTC CAGACAGCAA ATACACTGCA ATTCAATCAA
GTTTTACGCG CAAGTTTAGG AAATGGTGAT ACGAGTGATT TCTTTGAAAT AAATGTGGAA
ACGGCGAAAA ATCTGCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT
CTTTATTCGG AAGCAGATTT AAATAACTAT ATTACGTATG CCCAGCAAGA GGGGAATAAG
TTAGTAGGAA GTTACTACAC GTATCCAGGT AAGTATTATT TACATGTGTA TCAGTATGGT
GGTGGATTTG GGAATTATAC GGTAGAAGTG AAGTAG
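
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how the blocks could be joined and the GC fraction computed; the helper names are hypothetical and are not part of the database. The completeness check is a simplification: it accepts only ATG as the start codon, although translation table 11 also allows alternative start codons. Only the first line of the sequence is pasted in as sample input, so the printed GC fraction describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at the end."""
    stop_codons = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stop_codons


# Sample input: the first line of the gene sequence shown above.
FIRST_LINE = (
    "ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG"
)

sample = clean_sequence(FIRST_LINE)
print(len(sample), round(gc_content(sample), 3))
```

Pasting the full gene sequence into `clean_sequence` in place of `FIRST_LINE` would allow the result to be compared against the GC content field of the record and passed to `looks_like_complete_cds`.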
 
Protein sequence
MKGYSKKVLV GVSFASLMLG SFQGGALAEG TKGEQASYRN VLKMEPVGVQ LPVQELAHSS 
KVLENKSFEK RLQFADLSQR PPELKKESKQ LATAKTYTIA ELNQLSNQQL VDLLVTIDWE
QITGLFQFNK DSLAFYQNDS RIQAIIDKLK QQGQDYTKDD SKGIETLVEV LRSGFYLGFY
HTELSKLNER SYHDKCLPAL KTIANNANFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA
AKIFKQYNDN FSTWVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN
ELSRFALIGT VTEKNGWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA
AEQIATNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GTIVLKAGDK VTEEKVKRLY
WAAKEVKAQF HRTVESDQPL EKGNADDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG
TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE
FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY
DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNRDKYN VPLVSDDYLA
THAPKPVSDI VAEITAEAKL SNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKSITQN
VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGMNTEEGAV NKAPVAVING
PYSGNVNEAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNEQ NPTHVYTKEG TYTARLTVTD
DKGLTNTVTT NVTVQKKEDN SVEKEPNNSF QTANTLQFNQ VLRASLGNGD TSDFFEINVE
TAKNLQINVT KENNIGVNWV LYSEADLNNY ITYAQQEGNK LVGSYYTYPG KYYLHVYQYG
GGFGNYTVEV K