Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3584 |
Symbol | |
ID | 2814484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 3288707 |
End bp | 3291622 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637790325 |
Product | collagenase |
Protein accession | YP_020218 |
Protein GI | 47528869 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.629747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGCTTC ATATCGGAAT GTGCTCAAAA TGGAACCAGT TGGTGTACAA TTACCAGTAC AAGAATTAGC TCATTCATCA AAAGTACTTG AAAATAAGTC TTTTGAGAAG AGGTTACAAT TTGCTGATTT GTCGCAAAGA CCGCCTGAAT TGAAAAAGGA GAGTAAGCAA TTAGCTACAG CAAAAACTTA TACAATTGCT GAGTTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTGTAACAAT CGATTGGGAG CAAATTACTG GGCTATTTCA GTTTAACAAG GATAGTCTTG CCTTCTATCA AAATGATAGT AGGATACAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGATTATAC GAAAGATGAT TCCAAAGGGA TTGAAACTTT AGTAGAGGTA TTACGATCAG GATTTTATTT AGGGTTTTAT CATACAGAAT TAAGTAAACT AAATGAGCGG AGCTATCATG ATAAATGCTT ACCTGCATTA AAAACGATTG CGAATAACGC GAATTTCAAA CTCGGTACGT TAGAACAAAA TAGAGTTGTA TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT GGGTAGATAA TCTTTCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGC GTTGACTACG ATATTCAATC GTATTTGTAC GATACGAGAA AAGCACCGAA AGATACAGTA TGGTATCAAA AAATTGATAG CTATATTAAT GAATTAAGTC GTTTTGCTTT AATTGGAACG GTGACAGAGA AGAATGGTTG GCTTATTAAT AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGGAC GAAAGGGTTG CAAGTTGTAA CAGATGCCAT GAAAATGTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCG GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCAA ATGGAAACGT TGTGAATTTA GATCAAATAC GAGAAGATGG TAAGAAGAAA TATTTACCGA AAACGTATAC ATTTGACGAT GGGACAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTCTATAT TGGGCGGCAA AAGAAGTGAA GGCTCAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA GAAAAAGGGA ATGCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT CAATTTAACC GTCAATTGTA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAGGAA TTGTTCCGGC ACGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGACAAGGTA AGATTTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG CAATGCAGAG TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA ATATCTTCAA ATCCGGCAGA ACGTTATACG GCAGAGAGAA CGTTAAATGC AAAGTACGGA ACATGGGATT TTTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT GATATGTTTG ACAAAGTTCA TGATCTTATT AGAAAAAATG ATGTAACAGC ATATGATGCA TATCGCTCTG CATTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA ATGTTAGTCG ACAATCGTGA TAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCA ACTCACGCAC CGAAACCAGT CTCAGATATT GTGGCAGAAA TTACGGCAGA AGCGAAATTA AGTAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTC ATACATTTAC ACTGCAAGGA ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAATCAAT TACACAAAAC GTAAATGATA CGTTAAAACG TTTAAGTGCA AAAGAATGGA CAGGCTATAA AACAGTAACA GCTTATTTCG TAAATTACCG TGTGAATGCA TCAGGACAAT TTGAATATGA CGTTGTATTC CATGGTATGA ATACAGAAGA AGGCGCTGTG AATAAAGCAC CAGTTGCGGT TATAAATGGT CCCTATAGTG GGAATGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG CAATGAACAA AATCCAACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAGATTAAC AGTAACAGAT GATAAAGGGT TAACGAATAC TGTTACAACG AATGTAACAG TTCAAAAGAA AGAAGATAAC AGTGTAGAAA AAGAACCAAA CAATTCATTC CAGACAGCAA ATACACTGCA ATTCAATCAA GTTTTACGCG CAAGTTTAGG AAATGGTGAT ACGAGTGATT TCTTTGAAAT AAATGTGGAA ACGGCGAAAA ATCTGCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT CTTTATTCGG AAGCAGATTT AAATAACTAT ATTACGTATG CCCAGCAAGA GGGGAATAAG TTAGTAGGAA GTTACTACAC GTATCCAGGT AAGTATTATT TACATGTGTA TCAGTATGGT GGTGGATTTG GGAATTATAC GGTAGAAGTG AAGTAG
|
Protein sequence | MKGYSKKVLV GVSFASLMLG SFQGGALAEG TKGEQASYRN VLKMEPVGVQ LPVQELAHSS KVLENKSFEK RLQFADLSQR PPELKKESKQ LATAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RIQAIIDKLK QQGQDYTKDD SKGIETLVEV LRSGFYLGFY HTELSKLNER SYHDKCLPAL KTIANNANFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTWVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN ELSRFALIGT VTEKNGWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA AEQIATNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GTIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNADDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNRDKYN VPLVSDDYLA THAPKPVSDI VAEITAEAKL SNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKSITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGMNTEEGAV NKAPVAVING PYSGNVNEAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNEQ NPTHVYTKEG TYTARLTVTD DKGLTNTVTT NVTVQKKEDN SVEKEPNNSF QTANTLQFNQ VLRASLGNGD TSDFFEINVE TAKNLQINVT KENNIGVNWV LYSEADLNNY ITYAQQEGNK LVGSYYTYPG KYYLHVYQYG GGFGNYTVEV K
|
| |