Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B2998 |
Symbol | |
ID | 7184841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 2184317 |
End bp | 2187406 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643550053 |
Product | putative exonuclease |
Protein accession | YP_002445723 |
Protein GI | 218897312 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000658197 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.00861967 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGACCGA TTCAGCTTAT TATGACAGCA TTCGGGCCAT ATAAACAGAA AGAAGTAATC GATTTTGAAG ATCTTGGAGA ACATCGTATT TTCGCTATTT CTGGGAATAC GGGAGCTGGA AAAACAACGA TTTTTGATGC GATTTGTTAT GTATTATATG GTGAGGCTAG CGGAGAAGAA CGTAGTGATA CGAGCATGCT TCGAAGTCAA TTTGCTGATG ATAATGTTTA TACAAGTGTT GAACTAACCT TTCAGTTAAA AGGGAAACGC TATGAAATAA AACGACAACT TGGACATAAA AAACAAGGGA ATAAGACTAT TACAGGACAT GCAGTAGAAT TATATGAAGT TATTGATGGA GAAAACGTTC CAGCCGTCGA TCGTTTTCAT GTAACAGACG TAAACAAAAA AATGGAAGAT TTAATTGGGT TAAGTAAACA TCAATTTAGC CAAATTGTTA TGTTACCCCA AGGGGAATTC CGAAAACTAT TAACATCTGA GACGGAAAAC AAAGAAGAGA TTTTACGTCG TATTTTTAAA ACGGATCGTT ATAAATTAAT GCGTGAGTTA CTTGATCAAA AGAGAAAACA ATGGAAAGAC GTCTTGCAAG AAAAACAAAA AGAGAGAGAG CTCTATTTCC GTAATGTTTT TAAATTACCA ATCCGTGATG GAGCATTATT AGAGACATTA GTGGCACAAG AACATGTAAA TACGCATCAA GTAGTAGAAG CGTTAGAACA AGAAACAACC GTGTATAAGG CAGAGGTTGA GCAATTACAA GTAGAACAAG ATGTTCAAAC GAAGCAATTA AAGGATGCAG AAACACGTTT TCATGCAGCA AAATCTGTAA ATGAGAAATT TATAGATTTA CAACAAAAGA ATGAGAAATA TAACACTTTA CAAGAGAACC GTGCAGCAAT TGAAGGGAAA GAAAAATCCT TTAAACGTGC GGAACAAGCA AAACGCTTAT TACCATTTGA ACAATGGTAT GAAGAAGCGA TGCAAAATGA GCAGAAAGCT GAAAGTTTGT TAAAACAGAT AATCGTCAAA CAAGAACAGA TAATGAATAG TTTTGAGCTT GCTCAGGAGA AGTATGAAGT AGTAAAGAAT AAAGAACTTG AACGAGAAGA GGCTAAAAAA CTTGTTCAAA GATTAGAAGA GTTACAAGCG ATCATTGAAT CATTAGCTGA GAGAAAGCTG AATTTACAAA ATGCCGAAAT TCAAATAGGG AAATTAAAAG AAAGTATGCA AAAGTTGGAT CAACAACTAG AAGAGCATAC AAGTCAAAAA CAGGGAATGT CTGATGAATT GCAGCAATTA GAACAAGCAC TTGAGCAATA TGTAGCTAAA GTAGAAGAAC TAACCAATAT GCGAGAAGAT GCAAAAGTTT TAAAGCAAGC ATATGATGTT TGGCAAGAAA AACAAAAATT TGAGCAAGAA AAAGAAGCGG CAAATAATAA AATGCAAGTG GCAGTTCGTG CGTATGAAAA CATGGAGCGC CGCTGGTTAA GTGAACAAGC TGGTATACTA GCTCTTCATT TACATGATGG TGAATCTTGT CCGGTATGTG GTAGTACGAA TCACCCACAA AAAGCTACAG AGCAAAGTAA TGCGATCGAT GAAAAAGAGT TAAACGATTT AAGAGATAAG AAGAACATTG CTGAAAAATT AAATGTGCAA GTAGAGGAGA AATGGAATTT TTATCGACTA CAATATGAAC AAGTGATAGA AGAAGTTGTG AAGCGCGGCT ACAACTCGGA AAAATTAGCT GAAACATATA GTGCACTTGT TCAAAAAGGA AAACAATTAG CAGCAAACGT GAATACGTTA AAAGCAAGTG AAGAAACACG TAAGCAGATT GCTGTGAACA TGAAAAGCGT AGAAGAAAAA ATAGAAGAAC TTCAGAAACA AAAACGGGAA GTAGAAACAA TGCAGCACCG TACAGAAATG GAGTGCATGC AACTTCGTAC GTCATATGAA CATGATAAAC GAAATATTCC AGAAAACTTA CAAACAGTAC AAGCTTGGAA AGCACAGTTG GACCAAGCTA TGCAGGAACT TAGGTTAATG GAGGACGAAT GGAAGAAAGT GCAGGAAGCG TATCAGCATT GGCAAAATGA AAATATACGT ATTCAAGCTG AACAGAAAGG TGCTTCTAAT CAATTTGAAA GTGCAAAATT GAAGAAAGAA GAAACTTTCG CACGCTTTAT GAAAGAGCTA GAACAAAGCG GGTTTACAGA TCAATTCACG TATAAAGAAG CCAAATTAAG CGATGCCCAG ATGGAGATGT TACAAAAAGA AATTCAAGGT TATTATTCAT CTCTCGAAGT GCTTGCAAAA CAAATTGAAG AGTTAAACTC AGAATTAAAA GATAAAGAGT ATATGGATAT TACATCTTTA GGTGAACACA TAAAAGAACT AGAAATTAAT CTCGATATAA TTAAAGAAAA ACGTCAACGT GCGCAAAATG CGGTAACATA TATTTCTGAT TTACATGAAA ACATTAGACG TATTGACGAA CAAATTCACG ATGAAGAGAA AGCTTTCCAA GAACTTGTTG ATCTATATGA AGTAATGAAA GGTGATAACG AAAGTCGTAT ATCATTTGAA CGTTACATCC TAATTGAGTA TTTAGAACAA ATTGTTCAAA TCGCAAACGA ACGACTACGT AAATTATCAA ATGGACAATT TTATTTAAAA CGAAGTGAGC GAGTTGAAAA AAGAAACCGT CAAAGTGGAT TAGGATTAGA TGTGTACGAT GCATACACTG GCCAAACACG TGATGTAAAA ACATTATCTG GCGGAGAGAA ATTTAACGCA TCACTTTGCT TAGCGCTAGG AATGGCAGAT GTAATTCAAG CATATGAAGG TGGTATTTCC ATCGAAACGA TGTTCATCGA TGAAGGGTTC GGTTCATTAG ATGAAGAATC ATTAACGAAA GCAGTAGATG CCTTAATTGA CTTACAAAAA TCAGGCCGAT TTATCGGTGT CATTTCACAC GTACAAGAGC TGAAAAACGC AATGCCGGCT GTATTAGAAG TAACGAAGCA GAAGGATGGG TGTAGTCAGA CTAGGTTTGT GGTGAAGTAA
|
Protein sequence | MRPIQLIMTA FGPYKQKEVI DFEDLGEHRI FAISGNTGAG KTTIFDAICY VLYGEASGEE RSDTSMLRSQ FADDNVYTSV ELTFQLKGKR YEIKRQLGHK KQGNKTITGH AVELYEVIDG ENVPAVDRFH VTDVNKKMED LIGLSKHQFS QIVMLPQGEF RKLLTSETEN KEEILRRIFK TDRYKLMREL LDQKRKQWKD VLQEKQKERE LYFRNVFKLP IRDGALLETL VAQEHVNTHQ VVEALEQETT VYKAEVEQLQ VEQDVQTKQL KDAETRFHAA KSVNEKFIDL QQKNEKYNTL QENRAAIEGK EKSFKRAEQA KRLLPFEQWY EEAMQNEQKA ESLLKQIIVK QEQIMNSFEL AQEKYEVVKN KELEREEAKK LVQRLEELQA IIESLAERKL NLQNAEIQIG KLKESMQKLD QQLEEHTSQK QGMSDELQQL EQALEQYVAK VEELTNMRED AKVLKQAYDV WQEKQKFEQE KEAANNKMQV AVRAYENMER RWLSEQAGIL ALHLHDGESC PVCGSTNHPQ KATEQSNAID EKELNDLRDK KNIAEKLNVQ VEEKWNFYRL QYEQVIEEVV KRGYNSEKLA ETYSALVQKG KQLAANVNTL KASEETRKQI AVNMKSVEEK IEELQKQKRE VETMQHRTEM ECMQLRTSYE HDKRNIPENL QTVQAWKAQL DQAMQELRLM EDEWKKVQEA YQHWQNENIR IQAEQKGASN QFESAKLKKE ETFARFMKEL EQSGFTDQFT YKEAKLSDAQ MEMLQKEIQG YYSSLEVLAK QIEELNSELK DKEYMDITSL GEHIKELEIN LDIIKEKRQR AQNAVTYISD LHENIRRIDE QIHDEEKAFQ ELVDLYEVMK GDNESRISFE RYILIEYLEQ IVQIANERLR KLSNGQFYLK RSERVEKRNR QSGLGLDVYD AYTGQTRDVK TLSGGEKFNA SLCLALGMAD VIQAYEGGIS IETMFIDEGF GSLDEESLTK AVDALIDLQK SGRFIGVISH VQELKNAMPA VLEVTKQKDG CSQTRFVVK
|
| |