Gene Information |
Locus tag | GWCH70_0673 |
Symbol | |
ID | 7978856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 739981 |
End bp | 743331 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644797658 |
Product | SMC domain protein |
Protein accession | YP_002948832 |
Protein GI | 239826208 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
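The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(739981, 743331)
print(length)                     # 3351
print(protein_length_aa(length))  # 1116
```

Both results agree with the Gene Length (3351 bp) and Protein Length (1116 aa) fields listed for GWCH70_0673.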
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00700495 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCCGA TTTCATTAAC GATAGCGGGT TTGCACAGTT TTCGCGAGAA ACAAACGATT GATTTTCAAT CATTATGCGA AGGTGGTGTC TTTGGCATTT TCGGCCCAAC TGGAAGCGGG AAATCAACGA TTTTAGACGC GATTACACTT GCGTTATTTG GCAGTGTGGA ACGTGCGCCA AACCATACGC AAGGAATTAT GAATCATGCG GAAAATGAAC TGTTCGTTTC TTTTACGTTT GAATTAGAAA ATGCCACATG CACGAAGCGA TATACGGTAG AGCGCAGCTT TAAACGTGGC GATGAATGGC GGTTAAAAAG CGGAATATGC CGGCTTATCG AAGTTGGCGC AGAAACCGTT GTGTTGGCGG ATAAGTTGAC GGAAGTCAAT AAAGCGATAG AGCAATTGCT TGGTTTAACG ATGAAAGATT TTACGCGCGC CGTCGTGCTG CCGCAAGGGA AATTCGCCGA ATTTTTATCA TTGAAAGGCG CCGAACGACG GCAGATGCTA CAGCGCCTTT TTCATTTGGA ACCATATGGA GACAAGTTAA ATAAAAAATT GAAAGAAAAA CTCGCTGCCA TTTCTAATGA ATTAAATGAA GTGATCGCTG AGAAGACAGG TTTAGGAGAT GCATCGGAAG AAGCGCTTGA ACGCGCGAAA CAAGAGTTAG AGACACTTTG GGATTTACTG CAAAAGCGAA AAGCAGAATT ACATGATATG GAAGTAACGA TGGAACGAAC GAAACAGTTA TGGGCATGGC AAAGGGAAAA AGAAGAACTT GAAGCCGAAC TAGCGCGCCT TGCCAGTGAA GAATCACATA TTCGCTTATT AGAGAATAGA AAAGAACGCG CGGAACAAGC GGAACGAATG TGGCCATATC TCGAACAGTA TGAAGAAGCG CGCCGGTTTG TGATCAATGC AGAGGAGAAA CAAAAGGAAT TACAGAAAAA ACTCGTACAA GCTAAAGAAC TTTATGAAAA AGCGACGCAT CGTTATGAAC GTATTCGGCA AGAAAAGGCC GCTTCCGAAC CTCATTTATT AGCAAAAAAA GAACAGCTCA CCCAAGCAAA ACAGTTAGCG GCTCAAATCC ATGCGTTAGA AACAGAATTA AACGAAATGA GGAAACAAAT TCATTTGCTA GAAGAAGAAG AACGGAAAAA GCTTGTCGAA TGGCAGGAAG CAGCAAAGTT GTATGAACGC GGGCTGGAAA AACAACAAAT GTTAAAGGCA GAGCTGCAAA AATATGCAGC TTCTATAGAG CAAAAGGAAG TGGTTGAGCG AGCATACGAT GAAAAACAGC AAATTGAAAG AATAACAGAT GCGCTTCAAG ATATTCAACA ACGGCTGGCA CAAAAACAAA AAGCGCAACA GCAGGCCAAT AAGGAACGAG AAAAAAGAAA GCAGCGCGCA GAGACGGCAA AAGAAAAATT GCACGTTTTG TTCCAAAAAA TAGAGAAAGT CTATCATTCT GTCTGTGAGA GACAATGGCA GTTAGAAAAA CGGTTGTATC GGTATGAACA ACAGCTCGAG CAAGAACGCG AAAAAGCGGA GCAGGCGAAA ACTGCAGAGA TGGCAGCAAT TCTTGCTAGA CAACTTCGCC AAGGAGAACC ATGTCCTGTC TGCGGGTCAT GCGAACATCC GAATCCGTAT GTATATGAGC ACAATAACGT TAGCAGCGGG AAAATCGCTA TATTGGAACA GCAGGTAAAA CAAGGACAAA CTTATATGCA AGTGTTGCAC ACATTAAAAG CACAGCTTGA ACAATTGGCG CAATTTATTG GAAATGAGTG GACGTTCCAA 
CGATTCGACG TAAGTAAATT AAAGGTTGAA GAAGATATCG ATATTGCCGT GGAAGTGAAA GCACTTCAAC AAGATTGTTT GCAATTAAAA GAAGCCCTTC AGCAGGCGCT GCAAAAGTGG CGTGATGCCG AATCGTCACA GCAAGCCATC GAGCAAGAAT TAAGATTGCT AGAAAAAGAT GTACAAGAAT TACAAACGGA GCGGGAATAT CGTTTCAATG AATACAATCA ATTAGAAGCA AGCTGGCGTG AAAAATATAC CGATTTTTCT TTTGATACGA TCGAAGCGCT CCGCAACCAA ATGCGAAAAC ATGAAGAAAT CGTTCGGAAA TTGCAAAAAC GAATCGATGA CAGCATTCCG TTTCTTGAAA CTAAATTAAA TGAAAAAGAG CGATTAGCCC AACAGCAGCG AGAGCTGGAA ACGGAAAAAG TCCGCCTTGT TTCCTTGTGG AAGGCTAAAC AACAACTGGC AGATGACTAT AAACAGCAGT TAATGGAAAA AGCAGGTGGA AAGCAAGTAG AAGAGCAACT TCTTCAAGTG GAAAAACAGC TGCTTTACTT AAAAAAAGAA GAAGAAGAAG CATATCAAAG ATGGCTGCAG ACACAAAAAC AATACCAAGC TTTAGAAACG GAAGCAAAAG CGATACAGCA GTCGTTGGAA GAAGGAAAAG AACGGTACGA AGAAGCGAAA TTGCGTTGGC TTAATGAATT GAAAAAAACA ACATTTGCCG ATGAAAACGA AGTGAAAGAA GCAAAGGCGG CAGAAGAATT ACGCCTCGAG TGGGAACAGC AAATCAAACA TTATTGGCAG AAAGTGCAAC ATGCCCAGCA TCGTCTTCAG CAGCTGACCG AAGCGATCGG TGGGGAAATG ATTGATCAAC AACAATGGGA ACAATTGCAG ACTGTATATG AACAAATAAA ACAACAAGTA GATGAAGCCA TGCAGCAAGT GGGTGCCGCG CAAAACAAAG TGGAAGAATT AACGGAAAAA CATAAACGGT TTACGGAATT AGAGAAAAAA CAACAAGAGT TAGCCGCTCT TATGGATCGT TATAAACATT TGCAGTCGAT TTTAAAGGGA AACAGCTTTG TCGAATTTAT GGCGGAAGAA CAACTCATTC AAGTGACAAG AATGGCTTCC GAGCGATTAA GTTCGTTGAC AAGACAGCGC TATTCATTGG AAGTCGACTC ACAAGGAGGG TTTCTCATTC GCGATGATGC CAATGGCGGG GTAAAGCGTC CAGTGACAAC ATTATCAGGA GGAGAAACGT TTTTAACATC ATTATCGCTC GCATTGGCGT TATCAGCACA AATTCAATTG CGTGGGGAAT ATCCGCTTCA GTTTTTCTTC TTGGACGAAG GGTTTGGCAC GCTTGATGCG GAACTTCTTG ATACGGTTAT TTCTGCACTG GAAAAGCTTC ATTCGCAGCG GCTTTCTGTC GGGGTAATCA GTCATGTGCA AGAAATTCGC TCCCGCCTGC CGAAGCGGCT TATTGTTGAG CCGGCGGAAC CGTCTGGGCG AGGAACAAGA GTTAGATTAG AAGTCATGTA A
|
Protein sequence | MKPISLTIAG LHSFREKQTI DFQSLCEGGV FGIFGPTGSG KSTILDAITL ALFGSVERAP NHTQGIMNHA ENELFVSFTF ELENATCTKR YTVERSFKRG DEWRLKSGIC RLIEVGAETV VLADKLTEVN KAIEQLLGLT MKDFTRAVVL PQGKFAEFLS LKGAERRQML QRLFHLEPYG DKLNKKLKEK LAAISNELNE VIAEKTGLGD ASEEALERAK QELETLWDLL QKRKAELHDM EVTMERTKQL WAWQREKEEL EAELARLASE ESHIRLLENR KERAEQAERM WPYLEQYEEA RRFVINAEEK QKELQKKLVQ AKELYEKATH RYERIRQEKA ASEPHLLAKK EQLTQAKQLA AQIHALETEL NEMRKQIHLL EEEERKKLVE WQEAAKLYER GLEKQQMLKA ELQKYAASIE QKEVVERAYD EKQQIERITD ALQDIQQRLA QKQKAQQQAN KEREKRKQRA ETAKEKLHVL FQKIEKVYHS VCERQWQLEK RLYRYEQQLE QEREKAEQAK TAEMAAILAR QLRQGEPCPV CGSCEHPNPY VYEHNNVSSG KIAILEQQVK QGQTYMQVLH TLKAQLEQLA QFIGNEWTFQ RFDVSKLKVE EDIDIAVEVK ALQQDCLQLK EALQQALQKW RDAESSQQAI EQELRLLEKD VQELQTEREY RFNEYNQLEA SWREKYTDFS FDTIEALRNQ MRKHEEIVRK LQKRIDDSIP FLETKLNEKE RLAQQQRELE TEKVRLVSLW KAKQQLADDY KQQLMEKAGG KQVEEQLLQV EKQLLYLKKE EEEAYQRWLQ TQKQYQALET EAKAIQQSLE EGKERYEEAK LRWLNELKKT TFADENEVKE AKAAEELRLE WEQQIKHYWQ KVQHAQHRLQ QLTEAIGGEM IDQQQWEQLQ TVYEQIKQQV DEAMQQVGAA QNKVEELTEK HKRFTELEKK QQELAALMDR YKHLQSILKG NSFVEFMAEE QLIQVTRMAS ERLSSLTRQR YSLEVDSQGG FLIRDDANGG VKRPVTTLSG GETFLTSLSL ALALSAQIQL RGEYPLQFFF LDEGFGTLDA ELLDTVISAL EKLHSQRLSV GVISHVQEIR SRLPKRLIVE PAEPSGRGTR VRLEVM
|
| |
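The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the pipeline used to produce this record; `translate_cds` and the constant names are hypothetical, and the input is assumed to be an unspliced, in-frame, forward-strand CDS.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in its initiation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid initiation codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, in the record's 10-base groups.
print(translate_cds("GTGAAGCCGA TTTCATTAAC GATAGCGGGT"))  # MKPISLTIAG
```

The output matches the first ten residues of the protein sequence listed above. Because whitespace is stripped before translation, the sequence can be pasted in the same grouped layout the record uses.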