Gene Information |
Locus tag | GWCH70_1406 |
Symbol | |
ID | 7976860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1476969 |
End bp | 1477955 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644798327 |
Product | restriction endonuclease |
Protein accession | YP_002949500 |
Protein GI | 239826876 |
COG category | [S] Function unknown |
COG ID | [COG4127] Uncharacterized conserved protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000555478 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAAAAT GGTGGATGAT TAGAGCTGGT GATAAAAACG AACTCATTCC AATTTGGCTA GAGAAAGGAA TTGCATCGAT CGGATGGTCT CAATTAGGAA ATCCAAAACA TTTTCGATCG AAAGAACAGT TGATTCAAAA GGCTGATCAA GTGTTTAGTG ACGCAAAGCC GAAATCAAGA AATAGTTGGG TCAGTCAAGT TTGGCGTTTT AGTCACGAAA TTAAAAAAGG CGATCGGGTG ATAACTTATT CAAAAGAAAA AAGAGAATAC ATTATCGGAA CAGTAACAGA GGAACATTTT TATGATACAA CGATTGGGCA TCCAGATTAT CCAAATGTCA TTCGAGTCAT GTGGGAAGAG ACAACTATTT CGAGAGATTC ATTATCTCAA GCAGCAAAAA ACAGTTTAGG CTCCACATTA ACGGTCTTTC GAGTGGATGA ATGGGGAAGT GAAATAGAAA AATTATTAAG CGATCCATCA TTATCGGTGT CTGTAAATAA AACGGATGAG ACGGAAGAAG ATGAAATGAT AGAAGATCTA GTTGGAAAAG CTTTAACGAT GATTCAAGAT AAAGTGGATA AGCTCGATCC ATGGCAAATG CAATATTTAG TTGGAGGACT GCTTCAAGCA ATGGGATATA ATGTACAAAT CAGTCCCAAA GGTCCTGATG GTGGAGTAGA TGTGTTAGCT TATAAAGATG CATTCGGGTT TGAAAAGCCG ATTATTAAAG TACAAGTCAA GCATCGCAAA AGCGCAGCTT CTGCTCCTGA AATTCAACAG CTTCTTGGGG CAAATCCAAT TGATGCAAAC TGTCTCTTCG TCTCAACTGG AGGATTTACT TCTCAAGCGG AAGCAGTGGC AAAACATAAT TCAGTAAAAT TAATTGATTT AGAGGAGCTT GTTAACTTAA TTGTTTATTG GTATGAGAAG ATGCCAAACG ATGCAAGAGC TTTGTTACCT TTACAAAAGA TATATGTGCC GGAATAA
Protein sequence | MQKWWMIRAG DKNELIPIWL EKGIASIGWS QLGNPKHFRS KEQLIQKADQ VFSDAKPKSR NSWVSQVWRF SHEIKKGDRV ITYSKEKREY IIGTVTEEHF YDTTIGHPDY PNVIRVMWEE TTISRDSLSQ AAKNSLGSTL TVFRVDEWGS EIEKLLSDPS LSVSVNKTDE TEEDEMIEDL VGKALTMIQD KVDKLDPWQM QYLVGGLLQA MGYNVQISPK GPDGGVDVLA YKDAFGFEKP IIKVQVKHRK SAASAPEIQQ LLGANPIDAN CLFVSTGGFT SQAEAVAKHN SVKLIDLEEL VNLIVYWYEK MPNDARALLP LQKIYVPE
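The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence above ends in TAA). The function names `span_length`, `expected_protein_length`, `gc_fraction`, and `record_is_consistent` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1476969
END_BP = 1477955
GENE_LENGTH_BP = 987
PROTEIN_LENGTH_AA = 328


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def record_is_consistent() -> bool:
    """Check that coordinates, gene length, and protein length agree."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )
```

Under these assumptions, 1477955 - 1476969 + 1 = 987 bp, and 987 / 3 = 329 codons, one of which is the stop codon, leaving 328 amino acids. `gc_fraction` can be applied to the Gene sequence row pasted as a string, since it skips the spaces between the 10-base blocks.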