Gene Information |
Locus tag | GWCH70_1284 |
Symbol | |
ID | 7976065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1335111 |
End bp | 1336439 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798228 |
Product | nucleoside recognition domain protein |
Protein accession | YP_002949401 |
Protein GI | 239826777 |
COG category | [S] Function unknown |
COG ID | [COG3314] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
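The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


gene_length = feature_length(1335111, 1336439)
print(gene_length)                           # 1329
print(expected_protein_length(gene_length))  # 442
```

Both values agree with the Gene Length and Protein Length fields in the record.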
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000041831 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAA ATACAAGCAA ACAGCCTTCG CTGCCGGTAC AAACTTCCAC TGGTCAGGCT TGGAAGTTTT TCATTTACAG CGCGATTGGC ATTTTCATGT TTTTCGTTCC CGTCCAAATT GGGGAGACGT CCTCGATTAT GCTTGACCAT ATCGTCTCAT GGATACGTAT GCAGTTTCCG GCGCTTGTAC CGTATTACGC GCTACTTGTA ATTGCTTTAG GCGCTGTATA TCCGTTTTAC ACTAAAACGT GGAACAAAGA TGTGGTTGCG ATCGTTTTTT CCTTGTTAAA AGTATTAGGA CTGATGGTAG CGATAATGTT AATGTTTAAA ATCGGTCCAT CTTGGCTGTT CAAACCGGAC ATGGGACCGT TTTTGTACGA TAAACTCGTC ATCTCCGTTG GTCTGTTAGT GCCGATCGGT TCCATTTTCC TCGCCCTTTT AGTCGGATAC GGGCTTTTGG AATTTGTCGG AGTTTTGATG CAACCAGTGA TGCGGCCGAT TTGGAAAACG CCAGGACGCT CGGCGATTGA TGCCGTTGCG TCGTTTGTCG GAAGTTACTC ACTTGGCCTG CTCATTACGA ACAAAGTATT TAAAGAAGGA AAATACACGA TTAAAGAAGC GGCCATAATT GCCACCGGCT TTTCGACCGT ATCGGTGACG TTCATGGTCG TTGTGGCGAA AACGCTGGGG TTGATGAACA TATGGAATAC GTATTTTTGG GTGACGTTTT TTGTAACTTT TGTGGTTACG GCTATTACCG TCCGGTTATG GCCGCTAAGC AAAATGAGCG ATGACTATTA TGATGGCAAA GGTGACCCTG AGGAAAAAGT TACGGGAAAT TACTTGAAAG AAGCATGGTC AGAAGCGATG AAAGCGGTGC AGCATTCGAA AGGGCTTTGG ACGAATGTGT GGGAAAACTT CAGAGATGGT TTTATCATGA CGATGAGCAT TTTGCCGTCG ATCATGTCGG TCGGACTGAT TGGGCTGCTG CTTGCTGAAT ATACGCCTCT GTTTGATTGG CTCGGCTATC TTTTCTATCC ATTTACCCTT CTATTGCAAA TTCCGGAGCC ATTGCTTGCT GCCAAAGCAT CAGCGATCGA GATTGCGGAA ATGTTTTTGC CAGCCTTGCT TGTAACGGAA GCACCTCTCG TCACTAAATT CATCATCGCG GTCGTTTCGA TTTCCGCCAT TTTGTTCTTT TCCGCTGTTA TTCCTTGTAT TTTGGCAACG GAAATCCCGA TCAGTATTCC AAAGCTATTA GTCATTTGGG CCGAGCGGAC GATTTTGACG CTTATAATCG CTACGCCGAT CGCGTATTTG CTGCTGTGA
|
Protein sequence | MKVNTSKQPS LPVQTSTGQA WKFFIYSAIG IFMFFVPVQI GETSSIMLDH IVSWIRMQFP ALVPYYALLV IALGAVYPFY TKTWNKDVVA IVFSLLKVLG LMVAIMLMFK IGPSWLFKPD MGPFLYDKLV ISVGLLVPIG SIFLALLVGY GLLEFVGVLM QPVMRPIWKT PGRSAIDAVA SFVGSYSLGL LITNKVFKEG KYTIKEAAII ATGFSTVSVT FMVVVAKTLG LMNIWNTYFW VTFFVTFVVT AITVRLWPLS KMSDDYYDGK GDPEEKVTGN YLKEAWSEAM KAVQHSKGLW TNVWENFRDG FIMTMSILPS IMSVGLIGLL LAEYTPLFDW LGYLFYPFTL LLQIPEPLLA AKASAIEIAE MFLPALLVTE APLVTKFIIA VVSISAILFF SAVIPCILAT EIPISIPKLL VIWAERTILT LIIATPIAYL LL
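As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a DNA string codon by codon and computes its GC fraction. It is a minimal, standard-library-only example, not the pipeline used to produce this record. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAGTAA ATACAAGCAA ACAGCCTTCG"))  # MKVNTSKQPS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.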