Gene Information |
Locus tag | GWCH70_1064 |
Symbol | |
ID | 7979189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1117943 |
End bp | 1119286 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798017 |
Product | sun protein |
Protein accession | YP_002949190 |
Protein GI | 239826566 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase; [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
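
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a terminal stop codon that is not counted in Protein Length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1117943, 1119286)
print(length, protein_length(length))
```

With the values from this record the result is 1344 bp and 447 aa, which agrees with the Gene Length and Protein Length rows.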
| |
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTA AAAACGTACG AGAAGTCGCT TTGGAAACAT TGTTGGCTGT CGAAGAAAAA GAAGCGTACA GCAATTTATT ATTAAATAAA ATGATTGAGA CGCATCGTCT TTCCGAAAGA GATGTCCGGC TGTTGACGGA AATTGTGTAC GGAACGATTC AGCGCCGGGA TACGCTCGAT TATTATTTGA CCCCGTTTTT GAAAAAAGCG CGCAAACTAG AGCGGTGGGT TCGCGTGCTT CTTCGCTTGA CCCTTTATCA AGTATTGTAT TTAGACCGCA TCCCTGACCG CGCGGCGATT TTCGAAGCGG TGGAAATTGC GAAAAAACGA GGGCATCAAG GGGTCGCTTC GCTGGTTAAC GGGGTGATGC GCGCCATTCA ACGTCAAGGG GTGCCGCCTC TTGAGAAAAT AGAAGATGAA GTGGAGCGGC TAGCCGTTGC GACAAGCCAT CCACTCTGGC TTGTCAAGCG CTGGGTCGAG CAATACGGAC TTGAAGAAAC AAAACGGATG TGTGAAACGA ACTTGTTGCC GCCAAAGCAA ACGGCTAGGG TGAATACCGC AAGAATAACT GTCGAGGAAG CAATAGACAA GCTCAAGCAA GAAGGAATGG AAGTGGTACT CGGCGAGGTG GCAGAGGAAG CGATTCAAGC AAAAAGAGGA AATCTTGCGC ACACGGATGC GTTTCGCCGC GGATGGATTA CAATTCAAGA CGAAAGTTCG ATGTTAGTGG CACGGGCGCT TGGACCGAAA GAACATGAGC GCGTTCTTGA CAGTTGTGCT GCGCCGGGCG GAAAATCGAC CCATATCGCA GAGTTAATGA ACAACACCGG ACAAGTAATA TGTGCGGATA TTCATGAACA TAAAGTGAAT TTAATCGAGG AGAATGCGAA ACGACTGCAG CTTACGAATA TTTCTGCGCG TGTGCTTGAT AGCAGACGTT TAGGCGAAGT ATTTGAGCGG GAGATGTTTG ATAAAATTTT AGTTGATGCC CCGTGCTCGG GGTTTGGCGT CATCCGCCGT AAACCAGATA TTAAATATGC AAAAACGGAA GCAGATATTC CTTCCCTTGT GGCGTTGCAA CGTGAAATTT TGCACGCAGT TGCGCCATTG TTAAAAAAGG GCGGTACGCT TGTATATAGC ACATGTACGA TTGACCGTGA TGAAAATGAG GCGGTCATTG CTCAATTTTT AGACGATCAC CCAGAATTTT CACCAGATGA AACGATGAAA CGGCGGTTGC CCGAAAAAGT ACAACCATAC GTTCACAACG GGCAACTGCA TCTTCTTCCA CATTATTTTG GGTCGGACGG ATTTTTTATC GCATCATTAC GAAAGAAGGT GTAG
|
Protein sequence | MKAKNVREVA LETLLAVEEK EAYSNLLLNK MIETHRLSER DVRLLTEIVY GTIQRRDTLD YYLTPFLKKA RKLERWVRVL LRLTLYQVLY LDRIPDRAAI FEAVEIAKKR GHQGVASLVN GVMRAIQRQG VPPLEKIEDE VERLAVATSH PLWLVKRWVE QYGLEETKRM CETNLLPPKQ TARVNTARIT VEEAIDKLKQ EGMEVVLGEV AEEAIQAKRG NLAHTDAFRR GWITIQDESS MLVARALGPK EHERVLDSCA APGGKSTHIA ELMNNTGQVI CADIHEHKVN LIEENAKRLQ LTNISARVLD SRRLGEVFER EMFDKILVDA PCSGFGVIRR KPDIKYAKTE ADIPSLVALQ REILHAVAPL LKKGGTLVYS TCTIDRDENE AVIAQFLDDH PEFSPDETMK RRLPEKVQPY VHNGQLHLLP HYFGSDGFFI ASLRKKV
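
The sequence rows are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the protein translation could be recomputed from the Gene sequence field. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the alternative start codons that table 11 allows are not handled here, which is harmless for this gene because it starts with ATG. All names (`clean`, `gc_percent`, `translate`) are hypothetical helpers, not an existing library API.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate("ATGAAAGCTA AAAACGTA"))  # MKAKNV
print(gc_percent("ATGC"))                # 50.0
```

Passing the full Gene sequence value to `translate` and comparing the result with the cleaned Protein sequence value is a quick consistency check for the record, and `gc_percent` can be compared against the GC content row in the same way.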