Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2381 |
Symbol | |
ID | 7979069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2421159 |
End bp | 2422328 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644799184 |
Product | Rhomboid family protein |
Protein accession | YP_002950344 |
Protein GI | 239827720 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATGGAA AAATGGAATT ATTATTTTGG CAGCTTGTTT ATTTTTTTAT AAAACAGCGT TATCGTATAA TTCAGCTGTC CAATCATTCA CATGAAATTT GGCTGGAGTC GCTTGAAAAT AAACACGTTC CTATTGTTCG GGTTGTGCGT TATGACATCG ATTGGAGCCA ATGGTTGAAG CGGGATATGG AATATGCTTG GCGCATTGCC GAACAAATTC AAAAACGGCG GATAAGAAGA TTCAGGGAAA TTGTCAATAT TTATGTTTCC ACATACCCTC CTGTTGACGA TTGGGAGTTT CTTATCGAGA AGCCGCTGCC GCTTTCACAA AAGCGAAATG CTGTTTTTCG AACATTTCTT ATTCATTCCG GAAACGTAAA TATGTCATTG CAACAGTTAA CGGAAATTGT CCAAACACCT ATTGTCCTTC CGACAGTAGT GAATGATATT GATCCATTTT TTGAAGCGGA GCGGCTGAAG CAGGATATTT TACGGGAAGC GAGAGAACAG CATGAAAGAG AAAGACGATT GTTTGAATAC GGGAAGCCTG TTTTTACATA TATTTTTATC GCATTGCAAG TGCTCGTCTT TCTGTTAATG GAATGGAGTG GAGGCAGCAC GAATCCGGCG GTTTTGATCC AATATGGCGC AAAATTTAAT CCCCTTATTC AAGAAGGGGA GTGGTGGCGC TTTTTTACAC CGATTTTTTT GCACATTGGC TTTTTGCATT TATTGATGAA TACGTTCGCC CTTTACTATT TAGGTATGAC GGTCGAACGT CTTTACGGAT CGTGGCGCTT TTTCTTTATC TATCTTATTG CGGGGTTTTT TGGGACGCTG GGCAGTTTTT TATTTACTAC TTCTCTTTCC GCAGGAGCAT CAGGCGCCAT TTTTGGTTTA TTCGGAGCGC TTCTTTATTT TGGCACTGTA TATCGGCATC TATTTTTCCA AACGATCGGA ACCAATATTA TCGGTTTGAT TATTATCAAC CTGTTATTTG GGATAATGGT GCCTGGAATT GATAATGCCG GGCATATTGG CGGATTAATT GGCGGATTTC TCGCTTCGGG TATTGTCCAT TTGCCAAACC ATCTTGATTG GAAGCGGCAA GTGCGAACAT TGCTAGTGAC GGTGAGCGCT GCCGCCTTAG GTTTATATAT CGGATTCTAA
|
Protein sequence | MNGKMELLFW QLVYFFIKQR YRIIQLSNHS HEIWLESLEN KHVPIVRVVR YDIDWSQWLK RDMEYAWRIA EQIQKRRIRR FREIVNIYVS TYPPVDDWEF LIEKPLPLSQ KRNAVFRTFL IHSGNVNMSL QQLTEIVQTP IVLPTVVNDI DPFFEAERLK QDILREAREQ HERERRLFEY GKPVFTYIFI ALQVLVFLLM EWSGGSTNPA VLIQYGAKFN PLIQEGEWWR FFTPIFLHIG FLHLLMNTFA LYYLGMTVER LYGSWRFFFI YLIAGFFGTL GSFLFTTSLS AGASGAIFGL FGALLYFGTV YRHLFFQTIG TNIIGLIIIN LLFGIMVPGI DNAGHIGGLI GGFLASGIVH LPNHLDWKRQ VRTLLVTVSA AALGLYIGF
|
| |