Gene GWCH70_2381 details


Gene Information

Locus tag: GWCH70_2381
Symbol:
ID: 7979069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2421159
End bp: 2422328
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 41%
IMG OID: 644799184
Product: Rhomboid family protein
Protein accession: YP_002950344
Protein GI: 239827720
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
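
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that includes its stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2421159, 2422328)
print(length)                     # 1170
print(protein_length_aa(length))  # 389
```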


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATGGAA AAATGGAATT ATTATTTTGG CAGCTTGTTT ATTTTTTTAT AAAACAGCGT 
TATCGTATAA TTCAGCTGTC CAATCATTCA CATGAAATTT GGCTGGAGTC GCTTGAAAAT
AAACACGTTC CTATTGTTCG GGTTGTGCGT TATGACATCG ATTGGAGCCA ATGGTTGAAG
CGGGATATGG AATATGCTTG GCGCATTGCC GAACAAATTC AAAAACGGCG GATAAGAAGA
TTCAGGGAAA TTGTCAATAT TTATGTTTCC ACATACCCTC CTGTTGACGA TTGGGAGTTT
CTTATCGAGA AGCCGCTGCC GCTTTCACAA AAGCGAAATG CTGTTTTTCG AACATTTCTT
ATTCATTCCG GAAACGTAAA TATGTCATTG CAACAGTTAA CGGAAATTGT CCAAACACCT
ATTGTCCTTC CGACAGTAGT GAATGATATT GATCCATTTT TTGAAGCGGA GCGGCTGAAG
CAGGATATTT TACGGGAAGC GAGAGAACAG CATGAAAGAG AAAGACGATT GTTTGAATAC
GGGAAGCCTG TTTTTACATA TATTTTTATC GCATTGCAAG TGCTCGTCTT TCTGTTAATG
GAATGGAGTG GAGGCAGCAC GAATCCGGCG GTTTTGATCC AATATGGCGC AAAATTTAAT
CCCCTTATTC AAGAAGGGGA GTGGTGGCGC TTTTTTACAC CGATTTTTTT GCACATTGGC
TTTTTGCATT TATTGATGAA TACGTTCGCC CTTTACTATT TAGGTATGAC GGTCGAACGT
CTTTACGGAT CGTGGCGCTT TTTCTTTATC TATCTTATTG CGGGGTTTTT TGGGACGCTG
GGCAGTTTTT TATTTACTAC TTCTCTTTCC GCAGGAGCAT CAGGCGCCAT TTTTGGTTTA
TTCGGAGCGC TTCTTTATTT TGGCACTGTA TATCGGCATC TATTTTTCCA AACGATCGGA
ACCAATATTA TCGGTTTGAT TATTATCAAC CTGTTATTTG GGATAATGGT GCCTGGAATT
GATAATGCCG GGCATATTGG CGGATTAATT GGCGGATTTC TCGCTTCGGG TATTGTCCAT
TTGCCAAACC ATCTTGATTG GAAGCGGCAA GTGCGAACAT TGCTAGTGAC GGTGAGCGCT
GCCGCCTTAG GTTTATATAT CGGATTCTAA
 
Protein sequence
MNGKMELLFW QLVYFFIKQR YRIIQLSNHS HEIWLESLEN KHVPIVRVVR YDIDWSQWLK 
RDMEYAWRIA EQIQKRRIRR FREIVNIYVS TYPPVDDWEF LIEKPLPLSQ KRNAVFRTFL
IHSGNVNMSL QQLTEIVQTP IVLPTVVNDI DPFFEAERLK QDILREAREQ HERERRLFEY
GKPVFTYIFI ALQVLVFLLM EWSGGSTNPA VLIQYGAKFN PLIQEGEWWR FFTPIFLHIG
FLHLLMNTFA LYYLGMTVER LYGSWRFFFI YLIAGFFGTL GSFLFTTSLS AGASGAIFGL
FGALLYFGTV YRHLFFQTIG TNIIGLIIIN LLFGIMVPGI DNAGHIGGLI GGFLASGIVH
LPNHLDWKRQ VRTLLVTVSA AALGLYIGF
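
The protein sequence corresponds to a translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to reproduce that translation. It assumes table 11 shares the standard amino acid assignments and that an alternative start codon such as the TTG opening this gene is read as methionine at the first position; `translate_cds` and the constants are hypothetical names used only for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its
            # usual assignment.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGAATGGAA AAATGGAATT ATTATTTTGG"))  # MNGKMELLFW
```

Passing the full gene sequence should give a product that can be compared against the protein sequence listed in this record.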