Gene GWCH70_1284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1284 
Symbol 
ID7976065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1335111 
End bp1336439 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content45% 
IMG OID644798228 
Productnucleoside recognition domain protein 
Protein accessionYP_002949401 
Protein GI239826777 
COG category[S] Function unknown 
COG ID[COG3314] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000041831 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAA ATACAAGCAA ACAGCCTTCG CTGCCGGTAC AAACTTCCAC TGGTCAGGCT 
TGGAAGTTTT TCATTTACAG CGCGATTGGC ATTTTCATGT TTTTCGTTCC CGTCCAAATT
GGGGAGACGT CCTCGATTAT GCTTGACCAT ATCGTCTCAT GGATACGTAT GCAGTTTCCG
GCGCTTGTAC CGTATTACGC GCTACTTGTA ATTGCTTTAG GCGCTGTATA TCCGTTTTAC
ACTAAAACGT GGAACAAAGA TGTGGTTGCG ATCGTTTTTT CCTTGTTAAA AGTATTAGGA
CTGATGGTAG CGATAATGTT AATGTTTAAA ATCGGTCCAT CTTGGCTGTT CAAACCGGAC
ATGGGACCGT TTTTGTACGA TAAACTCGTC ATCTCCGTTG GTCTGTTAGT GCCGATCGGT
TCCATTTTCC TCGCCCTTTT AGTCGGATAC GGGCTTTTGG AATTTGTCGG AGTTTTGATG
CAACCAGTGA TGCGGCCGAT TTGGAAAACG CCAGGACGCT CGGCGATTGA TGCCGTTGCG
TCGTTTGTCG GAAGTTACTC ACTTGGCCTG CTCATTACGA ACAAAGTATT TAAAGAAGGA
AAATACACGA TTAAAGAAGC GGCCATAATT GCCACCGGCT TTTCGACCGT ATCGGTGACG
TTCATGGTCG TTGTGGCGAA AACGCTGGGG TTGATGAACA TATGGAATAC GTATTTTTGG
GTGACGTTTT TTGTAACTTT TGTGGTTACG GCTATTACCG TCCGGTTATG GCCGCTAAGC
AAAATGAGCG ATGACTATTA TGATGGCAAA GGTGACCCTG AGGAAAAAGT TACGGGAAAT
TACTTGAAAG AAGCATGGTC AGAAGCGATG AAAGCGGTGC AGCATTCGAA AGGGCTTTGG
ACGAATGTGT GGGAAAACTT CAGAGATGGT TTTATCATGA CGATGAGCAT TTTGCCGTCG
ATCATGTCGG TCGGACTGAT TGGGCTGCTG CTTGCTGAAT ATACGCCTCT GTTTGATTGG
CTCGGCTATC TTTTCTATCC ATTTACCCTT CTATTGCAAA TTCCGGAGCC ATTGCTTGCT
GCCAAAGCAT CAGCGATCGA GATTGCGGAA ATGTTTTTGC CAGCCTTGCT TGTAACGGAA
GCACCTCTCG TCACTAAATT CATCATCGCG GTCGTTTCGA TTTCCGCCAT TTTGTTCTTT
TCCGCTGTTA TTCCTTGTAT TTTGGCAACG GAAATCCCGA TCAGTATTCC AAAGCTATTA
GTCATTTGGG CCGAGCGGAC GATTTTGACG CTTATAATCG CTACGCCGAT CGCGTATTTG
CTGCTGTGA
 
Protein sequence
MKVNTSKQPS LPVQTSTGQA WKFFIYSAIG IFMFFVPVQI GETSSIMLDH IVSWIRMQFP 
ALVPYYALLV IALGAVYPFY TKTWNKDVVA IVFSLLKVLG LMVAIMLMFK IGPSWLFKPD
MGPFLYDKLV ISVGLLVPIG SIFLALLVGY GLLEFVGVLM QPVMRPIWKT PGRSAIDAVA
SFVGSYSLGL LITNKVFKEG KYTIKEAAII ATGFSTVSVT FMVVVAKTLG LMNIWNTYFW
VTFFVTFVVT AITVRLWPLS KMSDDYYDGK GDPEEKVTGN YLKEAWSEAM KAVQHSKGLW
TNVWENFRDG FIMTMSILPS IMSVGLIGLL LAEYTPLFDW LGYLFYPFTL LLQIPEPLLA
AKASAIEIAE MFLPALLVTE APLVTKFIIA VVSISAILFF SAVIPCILAT EIPISIPKLL
VIWAERTILT LIIATPIAYL LL