Gene GWCH70_1909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1909 
Symbol 
ID7978737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1961707 
End bp1962972 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content44% 
IMG OID644798740 
Producthypothetical protein 
Protein accessionYP_002949910 
Protein GI239827286 
COG category[L] Replication, recombination and repair 
COG ID[COG3359] Predicted exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000148245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATGA AGCAGAAATT AGCAAGATGG AAGGAACAAC TTGCATCCCG GGCTTCCGTT 
CAGGAAGAAC GGCCCGATGT GCTCTTTGGA GAGCAACAAG AGAAAGAGGT TCCATTTCTT
GATGAATGGC AGAAAAAACA TGTGCAGCCG TTCTTTTTTG ATGGGGATTA TTGTTTGATT
CGCGAAGTAG TATATCCGTT AGATTATCAG CATGGACGAT ATCGACTTGG AGAGTTTCAT
CATATTCATG CCCGTTGGCA AGACGCTTCT TTTACACATC CGCTTTCGAG CAAAGGGCAT
GAAGCGAGCG ATTTATTTTT CTTTGATACG GAGACGACCG GGCTTAGCGG CGGAACGGGA
CATGTCATTT TTTTGCTTGG CCATGCCCGC GTATATGAAG ATCGGGTTGT TGTCCGCCAG
CATTTTTTGC CGCACCCGGG AGCGGAAGTG GCATTGTATC AAAGTTTTTT ATCGGAAGTC
GACTATACAA CGCTTGTTAC GTATAACGGA AAAGCATTCG ATTGGCCGAA AGTCAAGACG
CGCCATACGC TGATTCGTGA TGCTGTCCCG AAACTGCCGG CGTTTGGCCA TTTCGATTTA
TATCACGCAT CAAGGAGAAT GTGGAAACAA AAGCTTGAGT CTGTTCGCCT TTCCAATGTC
GAAAAAGAGA TATTGCAAAT TGAGCGAGAA GAAGATGTTC CCGGTTTTTT GGCGCCGATG
ATGTATATGG ACTTTCTATC GGCGCCGCAT CCTGATCGAA TTTTTCCGGT ATTTCTCCAT
AATGAACTTG ATGTTTTATC GTTAATTTGC CTCTATATTC ATTTATCGAA ACAGCTGCTA
GAAGCGCCGC AACTAAAAGA TGCATTGGAA CAGTTGGAAA CAGCTCGTTG GCTCGAGACG
TTAGGAGAAA CAAATGCCGC GAAAAACGTG TATGAGCGCG TGATCGAAAA AGAAACAAAA
GAATCGTGGC AGGCCAAATG GCAACTATCG CTGTTATATA AAAAAGAAAA ACGGTACGAA
AAAGCAGTGG ACATATGGAA AGAATTATGG CAGCATGGCA GTGATACGTG GAAGATGAAA
GCCGGGGTTG AATTGGCAAA AGCGTATGAA CATTATTTTC GTGATGCCCA TATGGCGCAT
CACTATGCGA TCAACGTATA TGAACGATGG AAAACACTAT CTCGTTCCTA TAAACAGCGG
AATACTACAC AAGAGTTAGA GTTGATCAGG CGTATAGAAC GGCTTCAGCG GAAATTAAAT
CATTAA
 
Protein sequence
MSMKQKLARW KEQLASRASV QEERPDVLFG EQQEKEVPFL DEWQKKHVQP FFFDGDYCLI 
REVVYPLDYQ HGRYRLGEFH HIHARWQDAS FTHPLSSKGH EASDLFFFDT ETTGLSGGTG
HVIFLLGHAR VYEDRVVVRQ HFLPHPGAEV ALYQSFLSEV DYTTLVTYNG KAFDWPKVKT
RHTLIRDAVP KLPAFGHFDL YHASRRMWKQ KLESVRLSNV EKEILQIERE EDVPGFLAPM
MYMDFLSAPH PDRIFPVFLH NELDVLSLIC LYIHLSKQLL EAPQLKDALE QLETARWLET
LGETNAAKNV YERVIEKETK ESWQAKWQLS LLYKKEKRYE KAVDIWKELW QHGSDTWKMK
AGVELAKAYE HYFRDAHMAH HYAINVYERW KTLSRSYKQR NTTQELELIR RIERLQRKLN
H