Gene GWCH70_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3072 
Symbol 
ID7979170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3091380 
End bp3092753 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content49% 
IMG OID644799862 
Producttransposase IS4 family protein 
Protein accessionYP_002951001 
Protein GI239828377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000016062 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATT TCCCGATTCG GTTTGTATTG ACAGATGAAG CGATTACCCC AAGTGCTGGG 
CTTGCTCTCG TTGGCTACTT ACTCCATCGA ACGAAACTGG ATAAACGGGT AAACGCACTT
CGGCTTCCAA CGGTTCGTCG AGAAGTGCAC ATTTCCCATA GCGATGTCAT TCGCTCGATG
ATTGGCTTGC TTGCCACAGG AAAAACGGAT TTCGATCATA TCGAAGCGTA TCGTCAGGAC
GATATCTTTT CGGCATCGAT GGGGATTCAG CACGTGCCTT CCTCTCCAAC CTTGCGACAA
CGACTCGATC AGCTCGCTTG TCTTCCGATG ACCGAAACGA TTCTTTGGGA GGAGTCCATA
CGTCTGTTGA TTCAACGACA TGCCACTTTG TCCCCTTGTT GGACCAAAGG AAAGACGACA
TGGCTTCCCC TTGATATAGA TGGTTCCCCA TTTGACAACT CCGATACGAA AAAAGAAGGA
GTCAGTCGAA CGTATAAAGG ATTTGACGGT TTTACACCGT TGTTTGCGTA TGCAGGGAAG
GAAGGGTATC TCGTTCATGC CGAGTTGCGT CCAGGGAAAC AACATGTGCA AGACAACATG
CCTTCGTTTT TAACTACCGC CATCCGTCGA GCTCGTCAAC TGACTTCATC TCGTCTGCTT
GTCCGCATGG ATGCAGGAAA CGATGCGGAA GCGAATGTGC ACGTATGCCT AAAGGAAGAC
GTGGACTTTG TCATCAAGCG AAACTTACGC CGAGAATCGA AAGCGCTTTG GTTCCAGATC
GCTTCGCAAA AGGGCAGACG CGTCGATGAT GGACAAAGCG AAGGAGTACA AACGTATGAG
TTATGCCTTC CACAGACCGC AGCGATCGAT GGAAACACGT ATACGTACGT TCAAGTCACC
CAAGTGACGG AACGGACGAT GGAACGAAAT GGACAGCTGA TGCTCGTTCC TGATTATGAA
GTGGAAAGCT ATTGGGTGCG GCTCAAAGGA TACGAGCATG TTCGAATGAG CGATGTGCTC
GCGTTGTATC ATGACCATGC GACATGCGAA CAGTTTCATA GCGAACTGAA GAGCGACTTA
GATTTAGAGC GGCTTCCATC TGGGAAGATG AAAACGAATG CGCTCGTGTT GGTCATGGGA
GCCTTCGTTT ACAATCTTCT TCGTCTGATT GGACAAGATC TATTAAGCGA TCCGAGACAT
CCGTTGCACC ACAAAGTGAA ACGCCGTCGC ATCAAGACGA TTATTCAGAC GGTGATCACG
ATGGCAGGTC GACTCGTCCG CCGATCACGA CAGATCTGGA TGAAACTGAC GCGAAGGAGT
GGGTACAGTA TACTCCTACT GAATGTGTAT CAAAAATGGA AAGAGGCAAG ATAA
 
Protein sequence
MKDFPIRFVL TDEAITPSAG LALVGYLLHR TKLDKRVNAL RLPTVRREVH ISHSDVIRSM 
IGLLATGKTD FDHIEAYRQD DIFSASMGIQ HVPSSPTLRQ RLDQLACLPM TETILWEESI
RLLIQRHATL SPCWTKGKTT WLPLDIDGSP FDNSDTKKEG VSRTYKGFDG FTPLFAYAGK
EGYLVHAELR PGKQHVQDNM PSFLTTAIRR ARQLTSSRLL VRMDAGNDAE ANVHVCLKED
VDFVIKRNLR RESKALWFQI ASQKGRRVDD GQSEGVQTYE LCLPQTAAID GNTYTYVQVT
QVTERTMERN GQLMLVPDYE VESYWVRLKG YEHVRMSDVL ALYHDHATCE QFHSELKSDL
DLERLPSGKM KTNALVLVMG AFVYNLLRLI GQDLLSDPRH PLHHKVKRRR IKTIIQTVIT
MAGRLVRRSR QIWMKLTRRS GYSILLLNVY QKWKEAR