Gene GWCH70_1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1790 
Symbol 
ID7978704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1857604 
End bp1858977 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content49% 
IMG OID644798629 
Producttransposase IS4 family protein 
Protein accessionYP_002949801 
Protein GI239827177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000150796 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATT TCCCGATTCG GTTTGTATTG ACAGATGAAG CGATTACCCC AAGTGCTGGG 
CTTGCTCTCG TTGGCTACTT ACTCCATCGA ACGAAACTGG ATAAACGGGT AAACGCACTT
CGGCTTCCAA CGGTTCGTCG AGAAGTGCAC ATTTCCCATA GCGATGTCAT TCGCTCGATG
ATTGGCTTGC TTGCCACAGG AAAAACGGAT TTCGATCATA TCGAAGCGTA TCGTCAGGAC
GATATCTTTT CGGCATCGAT GGGGATTCAG CACGTACCTT CCTCTCCAAC CTTGCGACAG
CGTCTCGATC AGCTCGCTTG TCTTCCGATG ACCGAAACCA TCATTTGGGA GGAATCGATG
CGTCTGTTGG TTCGACAACA CGCTACCTTG TCCCCTTGTT GGACCAAAGG AAAGACGACA
TGGCTTCCCC TTGATATAGA TGCTTCCCCA TTTGACAACT CCGATACGAA AAAAGAAGGA
GTCAGTCGAA CGTATAAAGG ATTTGACGGT TTTACACCGT TGTTTGCGTA TGCAGGGAAG
GAAGGGTATC TCGTTCATGC CGAGTTGCGT CCAGGGAAAC AACATGTGCA AGACAACATG
CCCTCGTTTT TAGTCACCGC TATCCGTCGA GCTCGTCAAC TGACTTCATC TCGTCTGCTT
GTTCGCATGG ATGCAGGAAA CGATGCAGAA GCGAATGTGC ACGTATGCCT AAAGGAAGAC
GTGGACTTTG TCATCAAGCG AAACTTACGC CGAGAATCGA AAGCGCTTTG GTTCCAGATC
GCTTCGCAAA AGGGCAGACG CGTCGATGAT GGACAAAGCG AAGGAGTACA AACCTATGAG
CTATGCCTTC CACAGAAGGC AGCGATCGAT GGAAACACGT ATACGTACGT TCAAGTCACC
CAAGTGACGG AACGGACGAT GGAACGCAAT GGACAGCTGA TGCTCGTTCC TGATTATGAA
GTGGAAAGCT ATTGGGTGCG GCTCAAAGGA TACGAGCATG TTCGAATGAG CGATGTGCTC
GCGTTGTATC ATGACCATGC GACATGCGAA CAGTTTCATA GCGAACTGAA GAGCGACTTA
GATTTAGAGC GGCTTCCATC TGGGAAGATG AAAACGAATG CGCTCGTGTT GGTCATGGGA
GCCTTCGTTT ACAATCTTCT TCGTCTGATT GGACAAGATC TATTAAGCGA TCCGAGACAT
CCGTTGCACC ACAAAGTGAA ACGCCGTCGC ATCAAGACGA TTATTCAGAC GGTGATCACG
ATGGCAGGTC GACTCGTCCG CCGATCACGA CAGATCTGGA TGAAACTGAC GCGAAGGAGT
GGGTACAGTA TACTCCTACT GAATGTGTAT CAAAAATGGA AAGAGGCAAG ATAA
 
Protein sequence
MKDFPIRFVL TDEAITPSAG LALVGYLLHR TKLDKRVNAL RLPTVRREVH ISHSDVIRSM 
IGLLATGKTD FDHIEAYRQD DIFSASMGIQ HVPSSPTLRQ RLDQLACLPM TETIIWEESM
RLLVRQHATL SPCWTKGKTT WLPLDIDASP FDNSDTKKEG VSRTYKGFDG FTPLFAYAGK
EGYLVHAELR PGKQHVQDNM PSFLVTAIRR ARQLTSSRLL VRMDAGNDAE ANVHVCLKED
VDFVIKRNLR RESKALWFQI ASQKGRRVDD GQSEGVQTYE LCLPQKAAID GNTYTYVQVT
QVTERTMERN GQLMLVPDYE VESYWVRLKG YEHVRMSDVL ALYHDHATCE QFHSELKSDL
DLERLPSGKM KTNALVLVMG AFVYNLLRLI GQDLLSDPRH PLHHKVKRRR IKTIIQTVIT
MAGRLVRRSR QIWMKLTRRS GYSILLLNVY QKWKEAR