Gene GWCH70_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1416 
Symbol 
ID7979192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1486780 
End bp1488153 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content49% 
IMG OID644798336 
Producttransposase IS4 family protein 
Protein accessionYP_002949509 
Protein GI239826885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000673516 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATT TCCCGATTCG GTTTGTATTG ACAGATGAAG CGATTACCCC AAGTGCTGGG 
CTTGCTCTCG TTGGCTACTT ACTGCACCAA ACGAAGCTGG ATAAACGAGT AAACGCCCTT
CGGCTCCCAA CGGTTCGTCG AGATGTGCAC ATTTCCCATA GCGACGTCAT TCGCTCGATG
ATCGGCTTGC TTGCCACAGG AAAAACGGAT TTCGATCATA TCGAAGCGTA TCGTCAGGAC
GATATCTTTT CGGCATCGAT GGGGATTCAG CACGTACCTT CCTCTCCAAC CTTGCGACAA
CGACTCGATC AGCTCGCTTG TCTTCCGATG ACCGAAACCA TCATTTGGGA GGAATCGATG
CGTCTGTTGG TTCGACAACA CGCTACCTTG TCCCCTTGTT GGACGAAAGG GAAAACGACA
TGGCTTCCCC TTGATATAGA TGCTTCCCCA TTTGACAACT CCGATACGAA AAAAGAAGGA
GTCAGTCGAA CGTATAAAGG ATTTGACGGT TTTACACCGT TGTTTGCGTA TGCAGGGAAG
GAAGGGTATC TCGTTCATGC CGAGTTGCGT CCAGGGAAAC AACATGTGCA AGACAACATG
CCCTCGTTTT TAGTCACCGC TATCCGTCGA GCTCGTCAAC TGACTTCATC TCGTCTGCTT
GTTCGCATGG ATGCAGGAAA CGATGCAGAA GCGAATGTGC ACGTATGCCT AAAGGAAGAC
GTGGACTTTG TCATCAAGCG AAACTTACGC CGAGAATCGA AAGCGCTTTG GTTCCAGATC
GCTTCGCAAA AGGGCAGACG CGTCGATGAT GGACAAAGCG AAGGAGTACA AACCTATGAG
CTATGCCTTC CACAGAAGGC AGTGATCGAT GGAAACACGT ATACGTACGT TCAAGTCACC
CAAGTGACGG AACGGACGAT GGAACGCAAT GGACAGCTGA TGCTCGTTCC TGATTATGAA
GTGGAAAGCT ATTGGGTGCG GCTCAAAGGA TACGAGCATG TTCGAATGAG CGATGTGCTC
GCGTTGTATC ATGACCATGC GACATGCGAA CAGTTTCATA GCGAACTGAA GAGCGACTTA
GATTTAGAGC GGCTTCCATC TGGGAAGATG AAAACGAATG CGCTCGTGTT GGTCATGGGA
GCCTTCGTTT ACAATCTTCT TCGTCTGATT GGACAAGATC TATTAAGCGA TCCGAGACAT
CCGTTGCACC ACAAAGTGAA ACGCCGTCGC ATCAAGACGA TTATTCAGAC GGTGATCACG
ATGGCAGGTC GACTCGTCCG CCGATCACGA CAGATCTGGA TGAAACTGAC GCGAAGGAGT
GGGTACAGTA TACTCCTACT GAATGTGTAT CAAAAATGGA AAGAGGCAAG ATAA
 
Protein sequence
MKDFPIRFVL TDEAITPSAG LALVGYLLHQ TKLDKRVNAL RLPTVRRDVH ISHSDVIRSM 
IGLLATGKTD FDHIEAYRQD DIFSASMGIQ HVPSSPTLRQ RLDQLACLPM TETIIWEESM
RLLVRQHATL SPCWTKGKTT WLPLDIDASP FDNSDTKKEG VSRTYKGFDG FTPLFAYAGK
EGYLVHAELR PGKQHVQDNM PSFLVTAIRR ARQLTSSRLL VRMDAGNDAE ANVHVCLKED
VDFVIKRNLR RESKALWFQI ASQKGRRVDD GQSEGVQTYE LCLPQKAVID GNTYTYVQVT
QVTERTMERN GQLMLVPDYE VESYWVRLKG YEHVRMSDVL ALYHDHATCE QFHSELKSDL
DLERLPSGKM KTNALVLVMG AFVYNLLRLI GQDLLSDPRH PLHHKVKRRR IKTIIQTVIT
MAGRLVRRSR QIWMKLTRRS GYSILLLNVY QKWKEAR