Gene GWCH70_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3103 
Symbol 
ID7979176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3122792 
End bp3124450 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content54% 
IMG OID644799890 
Producttransposase 
Protein accessionYP_002951029 
Protein GI239828405 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000596563 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTC AAGTCAAAAA GGTCTATCGC AATTCTTATT TGAATATAAT AAGTGCCCTA 
TTCAAGAAAC TGGGTCTGCC TCAATTGATT GACCATCTCG TGCCCGTCGA TCCGCAGTGC
CAAACGCGAG TCAGCGATGC CGTTCAGGCC ATCCTCTACA ATGTGTTTGA CGGCCGGCAA
GCCCTTGTTC ACTTGGAACA TTGGGCTCAG GAGGTCGATT GTGAGAAACT CATCCGTCCC
GATCTCCATC CTTCCTGGTT GAACGACGAT GCGTTGGCCC GTCATCTCGA TCGCCTGTAT
GAGGCTGGCA TTCACAACGT CATCAGCACT TGCTTGATTC ATATTTATCG AAAAGAAGGC
CTTTCCCTCC GAGCCTTCCA CGCCGATACG ACGGACAAGA CCGTTTACGG CGCGTATGAA
TCGGCCTCGT TAGAGGCCTT ACAAATCACA CATGGCTACA ACCGCCATCA TCGTTGGCAA
AAACAGATCG GTTTCGGACT GGTCGGCAAC GAGGACGGCA TCCCGTTTTA CGGCGATGTG
CACGATGGCA ACCTGCCCGA TAAAACATGG AATCCCGAGG TGCTGTCTCG TGTCCATGAA
CAGCTGAAGC AGGCCAAAAT CGAAGACGAA TGGATTTACG TGGCCGATTC CGCCGCGATG
ACGAAAGAGA CCCTGGCGCA AACCAAAGCG GCCAACGCCT TTTTGATCAC CAGAGGCCCT
TCGTCGCTCC GGATCGTGAA AACCGCGCTG GCCGAAGCGG ATGTTGAGGA CACGACGTGG
AGCGATCCCT TTACGTTGGC GGAGAGAAAC GGCGCCACGT ACCGGGTATG GGAAACGGCC
TCGACGTATG AAGGCCACCC CGTTCGGCTG ATCGTTGTTG AATCGAGCGC GCTCGACCAG
CGAAAAGGAA AGACGCTTGA AAAAGAACGA ACCAAAGAAG CGGAGCTTCT TCGCGAGGAA
CAAGCCCGTT GGGAGCGTCA CCCCTTCTCC TGCCGGGAAG ATGCCGAACA AGCCTTGGCG
TCCCTCAAGG CGTCCCTTCG CCCCCGGTTT CATCGGGTTG AGGCCGCGGT CGAAGAGATC
GTACGCCTGA AAAAACGGCG CGGACGGCCG AAAAAAGGGG CGGAACCCGA GGTGGAGACG
CTGTATTTCT TGCACCTTGA CGTCGAATTC GACCAAGACG CGTGGGAACA GGCGAGACGG
AAAGCGTCCC GGTTTGTCCT TGTCACGACC GTTCCGAAGG AATGGAAGGG CCAACCCATG
GATGCCCAAG AGATCTTGAA GCTGTATAAA GGGCAGATCT CGGTGGAAAT GAACTTCGCT
TTTTTGAAAG ATCCGTTTTT CACGGATGAG ATTTACGTCA AAAAACCAGA ACGGGTCGCA
GTATTAGGCT ATTTGTTTCT GTTATCCTTG GCTATTTACC GCGTTTTTCA GCGCCGAGTG
CGTCAGTTTA TTACTCCAGA ACACCCGTTG AAGGGTCCTG GAGGCCGCAA GCTGACCCGG
CCGACGGGAC AGGCGATTTT TCAGCTGTTT CAATATGTGA ACGTCGTCCT GTTCAAGCTG
CCGGATGGGC GCATCCAACG CTCACTGGAT CGCTCCCTTA CCCCTGATCA GCGAAGGATT
CTGCAGGGAT TGGGCATGGA TGAGAGCATC TACGTGTAA
 
Protein sequence
MNVQVKKVYR NSYLNIISAL FKKLGLPQLI DHLVPVDPQC QTRVSDAVQA ILYNVFDGRQ 
ALVHLEHWAQ EVDCEKLIRP DLHPSWLNDD ALARHLDRLY EAGIHNVIST CLIHIYRKEG
LSLRAFHADT TDKTVYGAYE SASLEALQIT HGYNRHHRWQ KQIGFGLVGN EDGIPFYGDV
HDGNLPDKTW NPEVLSRVHE QLKQAKIEDE WIYVADSAAM TKETLAQTKA ANAFLITRGP
SSLRIVKTAL AEADVEDTTW SDPFTLAERN GATYRVWETA STYEGHPVRL IVVESSALDQ
RKGKTLEKER TKEAELLREE QARWERHPFS CREDAEQALA SLKASLRPRF HRVEAAVEEI
VRLKKRRGRP KKGAEPEVET LYFLHLDVEF DQDAWEQARR KASRFVLVTT VPKEWKGQPM
DAQEILKLYK GQISVEMNFA FLKDPFFTDE IYVKKPERVA VLGYLFLLSL AIYRVFQRRV
RQFITPEHPL KGPGGRKLTR PTGQAIFQLF QYVNVVLFKL PDGRIQRSLD RSLTPDQRRI
LQGLGMDESI YV