Gene GWCH70_1514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1514 
Symbol 
ID7976601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1586744 
End bp1588402 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content54% 
IMG OID644798411 
Producttransposase 
Protein accessionYP_002949584 
Protein GI239826960 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTC AAGTCAAAAA GGTCTATCGC AATTCTTATT TGAATATAAT AAGTGCCCTA 
TTCAAGAAAC TGGGTCTGCC TCAATTGATT GACCATCTCG TGCCCGTCGA TCCGCAGTGC
CAAACGCGAG TCAGCGATGC CGTTCAGGCC ATCCTCTACA ATGTGTTTGA CGGCCGGCAA
GCCCTTGTTC ACTTGGAACA TTGGGCTCAG GAGGTCGATT GTGAGAAACT CATCCGTCCC
GATCTCCATC CTTCCTGGTT GAACGACGAT GCGTTGGCCC GTCATCTCGA TCGCCTGTAT
GAGGCTGGCA TTCACAACGT CATCAGCACT TGCTTGATTC ATATTTATCG AAAAGAAGGC
CTTTCCCTCC GAGCCTTCCA CGCCGATACG ACGGACAAGA CCGTTTACGG CGCGTATGAA
TCGGCCTCGT TAGAGGCCTT ACAAATCACA CATGGCTACA ACCGCCATCA TCGTTGGCAA
AAACAGATCG GTTTCGGACT GGTCGGCAAC GAGGACGGCA TCCCGTTTTA CGGCGATGTG
CACGATGGCA ACCTGCCCGA TAAAACATGG AATCCCGAGG TGCTGTCTCG TGTCCATGAA
CAGCTGAAGC AGGCCAAAAT CGAAGACGAA TGGATTTACG TGGCCGATTC CGCCGCGATG
ACGAAAGAGA CCCTGGCGCA AACCAAAGCG GCCAACGCCT TTTTGATCAC CAGAGGCCCT
TCGTCGCTCC GGATCGTGAA AACCGCGCTG GCCGAAGCGG ATGCTGAGGA CACGACGTGG
AGCGATCCCT TTACGTTGGC GGAGAGAAAC GGCGCCACGT ACCGGGTATG GGAAACGGCC
TCGACGTATG AAGGCCACCC CGTTCGGCTG ATCGTTGTTG AATCGAGCGC GCTCGACCAG
CGAAAAGGAA AGACGCTTGA AAAAGAACGA ACCAAAGAAG CGGAGCTTCT TCGCGAGGAA
CAAGCCCGTT GGGAGCGTCA CCCCTTCTCC TGCCGGGAAG ATGCCGAACA AGCCTTGGCG
TCCCTCAAGG CGTCCCTTCG CCCCCGGTTT CATCGGGTTG AGGCCGCGGT CGAAGAGATC
GTACGCCTGA AAAAACGGCG CGGACGGCCG AAAAAAGGGG CGGAACCCGA GGTGGAGACG
CTGTATTTCT TGCACCTTGA CGTCGAATTC GACCAAGACG CGTGGGAACA GGCGAGACGG
AAAGCGTCCC GGTTTGTCCT TGTCACGACC GTTCCGAAGG AATGGAAGGG CCAACCCATG
GATGCCCAAG AGATCTTGAA GCTGTATAAA GGGCAGATCT CGGTGGAAAT GAACTTCGCT
TTTTTGAAAG ATCCGTTTTT CACGGATGAG ATTTACGTCA AAAAACCAGA ACGGGTCGCA
GTATTAGGCT ATTTGTTTCT GTTGGCCTTG GCTATTTACC GCGTTTTTCA GCGCCGAGTG
CGTCAGTTTA TTACTCCAGA ACACCCGTTG AAGGGTCCTG GAGGCCGCAA GCTGACCCGG
CCGACGGGAC AGGCGATTTT TCAGCTGTTT CAATATGTGA ACGTCGTCCT GTTCAAGCTG
CCGGATGGGC GCATCCAACG CTCACTGGAT CGCTCCCTTA CCCCTGATCA GCGAAGGATT
CTGCAGGGAT TGGGCATGGA TGAGAGCATC TACGTGTAA
 
Protein sequence
MNVQVKKVYR NSYLNIISAL FKKLGLPQLI DHLVPVDPQC QTRVSDAVQA ILYNVFDGRQ 
ALVHLEHWAQ EVDCEKLIRP DLHPSWLNDD ALARHLDRLY EAGIHNVIST CLIHIYRKEG
LSLRAFHADT TDKTVYGAYE SASLEALQIT HGYNRHHRWQ KQIGFGLVGN EDGIPFYGDV
HDGNLPDKTW NPEVLSRVHE QLKQAKIEDE WIYVADSAAM TKETLAQTKA ANAFLITRGP
SSLRIVKTAL AEADAEDTTW SDPFTLAERN GATYRVWETA STYEGHPVRL IVVESSALDQ
RKGKTLEKER TKEAELLREE QARWERHPFS CREDAEQALA SLKASLRPRF HRVEAAVEEI
VRLKKRRGRP KKGAEPEVET LYFLHLDVEF DQDAWEQARR KASRFVLVTT VPKEWKGQPM
DAQEILKLYK GQISVEMNFA FLKDPFFTDE IYVKKPERVA VLGYLFLLAL AIYRVFQRRV
RQFITPEHPL KGPGGRKLTR PTGQAIFQLF QYVNVVLFKL PDGRIQRSLD RSLTPDQRRI
LQGLGMDESI YV