Gene GWCH70_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2047 
Symbol 
ID7977283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2107288 
End bp2108568 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content57% 
IMG OID644798865 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_002950035 
Protein GI239827411 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000186467 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTAACCG AACAAACGCT TGTCGTGGGC 
ATTGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGCGTGCTT
CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGGTTTC AACAGCTGTA TAAAGCGATT
CAGGGGGCGA TGCAAGCGTT CGGGAAGTCA GAGGTGATCG TCGCCGTGGA GCCGACCGGG
CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC
AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTTGATGACA ACCTGCCGAC GAAACACGAC
GCCAAAGACG CCCTGGTCAT TGCCAGACTG GCAAAAGACG GACGATTCCT CGTTCCCCGG
CTGCTGCACG AGATTGAAGC CGATTTGCGC GTGGGGAGCA CGCTCAAAGA GAAGCTCCGC
AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CGGATCGGTA TTTTCCAGAG
TTTTGGACCG TGTTTCGTGA CTTGGGGAAA ACGGCGCTTT CGGTGTTGGA GTGGACGCCG
CTTCCGGCTG ATATGGCCGG CCGGACGGTG GAGGAGCTTC TTGAGGTGTA CCGGCAAAGC
GAAGGGATGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCAACAC CGCGAAGGAC
TCGATTGGGG TGACGGAAGG GACAGCGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGT
CGATACCGCC AATTGGAGGC GGAGATCGCT GCACTGGACG CCGAGTTGAA GGCATTGGTT
CAAACGACGA TGGAGTACCA ATGGTTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC
ATCGATCTGC TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG CCAATTGGTG
AAGTTGGCGG GCCTGACGCT CAAGGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC
ATCTCCAAAC GGGGACGGAA ACGGTTGCGC TCGGTGCTGT TTCGGGCGAT GATTCCGCTG
ATTCGGCATA ACGAGGCGTT TCGCGAGCTG CATGAGTATT ATACGACCCG ATCCGTCAAT
CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT
GCGATTTGTA CGAAGAAACA AGCGTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC
CAGGTGCAAC GGGCGGCCTA A
 
Protein sequence
MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFQQLYKAI 
QGAMQAFGKS EVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD
AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE
FWTVFRDLGK TALSVLEWTP LPADMAGRTV EELLEVYRQS EGMKCPQKAK IQALINTAKD
SIGVTEGTAM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI
IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL
IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS
QVQRAA