Gene GWCH70_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2089 
Symbol 
ID7979252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2163189 
End bp2164469 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID644798906 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_002950066 
Protein GI239827442 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000334168 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTCACCG AACAAACGCT TGTCGTGGGC 
ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGTGTGCTT
CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGCTTTC GCCAGCTGTA TGAAGCGATT
CAGGAGGCGA TGCAAGCGTT CGGAAAGCCG CAGGTGATCG TCGCCGTGGA GCCGACCGGG
CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC
AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACATGAC
GCCAAAGACG CCCTAGTCAT CGCCAGACTG GCGAAAGACG GACGATTCCT CGTCCCCCGG
CTGTTGCACG AGATCGAAGC CGATTTGCGC GTCGGGAGCA CGCTCAAAGA GAAGCTCCGC
AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CCGACCGATA TTTTCCGGAG
TTTTGGACGG TGTTTCGCGA CCTGGGGAAA ACGGCGCTTT CGGTGCTGGA GTGGACGCCG
CTTCCAGCCG ATATGGCCGG CCGGGCGGTG GAGGAGCTTC TTGAGGTGTA CCGGCAAAGC
GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCGACGC CGCGAAGGAC
TCGATTGGGG TGACGGAAGG GACGACGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGC
CGATACCGCC AATTGGAGGC TGAGGTCGCC GCGTTGGACG CCGAGTTGAA GGCGTTGGTT
CAAACGACGA TGGAGTATCA ATGGCTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC
ATCGATCTTC TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG TCAATTGGTG
AAGTTGGCGG GCCTGACGCT CAAGGAGAAT TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC
ATCTCCAAAC GGGGACGGAA ACGGCTGCGA TCGGTGCTGT TTCGGGCGAT GATTCCGCTG
ATTCGGCATA ACGAGGCGTT TCGCGAGCTG CATGAGTATT ATACGACCCG ATCCGTCAAT
CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT
GCGATTTGTA CGAAGAAACA AGCGTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC
CAGGTGCAAC GGGCGGCCTA A
 
Protein sequence
MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFRQLYEAI 
QEAMQAFGKP QVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD
AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE
FWTVFRDLGK TALSVLEWTP LPADMAGRAV EELLEVYRQS EGLKCPQKAK IQALIDAAKD
SIGVTEGTTM ARFEIAALVR RYRQLEAEVA ALDAELKALV QTTMEYQWLK TVDGLGDATI
IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL
IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS
QVQRAA