Gene GWCH70_1551 details

Gene Information

Locus tag: GWCH70_1551
Symbol:
ID: 7976634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1627052
End bp: 1628332
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 58%
IMG OID: 644798443
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_002949615
Protein GI: 239826991
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
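
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but that reading matches the reported gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1627052
END_BP = 1628332
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))          # compare with Gene Length
print(coding_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.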


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000273211
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTTACGG AACAAACGCT TGTCGTGGGC 
ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGCGTGCTT
CGCAAGTCGT TCCCGATCTT CCAGTCGAAA GAGGGGTTTC AGCAGCTGTA TAAAGCGATT
CAGGAGGCGA TGCAAGCGTT TGGGAAGTCA GAGGTGATCG TCGCGGTGGA GCCGACCGGG
CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC
AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACACGAC
GCCAAAGACG CCCTGGTCAT TGCCAGACTG GCAAAAGACG GACGATTCCT CGTCCCCCGG
CTGCTGCACG AGATCGAAGC CGATTTGCGC GTGGGAAGCA CGCTCAAAGA GAAGCTCCGC
AAGGAACAGA CGGCGGTGAA AAATGCGATC ATCCGCTGGA CCGACCGGTA TTTTCCGGAG
TTTTGGACGG TGTTTCGCGA TCTGGGAAAA ACGGCGCTTT CGGTGTTGGA GTGGACGCCG
TTTCCGGCCG ATATGGCGGG TCGGACCGCC GAGGAGCTCA TCGAGGTGTA CCGGCAAAGC
GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCAACGC CGCGAAGGAC
TCCATTGGGG TGACGGAAGG GACAGCGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGC
CGATACCGCC AATTGGAGGC GGAGATCGCC GCGTTGGACG CCGAGTTGAA GGCATTGGTT
CAAACGACGA TGGAGTACCA ATGGTTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC
ATCGATCTGT TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG CCAATTGGTG
AAGTTGGCGG GCCTGACGCT CAAGGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC
ATCTCCAAAC GGGGACGGAA ACGGCTGCGC TCGGTGCTGT TTCGGGCGAT GATTCCGCTG
ATCCGGCACA ACGAGGCGTT TCGCGAGCTG CATGAATATT ACACGACCCG ATCCGTCAAC
CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT
GCGATTTGTA CGAAGAAACA AGCCTTTGAC GCGGAGCGAA TGAAGCAGGA CGTCTTGTCC
CAAGTGCAAC GGGCGGCCTA A
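
The GC content field reported for this gene can be recomputed from the nucleotide sequence. The sketch below is one hedged way to do that; `gc_percent` is a hypothetical helper name, and the example call uses only the first line of the gene sequence, so its result will generally differ from the whole-gene figure. To reproduce the reported value, pass the full sequence, with or without the spacing used above.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace is ignored, case is folded."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only, as an illustration.
first_line = "ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTTACGG AACAAACGCT TGTCGTGGGC"
print(f"{gc_percent(first_line):.1f}% GC in the first line")
```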
 
Protein sequence
MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFQQLYKAI 
QEAMQAFGKS EVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD
AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI IRWTDRYFPE
FWTVFRDLGK TALSVLEWTP FPADMAGRTA EELIEVYRQS EGLKCPQKAK IQALINAAKD
SIGVTEGTAM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI
IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL
IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS
QVQRAA
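
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative, handles only unambiguous A/C/G/T input, and does not model alternative start codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGAATTGTA CACAAAACTA TAAAATTGAT"))  # MNCTQNYKID
print(translate("CAAGTGCAAC GGGCGGCCTA A"))           # QVQRAA
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above; translating the full gene sequence the same way allows a complete comparison.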