Gene GWCH70_3127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3127 
Symbol 
ID7976771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3155301 
End bp3156521 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID644799913 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_002951052 
Protein GI239828428 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGTCT TGTATGAACG CTGTTGCGGA TTGGACGTGC ATAAGCAATC GATTACGGCT 
TGCGCCCTTA CCCCTGAAGG AAAAGAGATT CGCACGTTTG GTACGCTGAC CGACGATCTC
GAGGAGTTGG TGGATTGGCT GAAAGAAAAA AAGGTGACGC ACGTTGCCAT GGAGTCGACG
GGCGTATATT GGAAGCCAGT GTATAATCTC CTCGAAGCAG AGCCGATCGA AGTGCTTGTC
GTCAATGCCC AACACATCAA AGCGGTTCCC GGGCGAAAGA CCGATGTCAA AGATGCCGAA
TGGATCGCGG ACTTGCTTCG CCATGGATTG CTAAAAGGGA GTTACATCCC TCATCGGGCT
CAGCGGGAGC TCCGGGAACT GGTCCGTTAT CGGCGCAGTT TGATCGAGGA ACGGGCACGG
GAGCTCAACC GCATCCAAAA AGTGCTGGAA GGAGCCAATA TCAAGCTTTC TTCGGTCGTA
TCCGACATCA ACGGGATGTC GGCCCGGCTC ATCATTCGCG CCCTTATCGA AGGAAAGGAC
GATCCGGCGG CCCTCGCCCA GCTCGCCAAA GGGCGGCTGA AACAAAAAAC GGAAGAGCTC
CGGCGCGCAT TGAAAGGAGT GATGGGGCCG CATCAACGCA TGATGCTGGC CGAGCAATGG
CGTCATGTGG AGTATTTAGA TGAAGCGATT GCCCGGTTGG ATCGGGAAAT CGAGGAACGA
ACGAGCCCTT TTCATGAAGC GCTGGAGCTC ATCGATACGA TCCCGGGAGT GGGGCGGCAA
AGCGCGGAAC AAATTGTAGC GGAAATCGGG ACGGACATGA GCCGGTTCCC TACCGCCGCG
CACTTGGCCT CATGGGCCGG AATGGCTCCC GGGAATCATG AGAGTGCAGG GAAACGGTTG
TCAGGTCGAA CGAGGAAAGG GAACAAGAAG CTGAGGTCGT GCCTCGTGGA ATGCGCCCGT
GCCGCCGCCC GAACGAAGAA CACGTACCTA TCGGCCAAGT ATCATCGGAT CGCCAAACGA
AGAGGAGCGA ATCGAGCGAG TGTCGCGGTC GGGAGAACGA TTTTAGAAAT GATCTATTAT
ATCTTAACTC GAAAGGAACC GTATAGAGAG TTGGGAGCCG ACTACTGGGA TCGGCAGCGA
GAAGCGCGCA TCGTGCGTCA AACGGTGAAA CGATTAGAGG GGTTAGGGTA CGAAGTGAAA
CTGGAAAAAA CGAGTGCATA G
 
Protein sequence
MRVLYERCCG LDVHKQSITA CALTPEGKEI RTFGTLTDDL EELVDWLKEK KVTHVAMEST 
GVYWKPVYNL LEAEPIEVLV VNAQHIKAVP GRKTDVKDAE WIADLLRHGL LKGSYIPHRA
QRELRELVRY RRSLIEERAR ELNRIQKVLE GANIKLSSVV SDINGMSARL IIRALIEGKD
DPAALAQLAK GRLKQKTEEL RRALKGVMGP HQRMMLAEQW RHVEYLDEAI ARLDREIEER
TSPFHEALEL IDTIPGVGRQ SAEQIVAEIG TDMSRFPTAA HLASWAGMAP GNHESAGKRL
SGRTRKGNKK LRSCLVECAR AAARTKNTYL SAKYHRIAKR RGANRASVAV GRTILEMIYY
ILTRKEPYRE LGADYWDRQR EARIVRQTVK RLEGLGYEVK LEKTSA