Gene GWCH70_1457 details


Gene Information

Locus tag: GWCH70_1457
Symbol:
ID: 7976905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1527145
End bp: 1528593
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 48%
IMG OID: 644798361
Product: transposase IS66
Protein accession: YP_002949534
Protein GI: 239826910
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
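
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1527145
END_BP = 1528593
GENE_LENGTH_BP = 1449
PROTEIN_LENGTH_AA = 482


def span_length(start_bp, end_bp):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1528593 - 1527145 + 1 gives 1449 bp, and 1449 / 3 - 1 gives 482 aa, which agrees with the listed gene and protein lengths.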


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGACGG TACAACAAGC TGTATTTACA GTTGAGGGCT TAATCGGCAA AGTTCAACAA 
CAAAAACAGC TCATTCATCA ACTCATTCAA GAAAATGAAC ATTTGCGTCA CGAAAACAAA
CAGCTACGCA AAGAAAATGA ACAACTGAAG TACCGTGTTC AAGAGCTGGA AGCACGCACG
AAAAAAAACA GCTCCAATAG CCATTTGCCC CCATCTTCTG ACCGTTTTGA GAAAAAGCGT
TCCTCCCGCG AGCCGTCTGG CAAAAAGCCT GGTGGGCAAG AGGGACATGA GGGGAAGACG
CTCCGTCAAG TGGAACATCC ACATCATCGT GTCGTCCACC GTGTGCATAC GTGTCAAGGA
TGTGGAGCTT CTTTGCGTGA AGTCAAACCG TTCAAAGTAG ATATCCGTCA AGTGTTTGAT
GTCCCTCCTG TGGCGATCGA GGTGACACAA CATGAACGTG AAGTGAAATC GTGTCCACAT
TGTCGATGCG TGCAACAAGC CGAATTCCCA TCCCATGTCA CGAATCATGT GCAATACGGT
CCACGGCTCA CGGCGCTCGT TGTTTATTTA CATCATATCC AATTGATCCC GTACAAGCGT
TTAAGTGATA CAATCGAAGC GTTATATCAA CACTCGATTA GTACGGGAAC CCTTGCCAAT
ATGGTGAAAC GAGGACGCGA ATTGTTGGAA TCAAATATGG ACATCATCGA AGACGCCTTA
CTTGAATCCA ACATCCTGCA TGTCGATGAA ACGAGTTTGC GCATCAATGG GAAACTCGCA
TGGGTGCATG TCGCGTGTAC ATCGAGATAT ACATACTTGG CTCCTCACGC TTCTCGTGGA
AAAAAAGCGA CCGATGATAT CGGGATTCTT CCCCGATATG AAGGGACGAT GATGCACGAT
GCGTTCGGTA CATATCCGAA ATACACCCAT GCCACCCATG CCCTTTGTCA TGCCCACCAT
TTGCGTGAGT TAAAAGGATT CATCGAACAA GGGCATACGT GGGCGATGCG CATGACCACG
TTTCTGTTAG CCGCCAAGCA AGCCGTCGAA GCCCATCACG GTGCACTTTC CGAAGAAGAA
GCGAGACGGT GGGAACGAGT GTATGATCGC ATCCTAGAAA GAGCACAACA CCGATTAGAA
ACGATGACGC CTCTTCCGAA AAAAGCACTC GCTTTTGTTC GACGCCTTCA AAAACGAAAG
GAAGAAGCGC TGCGTTTCTT ACGTGAAGTA CATGTTCCCT TTGATAACAA CCAAGCCGAA
CGCGATCTTC GCATGGTCAA AGTCAAAGAG AACATTTCGG GTACGTTTCG CGAAGAAACA
TTCGCGCAGT CGTTTTGCAT CGCAAGAAGC ATCGTTTCCA CACTGACGAA ACACGAAAAA
AACGTGTGGG ATTCGTTATG TCTTCTGTTG GCAGGCGAAA CGATCGATCG AGTTCTTTCC
GCTACCTAG
 
Protein sequence
MLTVQQAVFT VEGLIGKVQQ QKQLIHQLIQ ENEHLRHENK QLRKENEQLK YRVQELEART 
KKNSSNSHLP PSSDRFEKKR SSREPSGKKP GGQEGHEGKT LRQVEHPHHR VVHRVHTCQG
CGASLREVKP FKVDIRQVFD VPPVAIEVTQ HEREVKSCPH CRCVQQAEFP SHVTNHVQYG
PRLTALVVYL HHIQLIPYKR LSDTIEALYQ HSISTGTLAN MVKRGRELLE SNMDIIEDAL
LESNILHVDE TSLRINGKLA WVHVACTSRY TYLAPHASRG KKATDDIGIL PRYEGTMMHD
AFGTYPKYTH ATHALCHAHH LRELKGFIEQ GHTWAMRMTT FLLAAKQAVE AHHGALSEEE
ARRWERVYDR ILERAQHRLE TMTPLPKKAL AFVRRLQKRK EEALRFLREV HVPFDNNQAE
RDLRMVKVKE NISGTFREET FAQSFCIARS IVSTLTKHEK NVWDSLCLLL AGETIDRVLS
AT
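
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to join the blocks, compute a GC fraction, and translate the DNA so it can be compared with the listed protein. It is an illustration only: the helper names are hypothetical, only the first and last bases of the gene are embedded as sample input, and the codon assignments are those shared by the standard code and translation table 11. Alternative start codons, which table 11 allows, are not handled; this gene starts with ATG, so that does not matter here.

```python
# Codon table built from the usual TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence, copied from this record.
GENE_START = "ATGTTGACGG TACAACAAGC TGTATTTACA"
GENE_END = "GCTACCTAG"

if __name__ == "__main__":
    print(translate(join_blocks(GENE_START)))
    print(translate(join_blocks(GENE_END)))
```

Applied to the full gene sequence, `gc_content` can be compared with the listed GC content, and `translate` with the full protein sequence.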