Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3186 |
Symbol | |
ID | 7977037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3218127 |
End bp | 3219575 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644799970 |
Product | transposase IS66 |
Protein accession | YP_002951109 |
Protein GI | 239828485 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATGG TACAACAAGC TGTATTTACA GTTGAGAGCT TAATCGGCAA AGTTCAACAA CAAAAACAGC TCATTCATCA ACTCATTCAA GAAAATGAAC ATTTGCGTCA CGAAAACAAA CAACTACGCA AAGAAAATGA ACAACTGAAG TACCGTGTTC AAGAGCTGGA AGCACGCACG AAAAAAAACA GCTCCAATAG CCATTTGCCC CCATCTTCTG ACCGTTTTGA GAAAAAGCGT TCCTCCCGCG AGCCGTCTGG CAAAAAGCCT GGTGGGCAAG AGGGACATGA GGGGAAGACG CTCCGTCAAG TGGAACATCC ACATCATCGT GTCGTCCACC GTGTGCATAC GTGTCAAGGA TGTGGAGCTT CTTTGCGTGA AGTCAAACCG TTCAAAGTAG ATATCCGTCA AGTGTTTGAT GTCCCTCCTG TGGCGATCGA GGTGACACAA CATGAACGTG AAGTGAAATC GTGTCCACAT TGTCGATGCG TGCAACAAGC CGAATTCCCA TCCCATGTCA CGAATCATGT GCAATACGGT CCACGGCTCA CGGCGCTCGT TGTTTATTTA CATCATATCC AATTGATCCC GTACAAGCGT TTAAGTGATA CAATCGAAGC GTTATATCAA CACTCGATTA GTACGGGAAC TCTTGCCAAT ATGGTGAAAC GAGGACGCGA ATTGTTGGAA TCAAATATGG ACATCATCGA AGACGCCTTA CTTGAATCCA ACATCCTGCA TGTCGATGAA ACGAGTTTGC GCATCAATGG GAAACTCGCA TGGGTGCATG TCGCGTGTAC ATCGAGATAT ACATACTTGG CTCCTCACGC TTCTCGTGGA AAGAAAGCAA CGGATGAGAT CGGGGTTCTT CCACAATACA AAGGGACGAT GATGCATGAT GCATTCGGTA CGTATCCGAG ATACACGAAA GCCACACATG CTCTTTGTCA TGCCCATCAT TTACGTGAGC TAAAAGGCTT CACCGAACAA GGGCATACGT GGGCGATGCG CATGACCACG TTTCTGTTAG CCGCCAAACA GGCAGTCGAA GCCCATCACG GTGCACTTTC CGAAGAAGAA GCGAGACGGT GGGAACGAGT GTATGATCGC ATCCTAGAAA GAGCACAACA CCGATTAGAA ACGATGACGC CTCTTCCGAA AAAAGCACTC GCTTTTGTTC GACGCCTTCA AAAACGAAAG GAAGAAGCGC TGCGTTTCTT ACGTGAAGTA CATGTTCCCT TTGATAACAA CCAAGCCGAA CGCGATCTTC GCATGGTCAA AGTCAAAGAG AACATTTCGG GTACGTTTCG CGAAGAAACA TTCGCGCAGT CGTTTTGCAT CGCAAGAAGC ATCGTTTCCA CACTGACGAA ACACGAAAAA AACGTGTGGG ATTCGTTATG TCTTCTGTTG GCAGGCGAAA CGATCGATCG AGTTCTTTCC GCTACCTAG
|
Protein sequence | MLMVQQAVFT VESLIGKVQQ QKQLIHQLIQ ENEHLRHENK QLRKENEQLK YRVQELEART KKNSSNSHLP PSSDRFEKKR SSREPSGKKP GGQEGHEGKT LRQVEHPHHR VVHRVHTCQG CGASLREVKP FKVDIRQVFD VPPVAIEVTQ HEREVKSCPH CRCVQQAEFP SHVTNHVQYG PRLTALVVYL HHIQLIPYKR LSDTIEALYQ HSISTGTLAN MVKRGRELLE SNMDIIEDAL LESNILHVDE TSLRINGKLA WVHVACTSRY TYLAPHASRG KKATDEIGVL PQYKGTMMHD AFGTYPRYTK ATHALCHAHH LRELKGFTEQ GHTWAMRMTT FLLAAKQAVE AHHGALSEEE ARRWERVYDR ILERAQHRLE TMTPLPKKAL AFVRRLQKRK EEALRFLREV HVPFDNNQAE RDLRMVKVKE NISGTFREET FAQSFCIARS IVSTLTKHEK NVWDSLCLLL AGETIDRVLS AT
|
| |