Gene Information |
Locus tag | GWCH70_0472 |
Symbol | |
ID | 7978622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 523758 |
End bp | 525005 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644797449 |
Product | transposase, IS605 OrfB family |
Protein accession | YP_002948649 |
Protein GI | 239826025 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
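The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(523758, 525005)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both results match the Gene Length (1248 bp) and Protein Length (415 aa) fields listed for GWCH70_0472.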
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00293819 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTT GTATCAAACA ACAGCTAAAT GGTTTGACCA AAGAAGAATA CTTGACTCTT CGAGAACTGT GCCATATTGC CAAGAACATA TACAACGTTG GATTGTATAA TGTCAGACAA TACTATTTTG AACACAAGGA ATTTCTTAAT TATGAGAAAA ACTATCATCT TGCAAAAACG AACGAAAACT ATAAGCTGTT AAACAGCAAC ATGGCACAGC AAATTTTAAA AAAGGTTAAT GAGGCTTTTA AATCTTTCTT TGGCTTAGTA AAACTAGCCA AACAAGGCAA ATATGACTAC AAGGCTATCA GTATCCCAAA ATATCTTAAA AAAGATGGCT TTCATTCACT GATCATTGGC CAGATTCGTA TAGACGGCAA CAAATTTACG ATACCGTATT CTCGCCTATT TAAAAAGACT CACAAGCCTA TCACGATAAC GATTCCGCCT GTGTTACTGG ACCAAAAGAT TAAGCAGATT GAAATCATTC CTAAGCATCA TGCCAGGTTC TTTGAGATTC AGTACAAATA TGAAATGCCT GAAGATCAAA GAGAATTAAA TGACCAAAAA GCACTGGCAA TTGATTTAGG ATTAAACAAT TTTGCCACTT GTGTCACATC AGACGGCAGA TCATTCATCA TTGATGGGCG GAGATTAAAA AGTATAAATC AATGGTTTAA CAAAGAAAAT GCCAGACTTC AAAGCATAAA AGATAAGCAA AAAATCAAAG GCACCACTCG TAAACAAGCT TTGCTTGCTA TGAATCGCAA TAATAAAGTG AATGATTATA TCAACAAGAC TTGCCGTTAC ATCATTAACT ACTGTATTGA AAATCAAATT GGCAAACTTG TCATTGGCTA TGCGGAAACA TGGCAACGCA ATATTAATCT AGGAAAAAAG ACAAATCAAA ACTTTGTCAA TATTCCTCTC GGTAACATAA AAGAAAAACT AGAATATCTT TGTAAATTTT ACGGCATTGA ATTCTTGAAA CAGGAAGAAT CATATACGTC TCAAGCCAGC TTTTTTGACG GCGATGAGAT TCCTGAATAT AATGCCGACA ATCCAAAAGA ATATAAGTTC AGCGGCAAAC GTATTAAGCG CGGCTTGTAT CGAACAAAGT CTGGCAAACT AATTAATGCT GATGTCAATG GCGCATTAAA CATCTTAAAG AAAAGTAAAG CTGTAGACCT GAGTGTCTTA TGCTCTAGCG GCGAAGTGGA CACGCCTCAA AGAATAAGGA TTGCTTGA
|
Protein sequence | MYFCIKQQLN GLTKEEYLTL RELCHIAKNI YNVGLYNVRQ YYFEHKEFLN YEKNYHLAKT NENYKLLNSN MAQQILKKVN EAFKSFFGLV KLAKQGKYDY KAISIPKYLK KDGFHSLIIG QIRIDGNKFT IPYSRLFKKT HKPITITIPP VLLDQKIKQI EIIPKHHARF FEIQYKYEMP EDQRELNDQK ALAIDLGLNN FATCVTSDGR SFIIDGRRLK SINQWFNKEN ARLQSIKDKQ KIKGTTRKQA LLAMNRNNKV NDYINKTCRY IINYCIENQI GKLVIGYAET WQRNINLGKK TNQNFVNIPL GNIKEKLEYL CKFYGIEFLK QEESYTSQAS FFDGDEIPEY NADNPKEYKF SGKRIKRGLY RTKSGKLINA DVNGALNILK KSKAVDLSVL CSSGEVDTPQ RIRIA
|
| |
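The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC percentage, assuming GC content means the share of G and C bases over the full sequence length. The helper names are hypothetical, and the sample is only the first two blocks of the gene sequence, so its result is not the 35% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence from this record.
sample = clean_sequence("ATGTATTTTT GTATCAAACA")
print(len(sample))         # 20
print(gc_percent(sample))  # 20.0
```

Passing the complete Gene sequence field through `clean_sequence` before calling `gc_percent` would give a value to compare against the GC content field.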