Gene Information |
Locus tag | GWCH70_0859 |
Symbol | |
ID | 7977865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 923005 |
End bp | 924252 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644797831 |
Product | transposase, IS605 OrfB family |
Protein accession | YP_002949004 |
Protein GI | 239826380 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
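The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming 1-based inclusive coordinates and a gene sequence that ends in a stop codon not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 923005
END_BP = 924252


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1248, matching "Gene Length"
print(protein_length(length_bp))  # 415, matching "Protein Length"
```

The strand field does not affect these lengths; it only determines whether the listed gene sequence is the reverse complement of the replicon at those coordinates.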
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.613067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTT GTATCAAACA ACAGCTAAAT GGTTTGACCA AAGAAGAATA CTTGACTCTT CGAGAACTGT GCCATATTGC CAAGAACATG TACAACGTCG GATTGTACAA TGTCAGACAA TACTATTTTG AACACAAGGA ATTTCTTAAT TATGAGAAAA ACTATCATCT TGCAAAAACG AACGAAAACT ATAAGCTGTT GAACAGCAAC ATGGCACAGC AAATTTTAAA AAAGGTTAAT GAGGCTTTTA AATCTTTCTT TGGCTTAGTA AAACTAGCCA AACAAGGCAA ATATGACTAC AAGGCTATCA GTATCCCAAA ATATCTTAAA AAAGATGGCT TTCATTCACT AATCATTGGT CAAATTCGTA TAGACGGCAA CAAATTCACG ATACCGTATT CGAATCTGTA TAAAAAGACT CATAAGCCTA TCACGATAAC GATTCCGCCT GTGTTACTGG ACAAAAAGAT TAAGCAGATT GAAATCATTC CTAAACATCA TGCCAGGTTC TTTGAGATTC AGTACAAATA TGAAATGCCT GAAGATCAAA GAGAATTAAA TGACCAAAAA GCACTGGCGA TTGATTTAGG AGTGAATAAT CTTGCCACTT GTGTCACATC AGACGGCAGA TCATTCATCA TTGATGGGCG GAGATTAAAA AGTATAAATC AATGGTTTAA CAAAGAAAAT GCCAGACTTC AAAGCATAAA AGATAAGCAA AAAATCAAAG GCACCACTCG TAAACAGGCT TTGCTTGCTA TGAATCGCAA TAATAAAGTG AATGATTATA TCAACAAGAC TTGTCGTTAC ATCATAAACT ACTGTATTGA AAATCAAATT GGCAAACTTG TCATTGGCTA TGCGGAAACA TTGCAGCGCA ATATGAATCT AGGAAAAAAG ACAAATCAAA ACTTTGCCAA TATTCCTCTC GGTAATATAA AAGAAAAACT AGAGTATCTT TGTAAATTTT ACGGCATTAA ATTCTTCAAA CAGGAAGAGT CATATACGTC TAAAGCCAGC TTTTTTGACG GGGATGAGAT TCCTGAATAT AATGCCGACA ATCCAAAAGA ATATAAGTTC AGTGGCAAAC GTATTAAGCG AGGTTTGTAT CGAACAAAGT CCGGCAAACT AATCAATGCT GATGTAAATG GTGCATTAAA CATCTTAAAG AAAAGTAAAG CTGTAGACCT GAGTGTCTTA TGCTCTAGCG GCGAAGTGGA CACGCCTCAA AGAATAAGGA TTGCTTAA
|
Protein sequence | MYFCIKQQLN GLTKEEYLTL RELCHIAKNM YNVGLYNVRQ YYFEHKEFLN YEKNYHLAKT NENYKLLNSN MAQQILKKVN EAFKSFFGLV KLAKQGKYDY KAISIPKYLK KDGFHSLIIG QIRIDGNKFT IPYSNLYKKT HKPITITIPP VLLDKKIKQI EIIPKHHARF FEIQYKYEMP EDQRELNDQK ALAIDLGVNN LATCVTSDGR SFIIDGRRLK SINQWFNKEN ARLQSIKDKQ KIKGTTRKQA LLAMNRNNKV NDYINKTCRY IINYCIENQI GKLVIGYAET LQRNMNLGKK TNQNFANIPL GNIKEKLEYL CKFYGIKFFK QEESYTSKAS FFDGDEIPEY NADNPKEYKF SGKRIKRGLY RTKSGKLINA DVNGALNILK KSKAVDLSVL CSSGEVDTPQ RIRIA