Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3098 |
Symbol | |
ID | 7976744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3118690 |
End bp | 3119880 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799885 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_002951024 |
Protein GI | 239828400 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0316374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTTCTA TATCACTAGG ATTGCCAGAA TTTAAAGTGA TTAAACAAGA ACTTCTTTCC TATGGTTATG CGATTCATGT AGAGAAAACA GAGACACAGG AACGTTGCCC TCATTGTGGG TTTGCCACTT CCTCTGTCCA CGACAGACGG ACAAGAAAAG TACGGGATTT GGCGATTTTC CATCAGCCGG TGTACTTGTT CGTCAAGGTA AAGCGCTATC GGTGCCGGAA TTGTTCCCAA GTGTTTTCTG CCTCTTTGGA ATCGATTGAA CCCAATCAAC ATTACACTAG TCGATTTTGT GAGCACTTGT ATGAACTTTG TGAAGGCTCC ACCATTCAAG AGGTTAGTCG AAAGCAGCGC ATCCCCTATA CGACATTGGA ACGCATCTAC TACTCCATCG CATCAAAAAA AGCAAAAGAG CGTCAAAACA CAATGGAGGC ATCTTGTCAG GAAGGGATGG TGCTTAGCTT AGATGAAATC GCTGTAAAAA AGGGACATCA GTATGAAACC GTATTGATGG ATGCCAAAGC TGGATCGGTC ATGGGAATGC ATGCCGATCG CCAATGTGAC TCCGCCATCA ACTTGTTGAG CCAAAATATC CTGTCGAAAG AAATGGTCCA AACGGTGATT CTTGACATGT GGGAACCTTA TCATAAGGCG GTTCGCGCCC TGTTTCCATC TGCTTCGATT GTCATCGATA AGTACCATGT GGTTCAAAAA GTGACACAAG CCTTGGATCA AGCAAGAAAG GAATTTTCTC CATTGAAAAA GGCTCGATAT CTTCTCTTAA AAGGCTGTGA AAAGCTTCGT AAGGACCAAC GGCTTCGATT AGACGATATC TTGGAGGAGT ATCCGACACT TTCCATTGCT TATTATCTGA AAGAGTTGTT TCGGGATTTT TACCGAACCG ATGGATATAA CGAAGCAAAG GAACGCTTGG AAGAATGGAT TAAGTTAGCC AAACAGAGCC CTTTTGCTTC TTTTCAGGAA GCAGCCAACA CGCTTGAAAG GTGGAAGGAG CCTATTCTTT CCTACTTTTT GTGCCCATAT ACGAATGCCC GAATCGAGGG GACGAATCAC AAGATCAAAA ACATCAAACG CCGGGCATAT GGCTATCGAA ATCTAGAACG GTTTCGTTTA CGTGTATTTC TGGAGTGTAC AGGGAACACT ACAGGTAGTC AGGCTGCCTA A
|
Protein sequence | MLSISLGLPE FKVIKQELLS YGYAIHVEKT ETQERCPHCG FATSSVHDRR TRKVRDLAIF HQPVYLFVKV KRYRCRNCSQ VFSASLESIE PNQHYTSRFC EHLYELCEGS TIQEVSRKQR IPYTTLERIY YSIASKKAKE RQNTMEASCQ EGMVLSLDEI AVKKGHQYET VLMDAKAGSV MGMHADRQCD SAINLLSQNI LSKEMVQTVI LDMWEPYHKA VRALFPSASI VIDKYHVVQK VTQALDQARK EFSPLKKARY LLLKGCEKLR KDQRLRLDDI LEEYPTLSIA YYLKELFRDF YRTDGYNEAK ERLEEWIKLA KQSPFASFQE AANTLERWKE PILSYFLCPY TNARIEGTNH KIKNIKRRAY GYRNLERFRL RVFLECTGNT TGSQAA
|
| |