Gene Information |
Locus tag | Tgr7_0944 |
Symbol | |
ID | 7314975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1017344 |
End bp | 1018360 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643615829 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002513019 |
Protein GI | 220934120 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
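
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are hypothetical, chosen for this sketch.
start_bp = 1017344
end_bp = 1018360


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1017 338
```

The result matches the Gene Length (1017 bp) and Protein Length (338 aa) rows.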
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCAGAGA TTACTACTGT AGGTCTGGAT CTGGCAAAGA ACGTGTTCCA TGCGGTATGT CTGGATCGCC ATGGGGGTGA GGTACGCAAG AAGGTGCTGC GCCGCGCCCG GGTTCTTGAA TGGTTTGCCA ATCTGACGCC CTGCCTGGTG GGCATGGAGG GCTGTGCGGG GTCGCATTAC TGGGCCCGGG AGCTGCAGGC GCTGGGTCAT GAGGTGAGGC TGATCCCGGC CCAGCACGTC AAGGCCTATG TGCGGGGTCA GAAGAACGAC TACAACGATG CCCGGGCGAT AGCCGAGGCG GTGGTGCGCC CGGGGATGCG TTTCGTGGCG ATCAAGGAGC AGGTGCAGCA GGATGTGCAG GCGCTGCACC GGATGCGCGC GGGTGTGGTG AGCGAGCGCA CGGCGCTGTG CAATCGGTTG CGCGGGCTGT TGACCGAGAA CGGGATCGTG TTGAGCCAGG GGATAGGGCG GTTGCGCCGA CACATCCCGG CGTTGCTGGA GGATGGCGAG AACGGGCTCA GTGGTCTGTT TCGGGAGTTG CTGGCGCGAG GTTACCGCCA GCTGTGTGAA CTCGACGATC ATATCGAGTA CTACAACGGG CTGATCGAGC GGGGTGCGCG GGAGCGGGAG GCCGAGCGGC GGCTGCGCAC GATCCCGGGG TTTGGGCCGG TGTTGGCCAG TGCGTTTCAT GGCGCGGTGG GCGATGGGCA GGGCTATCGC CGGGGGCGGG ATGTCTCGGC CTCGCTGGGC GTGGTGCCGC GTCAGCACTC CAGCGGTGGC AAGGCGGTGC TGCTGGGGAT CAGCAAGCGT GGAGATGGGT ATCTGCGCAG CCTGCTGGTG CACGGGGCGC GTTCGGTGGT GCTCAGAGCA CCGGGCAAGG AGGACCGCCT GAGCCGCTGG GTGCAGCGCC TGGTGGCCGA GCGGGGGGTG AACAAGGCCA CGGTGGCGCT GGCGAACAAG CTGGCACGCA TCGGCTGGGC GGTGTTGCGC CATAACACGG TCTATCAACC GGCCTGA
Protein sequence | MSEITTVGLD LAKNVFHAVC LDRHGGEVRK KVLRRARVLE WFANLTPCLV GMEGCAGSHY WARELQALGH EVRLIPAQHV KAYVRGQKND YNDARAIAEA VVRPGMRFVA IKEQVQQDVQ ALHRMRAGVV SERTALCNRL RGLLTENGIV LSQGIGRLRR HIPALLEDGE NGLSGLFREL LARGYRQLCE LDDHIEYYNG LIERGARERE AERRLRTIPG FGPVLASAFH GAVGDGQGYR RGRDVSASLG VVPRQHSSGG KAVLLGISKR GDGYLRSLLV HGARSVVLRA PGKEDRLSRW VQRLVAERGV NKATVALANK LARIGWAVLR HNTVYQPA
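
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a minimal sketch using only the Python standard library. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments for
# translation table 11 are identical to the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(dna):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence listed above.
prefix = "ATGTCAGAGA TTACTACTGT AGGTCTGGAT"
print(translate(prefix))  # MSEITTVGLD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content row.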