Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3025 |
Symbol | |
ID | 7315953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3166321 |
End bp | 3167301 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643617923 |
Product | transposase IS4 family protein |
Protein accession | YP_002515082 |
Protein GI | 220936183 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.965699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGA TGACCTTCGC CGACGCCGAG TACGCTGGCA AGCGCAAGCA AACCCGCAAG GAGTTGTTCC TGATCGAGAT GGATCGGGTG GTGCCGTGGA AGGGCTTGAT TGCTTTGATC GAGCCACATT ATCCGAAGGG TGAAGGTGGC CGTCCGGCCT ACCCGTTGAT GGCGATGCTG CGTGTGCATC TGCTGCAGAA CTGGTTCGGC TACAGCGATC CAGCGATGGA GGAAGCGCTG TACGAAACCA CGATCCTGCG CCAGTTTGCC GGGCTGAACC TGGAGCGCAT CCCCGACGAA ACCACCATTC TCAACTTCCG CCGCTTGCTG GAGAAACACG AGCTGGCGGC CGGCATCCTC GCTGTCATCA ATGGCTATCT GGGCGACCGC GGCCTGTCGC TGCGCCAGGG CACCATCGTC GATGCAACGC TGATCAATGC GCCCAGTTCG ACCAAGAACA AGGACGGCAA GCGCGACCCG GAAATGCACC AGACCAAGAA GGGAAACCAG TATTATTTTG GCATGAAGGC CCACATCGGC GCCGATGACG AATCGGGTCT GGTGCACAGC GTAGTGGGCA CGGCGGCCAA TGTGGCGGAT GTCACCCAGG TGGACAAATT GCTGCATGGC GACGAAAACG TGGTCTGCGC CGATGCAGGC TACACCGGTG TCGAAAAGCG GCCCGAGCAT GAAGGACGTG AAGTTATCTG GCAGGTGGCG GCACGCCGCA GCACCTACAA AAAACTCGAT AAGCGCAGCG TGCTGTACAA AGCCAAGCGC AAGATTGAAA AGGCCAAGGC TCAGGTGCGC GCCAAGGTCG AGCATCCGTT CCGGGTAATC AAGCGCCAGT TCGGTTACAC CAAGGTGCGC TTTCGCGGCT TGGCCAAAAA CACGGCGCAA CTGGTGACAC TGTTCGCTCT GTCGAACCTG TGGATGGCGC GCCGACATTT ACTGGCGAAT GCAGGAGAGG TGCGCCTGTA A
|
Protein sequence | MKQMTFADAE YAGKRKQTRK ELFLIEMDRV VPWKGLIALI EPHYPKGEGG RPAYPLMAML RVHLLQNWFG YSDPAMEEAL YETTILRQFA GLNLERIPDE TTILNFRRLL EKHELAAGIL AVINGYLGDR GLSLRQGTIV DATLINAPSS TKNKDGKRDP EMHQTKKGNQ YYFGMKAHIG ADDESGLVHS VVGTAANVAD VTQVDKLLHG DENVVCADAG YTGVEKRPEH EGREVIWQVA ARRSTYKKLD KRSVLYKAKR KIEKAKAQVR AKVEHPFRVI KRQFGYTKVR FRGLAKNTAQ LVTLFALSNL WMARRHLLAN AGEVRL
|
| |