Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1542 |
Symbol | |
ID | 8807311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1641346 |
End bp | 1642329 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_003460779 |
Protein GI | 289208713 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00714409 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.00689732 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCATA TTGGCATTGA TGTCAGCAAG AACAAGCTCG ACTGCATGTG GGTCCGGGAT CTGGAGGCGG GCAAGGTCAA GCCCAAGGTA TTCCCGAACC GCCAGGACCA GTACCCGGAG TTGCTGCGCT GGCTGCAGCG CAACACGGGT GAAGCGCCCG AGACGCTTCA GGTGTATCTG GAGGCCACCG GCATCTACCA CGAACCGCTG GCCTACTGGC TGCATGAGCA GGGGGTTCGG GTCCATGTGC TGAACCCGGC CCAGGTGCGC TTCCACGCCC AGGGCATGGG GGTGCGCAAC AAGACGGATC GCAAGGACAG CATGATGCTG GCGCGTTACG GCATCGAACG CGCGCCCGCA CCGTGGCAGC CGGAACCCCC GGAGGTGCGG GAACTCAAGC GTCTTCTGGG GCGGCTGGAG GCGCTGGAGC AGGACATCCG GCGCGAGGAG AACCGGCTGG AGAAGGCGCA GTTCAGTGAG GATGCGCTGG CCCAGGAGTC GATCGGCAAC GTGCTGGAGG CCCTGCGCGA GGAACACCGG CGTCTCCAGC AGCAGATCGA CGACCACTTC GACGCGCATG ACCACCTGAA GCGGGATCGG GCCCTGCTGG AGAGCATCCC CGGGATCGGC CGGGTGTTAT CGGCCTCGAT GACCGCGACG TTGCGCAGCC GTGCGTTCAC CAGTGCTCGG CAGGCGGCGG CGTTCCATGG ACTGGTGCCG GTCCAGCAGG AATCGGGCAT CTCGGTCCAG CGGCGACCGC AGCTGGCGAA GGCCGGCTCC AGCCGCCTGC GACGCAAGCT CTACATGGCG GCGGTCACGG CCGCGCACCA CAATGCCGAC GTTCGGGCGC AGTACCAGCG GCTGTTGCGG CGCGGCAAGG CGAAGATGGC GGCAATCGGC GCCGCGATGC GCAAGCTGCT CCATATCGCC TTCGGTGTGC TGAAGTCTCA GCAGCCGTAT CAGCCTCGCT GCAGCACCGG TTGA
|
Protein sequence | MAHIGIDVSK NKLDCMWVRD LEAGKVKPKV FPNRQDQYPE LLRWLQRNTG EAPETLQVYL EATGIYHEPL AYWLHEQGVR VHVLNPAQVR FHAQGMGVRN KTDRKDSMML ARYGIERAPA PWQPEPPEVR ELKRLLGRLE ALEQDIRREE NRLEKAQFSE DALAQESIGN VLEALREEHR RLQQQIDDHF DAHDHLKRDR ALLESIPGIG RVLSASMTAT LRSRAFTSAR QAAAFHGLVP VQQESGISVQ RRPQLAKAGS SRLRRKLYMA AVTAAHHNAD VRAQYQRLLR RGKAKMAAIG AAMRKLLHIA FGVLKSQQPY QPRCSTG
|
| |