Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0216 |
Symbol | |
ID | 8805946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 223723 |
End bp | 225378 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003459467 |
Protein GI | 289207401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.590946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.120433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAACC ACACCAGATT CCGAGTCTGT GGCATATCGC TGATCGAGGC CCTGATCGCT ATAGCGGTGG TCTCTGTCGG CCTTTTGGCA ATAGCACGAC TCCACGGCGA CCTGCTTTCG AGTGCAGGGG ACTCAAAGGC TCGCGCGGAA GCGATGCAGA TTGCGGAAAC GGAAATCGAA AAGCTGCGCA ACGCCTCGAC GCTAACGGAT CTCAATCGAC GGCTTGAGAA TGTCCGAACA GAGGCGCCAG GGCGCAACGC CGACTACACG CTTGAATGGA GCAACCTACC AGACACCTCG GATTTCGGGT TTTTCGAGCC CGAGGTGCGC GTCACTTGGA TTGACCGGCG GGGTGACGCG CAGAGCGTCC TCATCAGCAC GGCGGTGGCC TGGAGCAACC CCCGATTCAG TGTCGCAGCC GCGCGCGCGG ATTCCGAAGG TGGAAACTTC CTTAATGCGC CAACGGGCCG CGCGAGAATG GGAGGCGGAG AGCTGCTTGA GGGAACCGAC GTCACCACGG TCACCAACGT CATCGAAGGC ACGTCCATCG AAGACGGCAC CGAGACCCGC CAGCGCGACG GAGAAGTTCA GCTCGTGCGT GACGGTGAAG TGGTTATGAC GATGGACACC ACCAACCCCG GGGAGCTATT CAGCACTATT AGCGGCCGGG TGTATTACGG AGGAACGATC AGTCTGAATC CCGTCGTCAC GCGCGTTCTT TCGTCCGACG CAGGCTTCTG CACCCGCCGT GGCCAGGAAC CCTTGGGTAG TGCTCTGAAG CCCTTCGTGG ATAGCGCCTC GGGGTACTCG TACTTTTACT ACCAGTGCTA CGTTGGCGAG GGATGGTATG GGAACGTGGG GATCGTCCGC CAGGAGAATG CGAACCCGAA TGACCGCTCC TGCGTCGGCG ACCACTTAGC GGAGGAAACG AGCGCCTGGG ACAGTCGCGA GCCACGCACA AGCTCGAACC GCTTCTACCG CTCGTTCATC GAGACAACAA GTGGTATTCG AGCCATCGGC ATCGGGATCA CTCCAGCCGA AGGGGACAGC CAGAACCCAA CCTACGTTGC CCAGAACCTG ACAAGCCACG ATTTCATGCT TACGCGCTTC CGGAGCGGAA ATAATCGCCC GAGGGACTGC TTCGAGGCCC TTGAAGATGG CGAAGGTGCA TTCGAGGACA ACGCGGGCCG GCATGTCTGT CTTTACGAGG GCGGCATGGG CTGCGGCACA GACGACGGAG ACTCTTTGCC ACAACCCGGT GACCAGGAAA CCACCCTAGT CGCCGGCAAC GTCGAGCAGA TTAATTCGCC CACCGGGAAC AGCCGCCCAG TATTGGAAGA CCTGACCTTC GACAGCGTCT ACGCCACCTG CGGACCGAAC CGAGAGTCTC TGGGACAAGG AAGGAATGAG GTCGAGTCGA TCACTTCTTA TGAGTGCGTC ATCGACTGGG AAGGCTGGAC AGGCGAGACA TGGTCCAGCG ACATGATCCT GACGCTTGGA CCAAACACCA AACTCTGCGA CGTAGGCACG GCTGAACTTG GCGGCGGAAC CGTAACGAAA TCGGGCGACA ATGTGCTCAT CTTTGACCGC ATCAGCCAAA CCGAAGCGCT ACAGTTTGAC TTTGCAGTCG CGCACGAAGA TGCAGACTGC CCATAG
|
Protein sequence | MHNHTRFRVC GISLIEALIA IAVVSVGLLA IARLHGDLLS SAGDSKARAE AMQIAETEIE KLRNASTLTD LNRRLENVRT EAPGRNADYT LEWSNLPDTS DFGFFEPEVR VTWIDRRGDA QSVLISTAVA WSNPRFSVAA ARADSEGGNF LNAPTGRARM GGGELLEGTD VTTVTNVIEG TSIEDGTETR QRDGEVQLVR DGEVVMTMDT TNPGELFSTI SGRVYYGGTI SLNPVVTRVL SSDAGFCTRR GQEPLGSALK PFVDSASGYS YFYYQCYVGE GWYGNVGIVR QENANPNDRS CVGDHLAEET SAWDSREPRT SSNRFYRSFI ETTSGIRAIG IGITPAEGDS QNPTYVAQNL TSHDFMLTRF RSGNNRPRDC FEALEDGEGA FEDNAGRHVC LYEGGMGCGT DDGDSLPQPG DQETTLVAGN VEQINSPTGN SRPVLEDLTF DSVYATCGPN RESLGQGRNE VESITSYECV IDWEGWTGET WSSDMILTLG PNTKLCDVGT AELGGGTVTK SGDNVLIFDR ISQTEALQFD FAVAHEDADC P
|
| |