Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0816 |
Symbol | |
ID | 8806571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 872055 |
End bp | 875456 |
Gene Length | 3402 bp |
Protein Length | 1133 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transcriptional activator domain protein |
Protein accession | YP_003460067 |
Protein GI | 289208001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.437222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGCAC TCCAGGATCT CCCCGAGCTC GCCGCCTGGG CACAGACACA GCGCGACCAC TACTGGTCCC AGCTGCTCCT TCTGATCGAG GAAAATCTCG ACGTTCTCTG TCATTCCTCA GACGCCGAGG CCCTGCTCAC CGAGCTCCAG CGCCTGGTGG CGATGGAGCC CCACGAAGAG GCTCTGCACC GTCTGGTCAT GCTGACCCTG GCCGGCATGG GTAACCGCGA GCGGGCGCTG GAACACTTTG GCCGCCTGGA GCGCCAGGCG GTCGATGGCC CTGGTCAGTT ACTGGTGCGA ACCCGCGACG AAATCCTGGT CCCAACACCA AAGGCACCGG CCACGGTGTC TCCGACTGCC ACACCCGAGA CAGCCTCCCT GCGTGCCGAG CGCCGCCGGG TCACCGTCGT GGCCTGCCTT GTCCAGGCGG ACTCCGACCT TGACGTAGAG CTGCAGCATG AACGCATGCA ACACACGCTG GAGCAACTGG ACCTGGTCTT GCTGAACCAT CGGGGCCATG TCGTTCGGGC CCCTGGCTGC GGCCTCCTCG CCTATTTTGG CTATCCTGCA GGACGCGAGC AGGCCACTGT CGATGCCGTC CACGCCGCCT GGCAAGTCCT GCAGACAGCA CCAAGCGACT GTCAGCTGCG CATAAGCGTC GATACCGACC TCATCGTAAC AGGGGTCGAC CCGGACACCC CCGACCCCGC TGGCCAGCTC AGCCAGCGCG CGCAGGCTCT GGCGCGTTCG ACCGACACGG GAACGCTCGT GGTCTCTCCA TCCGTGCACC AGCGCGTCTC GGGATATTTT CTCTTCAGCC CTCACGCGGG CACGGGAGCA CCCGAAGGAA GCCAGCGCGT CACGGGCGAC CCCGGCCCCC TGGATCGTGT CGATGCCCAG CACGCCCGAG GTGGCCTCAC CCCCCTGGCG GGACGCACAG AGGAGCTTGA GCGCCTGCAC CAAATCTGGC GCCAGACACT CAAACATCAC AAGCCGCACT TTGTACTGAT CCAGGGTGAA GCCGGACTGG GCAAGAGCCG TCTCGCACGA CACCTCGACG AGCGCATCCG CGGCCAGGCG CGCTCGCGCT TCCTGCTGAA GTGCCGCGAG GACCGCTCGC GGCACTTGTT GGCGCCCGTG CGTCAGGCGT TCGCCACCTG GCTTGGACTC GGCAGCATGA ATGCCAGCCG CCAGCGGCGA CTGGCACGGG TCCTCGGCAC CGCTCGCTCA CTCGATGCGG ACATCGCCGA CGCCCTGGCG CACTGGCTCA CACACCCGGA CAGCAACGGG CTCGAAGCGC TGGCCGCGCA GGGACTCGAC CGGGATCACA TCATTGATGT CCTCGTCGAG CTCGCCCGGC GCCGAGCCCG CCGAGGCCCC CTCCTGCTGA TCTGTGACGA CCTGCACTGG ATGGATTCCG GCACGGCCGA ATTCCTTCGA CGCCTGTATC AGCGCCTGCA TGACCGCCCG TTGCTGACCC TGCTCACGGC CCGCATGAGC TTTCGCTCCA ACTGGCGCGA TGTCCCGCTT GAGCATCTGC GACTGCACGC CCTGCCCGAC CAGGCTGCAC ATGAACTCAT CGAGCGTTGC GACACCCGGC ATCGCCTGAC GAAACACCTG CGCGCGCAAC TCATCGAACG CGGCGAGGGC GTCCCGCTTT TTCTCGAGGA GCTGACGCGC TACACCCTCG AAGCCGCGCG CACGGGGACC CTCCCCGACA CCTTGCCGCC CGGGCTGGCC AATCTGCTGG TGGCACGCCT GGAACAAACC GGGACCGCGC GGGATTTGGC CCACGCCGCC GCCGTAATCG GACGCGATTT TGATACACGC ATCCTGAGTC ACCTCCTGGA GCAGCCAGCC GACACGATCC GCCGCCAGCT CAGCCGCCTC ACGGGCCGCG GGTTCGTCGA GCCGGTCGGC GTCCTGGACG GTGTGCGCTA CCGTTTCCGT CACGCCCTAT TCCAGCAGGC TGCCTACGAA TCCATGCTGG TGGCCGAACG AACGCGTCTT CACGGCCGCC TGGCGGATCT CCTGCGTGAC GATCAAGGCA CGAGCAATAT CGACTGGGGC CAGGTGGCCA GCCATCTTTA TCACGCCGGA CGTGTGGACG AGGCCGTGCC CGCCTGGCTC GAGGCGGGAG AGGCGGCGTT CGCCCGGGGC ATGCTGCTGG AGGCATCACA CTACTTCGAA GATGCCCTCA GCTGTCTCGA TCGCGAGCTA GGCGACCGGG AACCCTCGGA AGAACGCGAC GCGCAGACGC AGCGAGCTCT GGGCGGACTG GGTGCCTCAA ACCTCGCCCT GTTGGGGTAC GGCTCCCGCA CGGTCCACGC GATCTTCGAA CGCACCTTGG GCATCACCGC GCCTCAGCGA GATGCGGTGC AGTACTTCCG TGTGCTCTGG GGCCTTTGGC ACGGCGCCGG ATCCTGGCAC GGCTTTGATG AAGCCCACCG ACTGGCCAAC GACATGGAAC AGGTCGCAGA GCAGTCGAAC GACCGCCTGC TACGAATCGC CGCACACTAC GTACAAGGCA ACACGCACTT CTGGAGTGGC CAGCTACCGC TGGCGCTGGA ACACCAGACC CAGGCGCTCG CCCTGTACCG CGAAGACGAC CACCCGCACC TGGTCGCGCG CCACACCGAG AACCCCGCGA TCAGCTCACG CGGCTTCATG GCCTGGACGC TCCTCTTCTT CGGCAACCAC GCTCGCGCGT GGGAGGAAAT GGATGCGGCC TGCGCCGAGG CCGCAGCGCT GGAGCACCGC CCCACAGAGG CCTTTGTATA CGCCTTCCGG GCAGCGCTTG GCTTCTTCGC CGACGACCCT GACGAAGCCC TGACCTCCGC CCGCCGCGCG CTCGCGATTG CCGAGGACTA TGACTATCCC CTCTGGCGGG CCTCAGGCCT GGCACTACAG CACTGGGCCC TGGCGCGCCA GGGCGACACC TCGGCCGTGG CACCACTCCG GGAACAGGCA GACGCATTGC GCCACATCAT GGATGGCGTC AGCACCATCT TTCAGCTCTT CCTGCTCGAC GCACTGCATG CAACCGGAGC CAAGCCTTCT GAGCGTCTGC AGATCGCCAC CGCTACACTG ACCAACTGCC TGAAACGAGG CGACCACGGC TTCGAGCCCG CAATCCGGCG ACTGCGTGCC CATGCGCTGC TGGAACACAG CGACGGCACC GACAACGAGG GTTGGCAGGA GCTGGAGCGG GCACGCCGTC TGGCCCACGA CCACGGCAAC CCGAATATCG AACGATTAAC CCTGCTCGAC TATGCCCGCT TTGCTCGCGA TGACGTCTGG CGCCAGTGGA GCGAACGCGA ACTCCTGCGC ATCAACCACT GCATCATCCC CCGACGACAC CCATCCGAGC AGGCGCTGCA GCAGAACCCG CGGCTCGCCT GA
|
Protein sequence | MRALQDLPEL AAWAQTQRDH YWSQLLLLIE ENLDVLCHSS DAEALLTELQ RLVAMEPHEE ALHRLVMLTL AGMGNRERAL EHFGRLERQA VDGPGQLLVR TRDEILVPTP KAPATVSPTA TPETASLRAE RRRVTVVACL VQADSDLDVE LQHERMQHTL EQLDLVLLNH RGHVVRAPGC GLLAYFGYPA GREQATVDAV HAAWQVLQTA PSDCQLRISV DTDLIVTGVD PDTPDPAGQL SQRAQALARS TDTGTLVVSP SVHQRVSGYF LFSPHAGTGA PEGSQRVTGD PGPLDRVDAQ HARGGLTPLA GRTEELERLH QIWRQTLKHH KPHFVLIQGE AGLGKSRLAR HLDERIRGQA RSRFLLKCRE DRSRHLLAPV RQAFATWLGL GSMNASRQRR LARVLGTARS LDADIADALA HWLTHPDSNG LEALAAQGLD RDHIIDVLVE LARRRARRGP LLLICDDLHW MDSGTAEFLR RLYQRLHDRP LLTLLTARMS FRSNWRDVPL EHLRLHALPD QAAHELIERC DTRHRLTKHL RAQLIERGEG VPLFLEELTR YTLEAARTGT LPDTLPPGLA NLLVARLEQT GTARDLAHAA AVIGRDFDTR ILSHLLEQPA DTIRRQLSRL TGRGFVEPVG VLDGVRYRFR HALFQQAAYE SMLVAERTRL HGRLADLLRD DQGTSNIDWG QVASHLYHAG RVDEAVPAWL EAGEAAFARG MLLEASHYFE DALSCLDREL GDREPSEERD AQTQRALGGL GASNLALLGY GSRTVHAIFE RTLGITAPQR DAVQYFRVLW GLWHGAGSWH GFDEAHRLAN DMEQVAEQSN DRLLRIAAHY VQGNTHFWSG QLPLALEHQT QALALYREDD HPHLVARHTE NPAISSRGFM AWTLLFFGNH ARAWEEMDAA CAEAAALEHR PTEAFVYAFR AALGFFADDP DEALTSARRA LAIAEDYDYP LWRASGLALQ HWALARQGDT SAVAPLREQA DALRHIMDGV STIFQLFLLD ALHATGAKPS ERLQIATATL TNCLKRGDHG FEPAIRRLRA HALLEHSDGT DNEGWQELER ARRLAHDHGN PNIERLTLLD YARFARDDVW RQWSERELLR INHCIIPRRH PSEQALQQNP RLA
|
| |