Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1977 |
Symbol | |
ID | 8807751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2098822 |
End bp | 2101077 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Organic solvent tolerance protein |
Protein accession | YP_003461204 |
Protein GI | 289209138 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCT CGCCAGCCGG CCACGCACGC CCACTGCACC GACGGGCCAC AGTGCGAGCC ATCATTGCCT GCGTGCTCGG CGTCGCCGCC ACGACGGCGC AGGCCGAACT CGAGCGGCTG GAGCGCCCCC CGGGCTGCGA ATTCGCCCCC GGCATGGAGT GGCCCGAGAT GCGCGAGCTC GCGGCCGAGA TCACCGATAT CGACGCCGAT TACGTACGCC GCGAGGGGGA GATGATCAGC GCCCGCGGCA ACGTCCAGCT CTTCGAACAA GGGCGCCGCC TGGACACCGA GTTTCTTGAA TACGACCGGG CCACGGGCCT CGCACGTACC GAGGGCGACA CCCGATTGTT CGACGGCGAC CTTCTGCTGC ACACCGACGG CGGCGAATAC CGTCTGGACG AGGAGACCGG GGAGTTCCGG CGCATCGACT ACCTGGTGCA GTCCGTGCCG GCCCGCGGAA GCGCCGGGGT GGGCATCCAG CTCGCGCCTC ACCTGCACGA ATTCGAGGAC GCCGAGTACA CCACCTGCCC GATCAACAAC GACAGCTGGT GGCTGCGCAC GCGCGATCTG GAGCTGGACC GCGAGCGCGG GATCGGCGTT GCCCGCAACG CCCGCATCGA CTTCAAGGGC GTGCCCATCG CCTACACGCC GTATATCCAC TTCCCGCTGG ACGACCGGCG CCAGAGCGGC TTCCTGCTGC CCAGTGCCGG GGTCAGCTCA CGACACGGGG TCGATATCAG CACGCCCTGG TACTGGAACA TCGCGCCGAA TTACGACGCG ACTTTCACCC CGCGCGTCAT GTCGGACCGC GGGGTCAAGC TCGGCACCGA GTTCCGTTAT CTGCAGCCCC GCCACGAGGG CGTGGCCGAT ATCCGCTACC TGCCGAGCGA TCGCGTATTC GGCGACGACC GGCACTACGT GCAGTGGTTT CACCAGAGCC GCCTGACCTC GGACCTGCGC TTTCGCCTGG ACGCCGCCGA TGCCTCCGAC AGCGAGTATT TCCGCGACTT CGGTGGCGCG GACCGCTACT CCGGCAGCAC GCCCTATCTG CGCCGTCGAG CGGACCTGAC CTACAACCAG CCCGCCTACC GCCTGCGCAC CCGGATCGAG GACTACCAGG TCATCGACCC CGAGCTGCGG GCCCAGCGCG AGCCCTACCA GCGCCTGCCG CAGATCACCC TGGAGGCGGG CGAGTATCTG GGGCGCGGCC TGGCCTGGGA CCTGGACACC GAACTGGTGC GCTTCGAGCG TAGCGAGAGC GACTCGCTGG AACCCCTGGG CACGCGCTTC GACCTCAACC CCACGGCCTC GTGGCGCTAC GATACCGGGG GCTTCTTCTT CGAGCCCAGC GCCGGCGTGC GCTACACCCA CTACGAGCTG GACCGCCGCG GTCTCCCGGG CCCCGAGTCG ATCAGCCGCA CCGTGCCGCG CGCCTCGGTC GACACCGGGC TGGTCTTCGA GCGCCCGATC CAGGACGGGC GTCGTCTGCA GACGCTGGAA CCCCGGATCT TCTACGGCTA TGTCCCGTTC GTCGACCAGA CCGATATCCC GATTTTCGAC ACCGGCGAGG CGGACTTCAG CTTCGACAAC CTGTTCAGCC TGGATCGCTT CGTCGGCGCC GACCGCGTCG GCGACACCCA CCAGCTGACC ACCGCGCTGA CCACGCGCAT CTTCGACGAG CAAACCGGAC AGGAGCGGAT GAGCCTGTCG GCGGGCCAGA TCCGCTACTT CGAGGACCGC GAGGTCCGCC GCGACCCGCA AGGCGACCCG CTGACCGAAT CGCGTTCGGA TCTCGCCCTG GAGGGTCGCG TGCGCATCGG CGATGCCTGG AGCGGTCGCG GCTCCATCCT GTACAACCAG GACGAAGGCG AGACCACCCT CACAGCGCTG TCGGTCAGCT ACAGCCCGGG CGCGCGGGCC CGCATCAATG TCGGCTATCG TCAACGCGAG GCGGGCGCCC GCTCCATCGA CCAGACCGAT GTCTCCGTGC TGTGGCCGGT GACCCCGCGC ATCGATGCGA TCGGGCGCTG GAATTACAGC CTGCAGGAGG AACGTGATCT CGAACTTCTG GCCGGCCTTC AGTATCGTAG TTGCTGCTAT GGCATCCGAG TGGTCGCCCG CCGCGCCTAC GACTTCGACG AGACCTACGA CAACTCGATT TATTTCCAAC TCACGCTGGA CGGGCTCGGA CGCTTCGACT CCGGCGTGGA TTCGCTTTTG AGCGAGGGTA TAGCCGGATA TGACGCACTC CCCTGA
|
Protein sequence | MTPSPAGHAR PLHRRATVRA IIACVLGVAA TTAQAELERL ERPPGCEFAP GMEWPEMREL AAEITDIDAD YVRREGEMIS ARGNVQLFEQ GRRLDTEFLE YDRATGLART EGDTRLFDGD LLLHTDGGEY RLDEETGEFR RIDYLVQSVP ARGSAGVGIQ LAPHLHEFED AEYTTCPINN DSWWLRTRDL ELDRERGIGV ARNARIDFKG VPIAYTPYIH FPLDDRRQSG FLLPSAGVSS RHGVDISTPW YWNIAPNYDA TFTPRVMSDR GVKLGTEFRY LQPRHEGVAD IRYLPSDRVF GDDRHYVQWF HQSRLTSDLR FRLDAADASD SEYFRDFGGA DRYSGSTPYL RRRADLTYNQ PAYRLRTRIE DYQVIDPELR AQREPYQRLP QITLEAGEYL GRGLAWDLDT ELVRFERSES DSLEPLGTRF DLNPTASWRY DTGGFFFEPS AGVRYTHYEL DRRGLPGPES ISRTVPRASV DTGLVFERPI QDGRRLQTLE PRIFYGYVPF VDQTDIPIFD TGEADFSFDN LFSLDRFVGA DRVGDTHQLT TALTTRIFDE QTGQERMSLS AGQIRYFEDR EVRRDPQGDP LTESRSDLAL EGRVRIGDAW SGRGSILYNQ DEGETTLTAL SVSYSPGARA RINVGYRQRE AGARSIDQTD VSVLWPVTPR IDAIGRWNYS LQEERDLELL AGLQYRSCCY GIRVVARRAY DFDETYDNSI YFQLTLDGLG RFDSGVDSLL SEGIAGYDAL P
|
| |