Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2390 |
Symbol | |
ID | 8808171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2509458 |
End bp | 2511458 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transketolase |
Protein accession | YP_003461616 |
Protein GI | 289209550 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0781466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACCC GTAGAGAGCT TGCCAATGCC ATTCGCGCCC TGTCGATGGA TGCCGTCCAG AAGGCCAATT CCGGCCACCC GGGCGCCCCA ATGGGGATGG CGGATATCGC CGAAGTGCTG TGGAACGACT ACATGCAGCA CAACCCGGCC AACCCGCACT GGCCCGATCG CGACCGCTTC GTGATGTCCA ACGGCCACGG CTCCATGCTG ATCTACTCCC TGCTGCACCT GACCGGCTAC GAGTTGCCTA TGGACGAGCT CAAGCAGTTC CGTCAGCTGC ACTCCAAGAC CCCGGGTCAC CCGGAATACG GCTACACCCC GGGTGTGGAG ACCACGACCG GCCCGCTGGG CCAGGGGATC TCCAACGCGG TGGGCATGGC GCTGGCCGAG AAGATGATGG CCGCGCACTT CAACCAGCCC GAGCACGACA TCGTTGATCA TCACACCTAC GCCTTCCTCG GCGATGGCTG CATGATGGAA GGGGTCTCCC ACGAAAGCTG CGCGCTGGCC GGCACCCTGG GCCTGGGCAA GCTGATCGCC TTCTGGGATG ACAACGGCAT CTCCATCGAC GGCGAAGTCG AGGGGTGGTT CACCGACGAT ACCCCGACCC GTTTCGAAGC CTATGGCTGG CACGTGGTAC GTGGTGTGGA TGGTCACGAC GCCGATGCGG TCAAGGCCGC GATCGAAGAG GCACGCCAGA ACAGCGACAA GCCGAGCCTG ATCTGCTGCC GCACCGTGAT CGGCTTCGGT TCGCCGAACA AGTGTGGCAA GGAAGAGTGT CACGGCGCCG CCCTGGGCGA CGACGAAGTC GCGCTGACCC GCAAGGAGCT CGGCTGGAAC CACGCCCCGT TCGAGATCCC GAACGAGATC TACGAAGGCT GGAGTGCAAA GGACAAGGGC GCCAAGGCCG AGTCCGAGTG GAACGAACGC TTTGCCGCGT ACAAGCAGGC GCATCCGGAA CTGGCCGCCG AGTTCGAGCG CCGCATGGCC GGTGACCTGC CGTCCGACTG GAGCGAAAAG GCCAACGCCT ACATCAAGGA AACCGTCGAG AAGGCCGAGA AGGTCGCCTC GCGCAAGGCC TCGCAGAACG CGATCGCCGG CATGGCGCCG GCCGTCCCGG AACTGCTGGG CGGCTCCGCC GACCTGGCCG GCTCCAACCT GACCATGTAC TCCGGCTCCA AGGGCGTCAG CAGGGACGAT GCCGCGGGCA ACTACGTCTA CTACGGCGTG CGCGAGTTCG GCATGGCCGC GATCATGAAC GGCATCGCCC TGCACGGCGG CTTCATCCCG TACGGCGGCA CCTTCCTGAT CTTCTCCGAC TATGCCCGCA ACGGCATCCG CATGTCCGCG CTGATGGGCC AGCGTGTCAT GTACGTACTG ACCCATGACT CCATCGGTCT GGGCGAAGAC GGCCCGACCC ACCAGCCGAT CGAGCAGACC CCGAGCCTGC GCCTGATCCC CAACCTGAAC GTCTGGCGCC CCTGCGACGC GGTCGAGACC GCCGTCGCCT GGAAGGCCGG TATCGAACGC ACCGACGGCC CCTCCGCGTT CATCCTGTCG CGTCAGGGCC TGCCGCACAT GGAGCGCGAT GGCGAACAGC TCGCCAACAT CGCCCGTGGT GGTTACATCC TGAAGGACTG CGACGGCACC CCGGACCTGA TCTTTATCGC GACCGGCTCC GAGGTCGAGC TGGCGGTCAA GGCGGCCGAG GAGCTTGCCG GCGAAGGCAA GAAGGCGCGC GTCGTCTCCA TGCCCTGCGT CGAGCTGTTC GAGCAGCAGG ACCCCGCCTA CCGGGAGGCC GTGCTGCCGG CCGCCGTGCG CAAGCGCGTG GCGGTCGAGG CGGGCGCCAC TGCGGGCTGG TACAAGTTCA CCGGTCTGGA CGGCGCCGTG TTCGGCATGG ATCGCTTCGG CGAGTCGGCC CCGGGTGGCG CCCTGTTCGA CTACTTCGGC TTCAAGCCGG AGAACGTCGC CCAGATCGCC CGTGACGTGC TGAACGGCTA A
|
Protein sequence | MPTRRELANA IRALSMDAVQ KANSGHPGAP MGMADIAEVL WNDYMQHNPA NPHWPDRDRF VMSNGHGSML IYSLLHLTGY ELPMDELKQF RQLHSKTPGH PEYGYTPGVE TTTGPLGQGI SNAVGMALAE KMMAAHFNQP EHDIVDHHTY AFLGDGCMME GVSHESCALA GTLGLGKLIA FWDDNGISID GEVEGWFTDD TPTRFEAYGW HVVRGVDGHD ADAVKAAIEE ARQNSDKPSL ICCRTVIGFG SPNKCGKEEC HGAALGDDEV ALTRKELGWN HAPFEIPNEI YEGWSAKDKG AKAESEWNER FAAYKQAHPE LAAEFERRMA GDLPSDWSEK ANAYIKETVE KAEKVASRKA SQNAIAGMAP AVPELLGGSA DLAGSNLTMY SGSKGVSRDD AAGNYVYYGV REFGMAAIMN GIALHGGFIP YGGTFLIFSD YARNGIRMSA LMGQRVMYVL THDSIGLGED GPTHQPIEQT PSLRLIPNLN VWRPCDAVET AVAWKAGIER TDGPSAFILS RQGLPHMERD GEQLANIARG GYILKDCDGT PDLIFIATGS EVELAVKAAE ELAGEGKKAR VVSMPCVELF EQQDPAYREA VLPAAVRKRV AVEAGATAGW YKFTGLDGAV FGMDRFGESA PGGALFDYFG FKPENVAQIA RDVLNG
|
| |