Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2441 |
Symbol | |
ID | 8808222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2561381 |
End bp | 2562631 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_003461667 |
Protein GI | 289209601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTC TTCCCGTCAC CGTCCTCTCC GGCTTTCTCG GCGCCGGCAA GACCACGCTG CTCAACAACA TCCTGCACAA CCAGTCGGGG CTGCGCGTCG CGGTCATCGT CAACGACATG AGCGAGGTCA ACATCGACAG CGCCCTGGTG CGCGGCGGCC AGGATGCCGT CACCCGTACC GATGCCCGCA TGGTCGAGAT GACCAACGGC TGCATCTGCT GCACGCTGCG GGACGACCTG CTGGTGGAGG TGCGCCGCCT CGCCGAATCC GGGCAATACG ACTACCTGGT CATCGAATCC ACCGGCATTT CCGAGCCCAT GCCCGTGGCC GCCACCTTCG CCTTCCGCGA CGAGAACGGT CAGTCCCTGG ATGATGTCGC CCGCCTCGAC ACCATGGTGA CCGTCGTCGA CGCCGGCGCT CTGCTGAAGG ACTATGCCTC CACCGATTAT CTAGGGGATC GGGGTGAATC ACTGGGCGAG GAGGACGAGC GCACCGTCGT CGACCTTCTG GTGGATCAAA TGGAGTTTGC AGATGTCATC ATCATCAACA AGGTGGACCG CGCGACGCCC GATGAACTCG CGACCGTTCG CGGCGTGGTC CGCGGCCTGA ACCGCGAAGC GCATGTGATC GAGGCCACCC ATGGCAACGT TCCCCTCGAA CTGCTTCTGG GCACCGGCCG CTTCGACTTC GACCGCGCAT CGCAGGCCGC TGGCTGGGCG CAGGAGATGA TGGGCGGACA CACGCCGGAA ACCGAGGAAT ACGGGATCTC GAGCATCGTG TATCGGGTGA ATCGCCCGTT CCACCCGGCC CGCTTCTTTG ACCTGCTGCA CCAGGAGTGG CCCGGCGTGA TCCGCAGCAA GGGCTGGTTC TGGCTGGCCT CGCGCCCGGA CTGGGCCGGC ACACTGTCGC AGGCCGGCGG CGCGTTGACG CATCACGCCG CCGGCTTCTG GTGGGCCGCC GTACCGCCGG AGAAACGTCC CACTAACGAG GACTGGTGGG ACACGGCCAT AGCGCCAGTA TGGGACGAAC GTTTCGGCGA TCGCCGCCAG GAGATCGTAC TGATCGGTAT AAAAATGGAC GCAGACGCCA TGCGCCAGCG TCTCGATGCC TGCCTGCTCA CCGACGACGA AATGGCGGCG GGCCCGCAGC AATGGCAGCA CTTCGAGGAC CCGTTTCCGG TCTGGCGCCT TTCGGACGAG CCCGAGGGAT CGCCGGAAGA ATCCCGGACC CGAGAGGTTT CATCCGCATG A
|
Protein sequence | MKRLPVTVLS GFLGAGKTTL LNNILHNQSG LRVAVIVNDM SEVNIDSALV RGGQDAVTRT DARMVEMTNG CICCTLRDDL LVEVRRLAES GQYDYLVIES TGISEPMPVA ATFAFRDENG QSLDDVARLD TMVTVVDAGA LLKDYASTDY LGDRGESLGE EDERTVVDLL VDQMEFADVI IINKVDRATP DELATVRGVV RGLNREAHVI EATHGNVPLE LLLGTGRFDF DRASQAAGWA QEMMGGHTPE TEEYGISSIV YRVNRPFHPA RFFDLLHQEW PGVIRSKGWF WLASRPDWAG TLSQAGGALT HHAAGFWWAA VPPEKRPTNE DWWDTAIAPV WDERFGDRRQ EIVLIGIKMD ADAMRQRLDA CLLTDDEMAA GPQQWQHFED PFPVWRLSDE PEGSPEESRT REVSSA
|
| |