Gene Information |
Locus tag | TK90_2517 |
Symbol | |
ID | 8808301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2647692 |
End bp | 2648909 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003461743 |
Protein GI | 289209677 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
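The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2647692
END_BP = 2648909


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a complete CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1218 405
```

Both results agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.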
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.587179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0541201 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC TCCACGTCCT CGACCACTCC ATCCCGCTGC ACAGCGGGTA CACCTTTCGC ACCGCGGCCA TCCTGCGCGA ACAGCACCGC AGGGGCTGGC AGACGTGTCA CCTGACCAGC CCCAAGCATA CCGGCGCCCA GACCGACGAG GAGACGATCG ACGGCCTGCA TTTCTACCGC ACGACCTACA ACCCGAGCGC GATCCCGGGG GTGGAACAAT GGGGGTTGAT GCAGGCACTG ACCCAGCGGC TGACCCGCGT CGCCTGCCAG GTCGCACCCG ACATCATCCA TGCGCACTCG CCCGCGCTGA ATGCCCTGCC GGCCCTGCGT GCCGGTCGCC GGCTGGGCAT TCCGGTGGTC TACGAGGTGC GGGCATTCTG GGAGGACGCC GCGGTCGACC ATGGCACCGC CCGCGAACAT GGGCTGCGCT ACCGCCTGAC GCGAGAGCTG GAGACCTATG CCCTGAAGCG CGTGGGCCAT GTCACCACCA TCTGCGAGGG GCTGCGCCGC GACATCATCG CGCGCGGGAT CCCGCCCGAA CGGGTCACCG TGATCCCCAA TGCGGTGGAC GCGGATGACT TCCACCTGGG CGGTCAGCCC GACCCCGCCC TGAAGGCCCA ACTGGGTCTG GGTGACGCCC GTGTGCTCGG CTTTCTCGGG TCCTTTTATG CCTACGAGGG ACTGGATACC CTGCTCGAGG CCTGTCCGCT TATCCAGGCC CGGACGCCCG ACGTCCGCAT CCTGCTGGTC GGGGGCGGGC CGCAGGCCGA ACGCCTGAAG GCCCAGGCCG AACGCCTCGG CATCCAGGAC CGCGTCATCT TCACCGGGCG CGTCCCGCAT CACCAGGTAC AGCGCTACTA CGACCTGGTC GACCTGCTCG TGTATCCACG CCATTCGATG CGCCTGACCG AGCTGGTCAC GCCGCTGAAA CCACTGGAGG CGATGGCCCA GGGGCGCCTG CTGGTGGCAT CGGACGTGGG CGGCCACCGG GAACTGATCC GCGATGGCGA GACCGGCTGG CTGTTCCCCG CCGACAACCC CGAAGCCCTG GCCGCCCGGG TGCTGGCGAC ACTGGAGCAC CCGGAACACT GGCCACGGGT GCGCGAGAAC GGGCGGCGCT TCGTGGAACA GGAGCGCACC TGGGCGGCGA GTGTCGCTCG CTACCAGTCC GTCTACGACG GGCTGCGGGC CCGGCCGGAG GTGCTGCGTG CCGGATGA
|
Protein sequence | MKILHVLDHS IPLHSGYTFR TAAILREQHR RGWQTCHLTS PKHTGAQTDE ETIDGLHFYR TTYNPSAIPG VEQWGLMQAL TQRLTRVACQ VAPDIIHAHS PALNALPALR AGRRLGIPVV YEVRAFWEDA AVDHGTAREH GLRYRLTREL ETYALKRVGH VTTICEGLRR DIIARGIPPE RVTVIPNAVD ADDFHLGGQP DPALKAQLGL GDARVLGFLG SFYAYEGLDT LLEACPLIQA RTPDVRILLV GGGPQAERLK AQAERLGIQD RVIFTGRVPH HQVQRYYDLV DLLVYPRHSM RLTELVTPLK PLEAMAQGRL LVASDVGGHR ELIRDGETGW LFPADNPEAL AARVLATLEH PEHWPRVREN GRRFVEQERT WAASVARYQS VYDGLRARPE VLRAG
|
| |
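The Sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated is shown below. It uses only the first three blocks of the gene sequence as input. The helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for all non-start codons.

```python
# Standard codon assignments in TCAG order; table 11 differs only in
# which codons may act as translation starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_count(seq):
    """Number of G and C bases in a cleaned sequence."""
    return sum(1 for base in seq if base in "GC")


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGAAGATCC TCCACGTCCT CGACCACTCC")
print(len(prefix), gc_count(prefix), translate(prefix))
```

The translated prefix matches the first block of the protein sequence (MKILHVLDHS). Running the same helpers on the full gene sequence would allow the reported GC content and protein sequence to be checked in the same way.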