Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2506 |
Symbol | |
ID | 8808290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2633613 |
End bp | 2635169 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
Protein accession | YP_003461732 |
Protein GI | 289209666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.882544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA TTGTTCAGGA TTTTGTTCAG CGGGCACGCG CCACATGGCG CCGTCGCTGG TGGATTCTGC CGATTGCCTG GATTGTGTGT CTGGGCGGAT GGGCTTACAT CCAGAGCCTT CCGGATACCT ATCAGGCTAA TGCGCAGGTA TTCGTAAACA CCCAGTCCGT ACTGAATCCG CTGCTGCGCG GGATGACGGT GCGTCCGGAT ACAGAGGAGC GGCTGCGGAT GGTGACGCGC ACTCTGCTCA GCCGCGACAA CCTGGAAACC ATCGCCCAGC AAAGCGACCT GGACGTCTAC ATGGGCAGTG CCGACCTCGA CGCACAGGTC AACCTGCTGC GCGGCAGGCT GACCCTGGAC GGCGAACGGC GCGAAAACAT CTTCACTGTG CGTTTTCGCC ACAGCAATCC TGACGTCGCC GAGCGCGTGG TCCGCGAGAC GGTGAATCTG TTCATGGAGC GCGGCCTGGG CGACTCGCGC CTGGACCTGA GCACCTCGCA GCGATTCATT GAGCGTCAGC TCGAGACCTA CGAGCGCCAG CTGCAGGCCA GGGAAGCCGA GATCGAAGAC TTCAAGCGGC AGAACGCGCG GTTTATCAGC GGCGACGGCA ACTTCTACTC GCGCCTGGAG CGCGGCCGCG AACAGGTGGC TCAGGCAGAG CTGCAACTGC GCGAGGCGCA CAGCCGCCTG GCCGCGTTAC AGCGCCGTGC CGAGGAAAGC CGGTCGGGCG GAGGCACGGT CAGGGCGCAG TACCGCAACC CCGAGCTGGA CCGGCGGATT AACGACCTGC AGGAAAACAT CGATGCCATG CGGCATCGCT ACACCGACGA CCATCCCGAC ATCGTTGCCG CAAGGCGCAT CCTCTCGGAG CTGGAAGAAC GCCGCTCGCG CGAAGCCGCC GACTACGCCC GCAACCCAGC GGTGGTCCCC ATTGGGGGCA CCGGGGGTGG CGACTCGCTG CAGGCCGCGA TCACCGAGGC CGAGAGCGAG ATCGCGATGC TGGAGACCCG CATCGAGGAG TTCCAGGCGC GGGTCGCGCG GCTGGAACAG GATGTCGACC GCGCCCCGGC CGTGGAGTCG GAGCTGACCG CGCTGACGCG CAACTACGAC GTGCTGCGCG ACTCCTACCG CCAGCTCCAG AGCCGCCGCG AGCAGGCGAT CATGTCCGGC GCGGTGGAGT CGGAGACCGA CTCGGTCGAT TTCCGCGTCC TTGAACCGCC GCGCCGACCG ACGGCCCCCT CGGCACCCGA TCGCCCACTG ATGGCCACCG GCATCCTGCT GGTCGGCCTG GGCGCGGGAA CGGGCTTCGC CTTCCTGCTT TCGCAGCTGC GGCGGACGAT CAACAGCCGA CGACAGCTTG CGGAAATCGC CGGTCGCCCG GTTCTGGGCA CCCTCTCGGC CGTACGCACA CCCCAGATAC GGCGCCGCCG CAGGCTGGAG CTATCGGCCT TTTCGATCGC CCTGGCCACA CTCCTGCTGG TGTATGCCGT GGTCATGGGT CTTTACCTCA CCGGATCGGC CGGTGTTCTC GACGAACTGC TGAGCCTGTT GCGATGA
|
Protein sequence | MEKIVQDFVQ RARATWRRRW WILPIAWIVC LGGWAYIQSL PDTYQANAQV FVNTQSVLNP LLRGMTVRPD TEERLRMVTR TLLSRDNLET IAQQSDLDVY MGSADLDAQV NLLRGRLTLD GERRENIFTV RFRHSNPDVA ERVVRETVNL FMERGLGDSR LDLSTSQRFI ERQLETYERQ LQAREAEIED FKRQNARFIS GDGNFYSRLE RGREQVAQAE LQLREAHSRL AALQRRAEES RSGGGTVRAQ YRNPELDRRI NDLQENIDAM RHRYTDDHPD IVAARRILSE LEERRSREAA DYARNPAVVP IGGTGGGDSL QAAITEAESE IAMLETRIEE FQARVARLEQ DVDRAPAVES ELTALTRNYD VLRDSYRQLQ SRREQAIMSG AVESETDSVD FRVLEPPRRP TAPSAPDRPL MATGILLVGL GAGTGFAFLL SQLRRTINSR RQLAEIAGRP VLGTLSAVRT PQIRRRRRLE LSAFSIALAT LLLVYAVVMG LYLTGSAGVL DELLSLLR
|
| |