Gene Information |
Locus tag | TK90_0599 |
Symbol | |
ID | 8806344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 632386 |
End bp | 634245 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003459850 |
Protein GI | 289207784 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.300607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCGTC ATTTTGGTTG GATTGTCACC TTGATCGTGC TGGGCATCCT GCTGTTGCTG GTCATGGTGC AGGTAGACCG CCAGTGGCAG CGCATGGCCG CCATGCAGAA CATGATGAGC GAGCAGGCGC GCGACCTGCG GGCACTGCGC CGCAGTCTGC AGAGTCTCGA GTCCGATCTG GCCTCGGGGC GTTTCACCAC TGGGGACGGC GAGCGCGCAC CCCGGGTCGC GGAGGTCCCC TCGGCATTCG AGCGGGCGGC CGAGGCGGCG GGGCGCGAGG ACTACGCGGA GGGCGACTGG CTGGTTCAGG CCTTCGGCAC GGGGCTGGCG ACGATCACGC CCTATGTCTC CAGCGATGCC TATGCCGCCG ATGTGCAGAA CTATGTGCTG GAGTCGCTGG TGCAGCGCCA CCCGGAGACC CTGGAGTGGC AGGGCCTGCT GGCGGCCTCC TGGGACTTCG ACGACTCTGG CCAGGAGCTG AGCTTCCAGC TGCGCCCGGG CCTGCAGTTC TCCGACGGCG AGCCGCTGAC CGCGGAGGAC GTGGAGTTCA CCTTCAACTT CCTGATGACC GATGCGATTG CGGTGCCGCG CGCGCGGGCC TTCCTGGAGA AGGTCGAGAA GGTGGAGGCA GCGGACGAGC GCACCGTCGT GTTTACCTTC GAGGAGCCGT ATTTTGACGC CCTGCGGGTT GCCGGTGGGC TGAACGTCTT GCCGAAGCAT TTCTACGAGC GCTACCTCGA CGAGCCCGAG ACCTTCAACG AGTCGCGCGG GATCCTGCTC GGATCCGGCC CCTACCGGCT CGAGGACCCG ACCGGCTGGA CGCCGGATGC CGGGCGGATC GAACTGGAGC GCAATCCGCG CTACTGGGGC CCGGTGGAGC CCTCGTTCGA CCGGCTCGTC TGGCGCGTGA TCCAGAATGA CAGTGCGCGC CTGACGACCT TTCGCAACCG CGATATCGAT GTCTACGGCG CCCGGCCGCG CGAGTACGCG CGGCTGCGTG ACGACGAGGC CCTGCGCGAA CGCGCGGATA CGCACGAGTA CATGAGCCCC ACCGCGGGTT ATTCCTACAT CGGCTGGAAC CAGAAGCGCG ATGGCGAGCC GACGCGCTTT GCCGATCCCC GTGTGCGCCG GGCGATGAGC TATCTGACGG ATGTCGATCG TCTGATCGAG CAGGTGATGC TGGGGTATGC TGAACGCGCG ATCAGCCCGT TCAGCCCGCG CAGCGACCAG CACAACCCCG ACCTCGATTA CATCCCGTTC GATGTCGAGC GTGCGCTTGA ACTGCTGGCC GAGGCCGGCT ACTCCGAGAA AAACCGGGAC GGGGTGCTGG TCAACGAGGA GGGCGAGCCG TTCTCCTTCG ACCTGGTGTA TTTCCAGGAC AATGAGGACA CCCGTCGCAT CGCGCTGTTC CTGCGCGATC TGTACGCGCG TGCCGGGGTT CATATGCGGC CGCAGCCAAC CGAGTGGTCG GTGATGCTGG AGAAGATCAG CCGCCAGGAC TTCGATGCCA TTACCCTGGG CTGGACCAGT GGCGTGGAGG TGGACATCTA CCAGATGTTC CACTCCAGCC AGACGGTCTC TGGTGGTGAC AATTTCATCA ACTACGAAAA CCCCGAGCTG GATGCCGTGA TCGAGGCCGC GCGCGGCGAG GTGGACGAGG ACAAGCGCAT GGAACACTGG CGCGAGGCCG AGCGGATCCT GGTGGAGGAT CAGCCGTATA CCTTCCTGAT GCGGCGCCAG ACCCTTGCCT TCATCGACCG CCGTATCCAG AACCTGGAAC AGACGGCACT CGGCCTGAAC 
CTCGGGTTCG TGCCGGTGGA GATCTACGTC CCGTTTGACC AGCAGCGGTA CGGGCAGTAG
|
Protein sequence | MSRHFGWIVT LIVLGILLLL VMVQVDRQWQ RMAAMQNMMS EQARDLRALR RSLQSLESDL ASGRFTTGDG ERAPRVAEVP SAFERAAEAA GREDYAEGDW LVQAFGTGLA TITPYVSSDA YAADVQNYVL ESLVQRHPET LEWQGLLAAS WDFDDSGQEL SFQLRPGLQF SDGEPLTAED VEFTFNFLMT DAIAVPRARA FLEKVEKVEA ADERTVVFTF EEPYFDALRV AGGLNVLPKH FYERYLDEPE TFNESRGILL GSGPYRLEDP TGWTPDAGRI ELERNPRYWG PVEPSFDRLV WRVIQNDSAR LTTFRNRDID VYGARPREYA RLRDDEALRE RADTHEYMSP TAGYSYIGWN QKRDGEPTRF ADPRVRRAMS YLTDVDRLIE QVMLGYAERA ISPFSPRSDQ HNPDLDYIPF DVERALELLA EAGYSEKNRD GVLVNEEGEP FSFDLVYFQD NEDTRRIALF LRDLYARAGV HMRPQPTEWS VMLEKISRQD FDAITLGWTS GVEVDIYQMF HSSQTVSGGD NFINYENPEL DAVIEAARGE VDEDKRMEHW REAERILVED QPYTFLMRRQ TLAFIDRRIQ NLEQTALGLN LGFVPVEIYV PFDQQRYGQ
|
| |
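The coordinate and length fields in the Gene Information table are related by simple arithmetic, and a short sketch can make that check explicit. It assumes the Start bp and End bp values are 1-based and inclusive, that the gene is unspliced (as the record states), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


# Values copied from the record for locus tag TK90_0599.
START_BP = 632386
END_BP = 634245

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(f"gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1860 bp) and Protein Length (619 aa) fields listed in the record.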