Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1865 |
Symbol | |
ID | 8807638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1979272 |
End bp | 1981050 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | putative sodium symporter protein |
Protein accession | YP_003461092 |
Protein GI | 289209026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.151601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.340772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAAC AGACCCTAAG TCTTCTCGTG GTCGGGGCGA CCTTCGCCCT GTACATCGGC ATCGCCATCT GGGCGCGGGC CGGTTCGACC AGCGAGTTCT ACGTCGCCGG GCGCGGCGTC AACCCGATCG CCAACGGGAT GGCCACCGCG GCGGACTGGA TGTCCGCCGC GTCGTTCATC TCCATGGCCG GCCTGATCGC GTTCCTCGGC TTCACCGGGG GCGCCTACCT GATGGGCTGG ACCGGCGGCT ACGTGCTGAT GGCCCTGTTG CTGGCCCCGT ACCTGCGCAA GTTCGGCAAG TTCACGGTGC CGGAATTCAT CGGTGACCGG TTCTACTCGA AGGTTGCACG CGTGGTGGCG GTCGTCTCGC TGCTGGCGAT GTCCATCACC TACGTGATCG GACAGATGCG TGGCGTCGGG ATTGCGTTCT CCAACATCCT CGAAGTGCCG CTGGCCATCG GCCTGATCTC GGGCATGGCG GTGGTGTTCA TCTACGCGGT GCTCGGCGGG ATGAAAGGGA TTACCTACAC CCAGATCGCG CAGTACGTGA TCATGATCTT CGCCTACACC GTGCCGGCGG TCTTCATCTC GCTGACCCTC ACCGGCCAGG TGTTCCCGCA GATCGGCCTG GGTTCCACCC TGCAGGGTCA GGACACCTAC CTGCTGGAGG CCCTGGACCA GACGATGGTG GATCTTGGCT TCACCATGTA CACGGCCACC GAGGGCGGCA TGAGCATGCT CAACATGTTC CTGCTGACCA TGGCGCTGAT GATCGGTACC GCCGGTCTGC CGCACGTGAT CATCCGGTTC TTTACCGTTC CGCGTGTGCG CGATGCGCGT AAGTCCGCCG GCTGGGCGCT GGTCTTCATC GGCATCCTCT ACACCACCGC CCCGGCGGTC GGTGCGATGG CGATCTGGAA CCTCGTCAAC ACCGTGCATC CGGGCGAAAT CGGCACCGAG GAAGGCCACC TGGCCTACGA GGACAAGCCG GCCTGGATGG AGCGCTGGGA ACAGACTGGT CTGCTCGCGT TCGAGGACAA GAACGAAGAC GGCCGCATGC AGTACTACAA CGATGCCAAC GAAGAGTTCG CCGCGATGGC GGAAGAGGAA TACGGCTGGG AAGGCTCCGA GATCGTAACC CTCGACCGCG ACATCATGGT GCTGGCCAAT CCCGAGATCG CCGGGCTTCC GGGCTGGGTG ATCGCGCTGG TGGCGGCCGG TGGTATCGCG GCGGCCCTGT CGACTGCGGC CGGTCTGTTG CTGGCCATCT CGTCGGCCAT CTCGCATGAC TTGCTGAAGG GGGTCTTCAT GCCTCGAATC AGCGAGAAGA ACGAGCTGCT GGCGGGACGG ATCGCCATGG CAGGCGCAAT CCTGTTCGCG GGATATCTGG GGCTCAATCC ACCAGGCTTT GCGGCCGAGG TGGTCGCACT CGCCTTCGGT CTTGCCGCGG CCTCGCTGTT CCCGACCCTG ATGATGGGCA TCTTCATGAA GAAGATGAAC AAGGAAGGCG CGATCGCCGG CATGCTGGTC GGTCTGGTGA CCACCCTGCT CTACATCTTT ACCTACAAGG GGTGGTTCTT CTTCTCCGGC ACCGCCATGC TCCCGGATAC CGAGGAGTAC TGGCTGTTCG GCGTGAACCC GACCGGCTTC GGCGCCATTG GCGCGGTGTT CAACTTCGTC GCGGCCTACA TCGTGATGAA GCTGACCAAG GAGCCGCCGC AGCATATCCA GGAGCTGGTG GAAAGCGTGC GTGTTCCGCG CACCGACAAC CCGAGCTGA
|
Protein sequence | MDQQTLSLLV VGATFALYIG IAIWARAGST SEFYVAGRGV NPIANGMATA ADWMSAASFI SMAGLIAFLG FTGGAYLMGW TGGYVLMALL LAPYLRKFGK FTVPEFIGDR FYSKVARVVA VVSLLAMSIT YVIGQMRGVG IAFSNILEVP LAIGLISGMA VVFIYAVLGG MKGITYTQIA QYVIMIFAYT VPAVFISLTL TGQVFPQIGL GSTLQGQDTY LLEALDQTMV DLGFTMYTAT EGGMSMLNMF LLTMALMIGT AGLPHVIIRF FTVPRVRDAR KSAGWALVFI GILYTTAPAV GAMAIWNLVN TVHPGEIGTE EGHLAYEDKP AWMERWEQTG LLAFEDKNED GRMQYYNDAN EEFAAMAEEE YGWEGSEIVT LDRDIMVLAN PEIAGLPGWV IALVAAGGIA AALSTAAGLL LAISSAISHD LLKGVFMPRI SEKNELLAGR IAMAGAILFA GYLGLNPPGF AAEVVALAFG LAAASLFPTL MMGIFMKKMN KEGAIAGMLV GLVTTLLYIF TYKGWFFFSG TAMLPDTEEY WLFGVNPTGF GAIGAVFNFV AAYIVMKLTK EPPQHIQELV ESVRVPRTDN PS
|
| |