Gene Information |
Locus tag | TK90_1907 |
Symbol | |
ID | 8807680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2023111 |
End bp | 2024781 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Na/Pi-cotransporter II-related protein |
Protein accession | YP_003461134 |
Protein GI | 289209068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
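Note: the coordinate and length fields above are mutually consistent. The following is a minimal sketch of that check, assuming the Start and End positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag TK90_1907.
length_bp = gene_length(2023111, 2024781)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Both results match the Gene Length (1671 bp) and Protein Length (556 aa) fields listed in the record.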
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGG CCGAAGCCGT CAGCACCTAT GCCCTGATCG CCGGCTTCCT CGGCGGTCTC GGGCTGTTCC TGCTCGGGAT GACCCTGCTG ACCGACGGCC TGAAGACGGC TGGCGGCAAA GCATTGCAAC ACATCCTTGG GCAGTGGACC CGCACCCGGC CACACGCCCT CGCCTCGGGC ATGGGTGTGA CCGCCCTCGT GCAGTCTTCC AGTGCGGTCA CGGTCGCCAC CATCGGCTTC GCCAATGCCG GCCTGCTGAC GCTGGGCCAG TCGGTCTGGG TGATCTTCGG GTCCAACGTC GGCACCACCA TGACCGGCTG GCTGGTCGCC CTGATCGGCT TCAACATCCG CATCGAGGCC TTTGCCCTGC CGGCGATCGG CATCGGCGCC CTGCTCTACC TGTTCATGAA GGCCGCACGC CCACGCTACC TGGGTCTGGC CCTGGCCGGC TTCGGCCTGC TGTTCTTCGG GATCGATGTA CTGCGCGAGA CCTTCGAGGG TCTGAGCGCG ACCTTCGACC TCTCCGCCCT CGCCCGACCG GGCTTCGCCG GACTGTTGAT CATGGTCGGC ATCGGCGCGT TGCTGACCGT GCTGATGCAG AGCTCGTCGG CATCCATGGC AATGGCCCTG ACCGCCGCGA TGAGCGGGGC CGTGCCGCTG GAGGCCGCGG CCGCCGCAGT TATCGGGGCG AACCTCGGGA CCACGGTCAA GGCACTCCTG GTCGTGATCG GCGCGACCCC GAACGCCAAG CGCGTCGCGG CGGCCCACGT GATCTTCAAC GGCCTGACCG CGATCGTCGC ATTGCTGATC CTGCCCTGGT TCCTGGCCGC GATCGCCTGG ATCTGGGGCA GCACCGGGGA GGCCCCGGCC CCGGCGGTAC TGCTGGCGCT GTTCCATACC GCGTTCAACC TGCTGGGTGT GGCGCTGATG GTGCCTGTCG CCCCGCCGAT GATCCGCTTC CTGCAGATGC GTTTTCGCAC GGAAGAGGAG GACATCGCCC GGCCACGTTT CCTTGACCGG CACAGCGCCG GCGTACCGGA CCTGGCACTG CAGTCGCTGC TGCAGGAGAC GGACCGGCTG GCGGATATCA GCCACTCTAT GGCGCGCGAG GCCCTGCGGG CCGATCCGCC AGCCGAGGGC AGCCTGGACA CCCGCCGGGC CGCCGTCGAG CAGCTGACGG CGGCGATCAG CGATTTCGCC ACCCGCACCG CCAACCACCC CATGCCGCAG GCGGTAGGAC ACGGGATCGC CGAGGTCGTC CGCATCGCCC GCTATCATCG CGAGATAGCC GTATTGGCCG AGCAGATCAC CGACCTGCGC AGGCACTCCA CCGGCGAGGA ACGCCCAGCC GTCAGCGACC AGCGCCGCGC GTGGCAGGAG CAGGTCATCC GCGCCCTCGA CCTGGCCAAC CTCGAGCCCG AAAATGGCGA GAACATCGCG GCCGCCGAGG CCGTCCAATC GGTTGAAGTC GCCTATGCCG ACCTAAAGGC CGCACTGCTG CGCGAGGGGA CCTTCGGCTC GCTGAGCATG GCCGAGATGG AGCGGCAGCT GCGGATCATC AGCATGGCGC GTCGCCTGGT GACGCAGGCC GACAAGGCTC GACGCCACAC CGAGAACCTG GAAAACCATC CTCGTGACAT CGATTCGACA CTGGATTCGG AAGCCGCATG A
|
Protein sequence | MPEAEAVSTY ALIAGFLGGL GLFLLGMTLL TDGLKTAGGK ALQHILGQWT RTRPHALASG MGVTALVQSS SAVTVATIGF ANAGLLTLGQ SVWVIFGSNV GTTMTGWLVA LIGFNIRIEA FALPAIGIGA LLYLFMKAAR PRYLGLALAG FGLLFFGIDV LRETFEGLSA TFDLSALARP GFAGLLIMVG IGALLTVLMQ SSSASMAMAL TAAMSGAVPL EAAAAAVIGA NLGTTVKALL VVIGATPNAK RVAAAHVIFN GLTAIVALLI LPWFLAAIAW IWGSTGEAPA PAVLLALFHT AFNLLGVALM VPVAPPMIRF LQMRFRTEEE DIARPRFLDR HSAGVPDLAL QSLLQETDRL ADISHSMARE ALRADPPAEG SLDTRRAAVE QLTAAISDFA TRTANHPMPQ AVGHGIAEVV RIARYHREIA VLAEQITDLR RHSTGEERPA VSDQRRAWQE QVIRALDLAN LEPENGENIA AAEAVQSVEV AYADLKAALL REGTFGSLSM AEMERQLRII SMARRLVTQA DKARRHTENL ENHPRDIDST LDSEAA
|
| |
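Note: the sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join the blocks, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in the permitted start codons), which is sufficient here because this gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence from the record.
prefix = clean_sequence("ATGCCGGAGG CCGAAGCCGT")
print(translate(prefix))  # MPEAEA
```

Applied to the full gene sequence, `gc_content` can be compared with the GC content field and `translate` with the Protein sequence field. The short prefix used here reproduces the first six residues of the listed protein (MPEAEA).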