Gene Information |
Locus tag | TK90_2098 |
Symbol | |
ID | 8807873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2215006 |
End bp | 2218086 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | heavy metal efflux pump, CzcA family |
Protein accession | YP_003461324 |
Protein GI | 289209258 |
COG category | |
COG ID | |
TIGRFAM ID | |
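The coordinate and length fields above are mutually consistent, and a short sketch makes the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return length_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2215006, 2218086
length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp)                     # 3081
print(protein_length_aa(length_bp))  # 1026
```

Both results match the Gene Length and Protein Length fields of the record.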
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATC GACTGATTCG ATGGTCCCTC GCCAACCGCC TGCTGGTGAT GGCCGGGGCC CTCGGCCTCC TGCTGCTGGG CGCCTGGCAG GCCAGCCAGA CCCCGATCGA CGTATTCCCG GACCTGACCG CACCCACCGT AACGGTGATG ACCGAGGCCC CCGGCCTGGC CGCCGAGGAG GTGGAACAGC AGGTCAGCTA CCTGATCGAG TCCGCGGTCA GCGGCGCGAG CGGGGTGCGC CGCGTGCGCT CGCAGTCCGT GGCCGGCTTC TCGCTGGTGT GGGTGGAGTT CGAGTGGGGC GAGGAGATCT ACCAGGCCCG GCAGGTCGTC AACGAGCGGC TGGGGCGCGT GCAGGCGCAA CTCCCCGAAG GCGTCGCCGC CCCGCAGCTG GGGCCGATCA GCTCGATCAT GGGCGAGATC GCCTATCTCG GCGTACGTGA CGACGCCCGT GCGGATGGCA TGCGCGCCCG CGAAGTCGCG GACTGGGTGG TAGCGCGGCA GCTGGAGGCC ATCCCCGGCG TGGCCCAGGT GACCACCCTG GGCGGCCAGG TCCGCCAATA CCAGGTGCAA GTCTCCCCGG AGCGCCTCGC GGCGTTCGGC CTGAACCTCG CCGAGGTCCG GGACGCACTG GAACAGGCCA GCCGCAATGC GGGCGGCGAC CGGCTCAGCG AGGGCGGCCA GGACTACCTG ATCCGCTTCC TCGGTCGCGC CGAGCGGCTG GACGAGATCG AACAGGCCGT GGTCGCAGTA CGCGACGGCA TGCCGATCCT GGTACGGCAT GTCGCGGAGG TCGCGATGGG GCCCTCCATC CCGGTCGGGG ATGGCGGCGT GAACGGCCAC CCCGGTGTGG TGCTGGCCGT CTCCAAACAG CCAGATGCCG ACACCCTCGA CCTGACGCGT TCGATCAACG AGACCCTCGC GCGGCTCGAA GCCGGCCTGC CCGAGGGTGT CACCATCGAA CCCAACCTGT TCCGCCAGGC CGATTTCATC CAGACGGCAG TGGCGAACGT GGGCAAGGCC CTGCGCGACG GGGCGATCTT CGTGGTGCTG ATCCTGCTGG TGTTCCTGCT GAGCGTACGG ATCACGCTGA TCTCGGCGCT GGCCATCCCG CTGTCGTTGA CCGTCGCCCT GCTGGTGCTG CACGCGCTGG GCATCACCAT CAACACCATG ACCCTGGGCG GCATGACCAT CGCCATCGGC GCGCTGGTCG ACGATGCCAT CATCTTCGTG GAGAACATCT ATCGCCGCCT GCGCGAGAAC GGCGGCGACC GCGATCGGGC CAGCCTGGAC GAGACCGTAC GCCGGGCCTG CTCGGAGATG CGCTATCCGG TGGTCTATTC CACCGTGATC ATCATGCTGG CGTTCGCCCC GCTGTTCTTC CTGAGCGGGG TGGAGGGCCG CCTGCTGCAG CCGCTGGGGC TGGCCTATCT GGTCGCGATC GCGGCCTCGC TCATGGTCGC CCTGACCGTG ACGCCGGTGC TCGCCCGTTG GCTGCTGCCC GCGGTCGCGG AGCGTAGCGC GGGTCGCGAA CCGCCGGTGG CCGCGGCCCT GCAGCGCGCC TACCGCCCGC TGCTGCGCGC CACCCTGCGC TTTCGCCGCA GTGTGGTGGT CATCTCGGCA GCCCTTGCCA TCGCCACCCT GGCGGCCACG CTGCTGCTCG GGCGCTCCTT TCTGCCCGAC TTCAACGAAG GCAGCCTGAC CATCGAGATG GCGACCCCGC CCGGCACCGG CCTGGAGACC GCGGCGGAGA TGGCCGGCCA GGTGGAACGC TGGCTGCTGG AACAGGATCC GATCCTCTCC 
GTCGCCCGCC GTACCGGACG GGCCGAAGGG GCCGAACACA CCCAGCCGCC CCATCTGAGC GAACTGGATG TCCGTCTGGG CCCGCTGCCG CAGGGGAAGG AAGCGTTTCT CGAGGAGCTG CGCGAGGGGC TGGCCGCGTT TCCTGGCCAG TTCAACATCG GCCAGCCGAT CTCGCACCGC ATCGACCATA TGCTGACCGG CGTGCGCGCG AACCTGTCGG TGAAGATCTT CGGCCCGGGC CTGGGCGAAC TGCAGCGCCT GGCGGGCGAG GTCGAGCGAA CCCTCGAGGG CATCGACGGC CTGGTCGACC TGTCGATCGA GCCCGTCGCC GCGGTGCCAC AAATCCGGCT GCATGCCGAT CGCAGCGCCC TCGCCCGCTA CGGCCTGACC GTGGCCGACG CCGCGGAGAC CCTGGAGACC GCTTGGCGCG GCGCCCGCGT CGCCCAGATC TTCGAGGGGG AGCGGCGTTT CGACCTGGTC GTGCAGCTGC CCGTGGAGCG GCGCGAGCTG GACCGCCTGG CCGACATCCC GATGCGCACG CCGACCGGGC ACAACATCCG GCTGGGCGAC GTGGCCGAAC CGCGGATCGA ACGCGGCCCC GCCCAGGTCA ACCGCGAAAA CGCCGAACGC CGCCTGGTGG TCAGCGCCAA CGTGGCAGGG CGTGACCTGC GCGGGGCGGC CGACGAGGTC CGCGCCACCA TCGACGAACG GGTAGCACTG CCCGAGGGCT ATCGCGTGGA ACTGGGCGGC CAGTTCGAGG CCGAGGCCCA GGCCAGCCGG ACCATCGCCA TCGTGTCCGC CGGCGTCCTG GTCGCCATGC TGGTGGTGCT GCAACTGGCC CTGCGCTCGC TGACCCTGGT CCTGCTGGTG ATGGTCAACA TCCCGCTGGC GCTGATCGGC GGGGTGATCG CCGTGTTCGC CATGGGCGGC GTGCTGACCA TCGCGGCGCT GGTCGGGTTC ATCACCCTGT TCGGCATCGC CGTGCGCAAC GGGCTGCTGC TGATCAGCCG CTACCAGAGC CTGGCGGCCG AGGGCGTGCC GCTGGACGAG GCCGTGGAAC GCGGCGCACT GGAGCGGCTG ACGCCGATCC TGATGACCGC GCTCACCGCC GCGCTGGCGC TCGTCCCGCT GGCGCTGGGC CTGGGCGAGA CCGGCACCGA GATCCAGGCC CCGATGGCCA TCGTGATCCT CGGCGGCCTG CTCGGCTCCA CCGCGCTCAA CATGATTGTG CTGCCGGCCC TGTACCACTG GGTCCGGTCG CGCGCCGCGG CGCGGGCCTG A
|
Protein sequence | MLDRLIRWSL ANRLLVMAGA LGLLLLGAWQ ASQTPIDVFP DLTAPTVTVM TEAPGLAAEE VEQQVSYLIE SAVSGASGVR RVRSQSVAGF SLVWVEFEWG EEIYQARQVV NERLGRVQAQ LPEGVAAPQL GPISSIMGEI AYLGVRDDAR ADGMRAREVA DWVVARQLEA IPGVAQVTTL GGQVRQYQVQ VSPERLAAFG LNLAEVRDAL EQASRNAGGD RLSEGGQDYL IRFLGRAERL DEIEQAVVAV RDGMPILVRH VAEVAMGPSI PVGDGGVNGH PGVVLAVSKQ PDADTLDLTR SINETLARLE AGLPEGVTIE PNLFRQADFI QTAVANVGKA LRDGAIFVVL ILLVFLLSVR ITLISALAIP LSLTVALLVL HALGITINTM TLGGMTIAIG ALVDDAIIFV ENIYRRLREN GGDRDRASLD ETVRRACSEM RYPVVYSTVI IMLAFAPLFF LSGVEGRLLQ PLGLAYLVAI AASLMVALTV TPVLARWLLP AVAERSAGRE PPVAAALQRA YRPLLRATLR FRRSVVVISA ALAIATLAAT LLLGRSFLPD FNEGSLTIEM ATPPGTGLET AAEMAGQVER WLLEQDPILS VARRTGRAEG AEHTQPPHLS ELDVRLGPLP QGKEAFLEEL REGLAAFPGQ FNIGQPISHR IDHMLTGVRA NLSVKIFGPG LGELQRLAGE VERTLEGIDG LVDLSIEPVA AVPQIRLHAD RSALARYGLT VADAAETLET AWRGARVAQI FEGERRFDLV VQLPVERREL DRLADIPMRT PTGHNIRLGD VAEPRIERGP AQVNRENAER RLVVSANVAG RDLRGAADEV RATIDERVAL PEGYRVELGG QFEAEAQASR TIAIVSAGVL VAMLVVLQLA LRSLTLVLLV MVNIPLALIG GVIAVFAMGG VLTIAALVGF ITLFGIAVRN GLLLISRYQS LAAEGVPLDE AVERGALERL TPILMTALTA ALALVPLALG LGETGTEIQA PMAIVILGGL LGSTALNMIV LPALYHWVRS RAAARA
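The relationship between the gene sequence, the translation table, and the protein sequence can be checked with a small standard-library sketch. It builds the codon table from the compact amino-acid string that NCBI lists for translation table 11 (identical to the standard code for codon-to-residue assignments), translates the first 30 bases of the gene sequence above, and computes their GC fraction. The helper names are hypothetical. The sketch does not model alternative start codons, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCTGGATC GACTGATTCG ATGGTCCCTC"
print(translate(prefix))  # MLDRLIRWSL
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence should give a value close to the GC content listed above.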