Gene TK90_2098 details

Gene Information

Locus tag: TK90_2098
Symbol:
ID: 8807873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013889
Strand:
Start bp: 2215006
End bp: 2218086
Gene Length: 3081 bp
Protein Length: 1026 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: heavy metal efflux pump, CzcA family
Protein accession: YP_003461324
Protein GI: 289209258
COG category:
COG ID:
TIGRFAM ID:
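
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and the values are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 2215006
end_bp = 2218086
gene_length_bp = 3081
protein_length_aa = 1026

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks agree with the record: the span is 3081 bp, which is 1027 codons, giving 1026 amino acids plus one stop codon.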


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGATC GACTGATTCG ATGGTCCCTC GCCAACCGCC TGCTGGTGAT GGCCGGGGCC 
CTCGGCCTCC TGCTGCTGGG CGCCTGGCAG GCCAGCCAGA CCCCGATCGA CGTATTCCCG
GACCTGACCG CACCCACCGT AACGGTGATG ACCGAGGCCC CCGGCCTGGC CGCCGAGGAG
GTGGAACAGC AGGTCAGCTA CCTGATCGAG TCCGCGGTCA GCGGCGCGAG CGGGGTGCGC
CGCGTGCGCT CGCAGTCCGT GGCCGGCTTC TCGCTGGTGT GGGTGGAGTT CGAGTGGGGC
GAGGAGATCT ACCAGGCCCG GCAGGTCGTC AACGAGCGGC TGGGGCGCGT GCAGGCGCAA
CTCCCCGAAG GCGTCGCCGC CCCGCAGCTG GGGCCGATCA GCTCGATCAT GGGCGAGATC
GCCTATCTCG GCGTACGTGA CGACGCCCGT GCGGATGGCA TGCGCGCCCG CGAAGTCGCG
GACTGGGTGG TAGCGCGGCA GCTGGAGGCC ATCCCCGGCG TGGCCCAGGT GACCACCCTG
GGCGGCCAGG TCCGCCAATA CCAGGTGCAA GTCTCCCCGG AGCGCCTCGC GGCGTTCGGC
CTGAACCTCG CCGAGGTCCG GGACGCACTG GAACAGGCCA GCCGCAATGC GGGCGGCGAC
CGGCTCAGCG AGGGCGGCCA GGACTACCTG ATCCGCTTCC TCGGTCGCGC CGAGCGGCTG
GACGAGATCG AACAGGCCGT GGTCGCAGTA CGCGACGGCA TGCCGATCCT GGTACGGCAT
GTCGCGGAGG TCGCGATGGG GCCCTCCATC CCGGTCGGGG ATGGCGGCGT GAACGGCCAC
CCCGGTGTGG TGCTGGCCGT CTCCAAACAG CCAGATGCCG ACACCCTCGA CCTGACGCGT
TCGATCAACG AGACCCTCGC GCGGCTCGAA GCCGGCCTGC CCGAGGGTGT CACCATCGAA
CCCAACCTGT TCCGCCAGGC CGATTTCATC CAGACGGCAG TGGCGAACGT GGGCAAGGCC
CTGCGCGACG GGGCGATCTT CGTGGTGCTG ATCCTGCTGG TGTTCCTGCT GAGCGTACGG
ATCACGCTGA TCTCGGCGCT GGCCATCCCG CTGTCGTTGA CCGTCGCCCT GCTGGTGCTG
CACGCGCTGG GCATCACCAT CAACACCATG ACCCTGGGCG GCATGACCAT CGCCATCGGC
GCGCTGGTCG ACGATGCCAT CATCTTCGTG GAGAACATCT ATCGCCGCCT GCGCGAGAAC
GGCGGCGACC GCGATCGGGC CAGCCTGGAC GAGACCGTAC GCCGGGCCTG CTCGGAGATG
CGCTATCCGG TGGTCTATTC CACCGTGATC ATCATGCTGG CGTTCGCCCC GCTGTTCTTC
CTGAGCGGGG TGGAGGGCCG CCTGCTGCAG CCGCTGGGGC TGGCCTATCT GGTCGCGATC
GCGGCCTCGC TCATGGTCGC CCTGACCGTG ACGCCGGTGC TCGCCCGTTG GCTGCTGCCC
GCGGTCGCGG AGCGTAGCGC GGGTCGCGAA CCGCCGGTGG CCGCGGCCCT GCAGCGCGCC
TACCGCCCGC TGCTGCGCGC CACCCTGCGC TTTCGCCGCA GTGTGGTGGT CATCTCGGCA
GCCCTTGCCA TCGCCACCCT GGCGGCCACG CTGCTGCTCG GGCGCTCCTT TCTGCCCGAC
TTCAACGAAG GCAGCCTGAC CATCGAGATG GCGACCCCGC CCGGCACCGG CCTGGAGACC
GCGGCGGAGA TGGCCGGCCA GGTGGAACGC TGGCTGCTGG AACAGGATCC GATCCTCTCC
GTCGCCCGCC GTACCGGACG GGCCGAAGGG GCCGAACACA CCCAGCCGCC CCATCTGAGC
GAACTGGATG TCCGTCTGGG CCCGCTGCCG CAGGGGAAGG AAGCGTTTCT CGAGGAGCTG
CGCGAGGGGC TGGCCGCGTT TCCTGGCCAG TTCAACATCG GCCAGCCGAT CTCGCACCGC
ATCGACCATA TGCTGACCGG CGTGCGCGCG AACCTGTCGG TGAAGATCTT CGGCCCGGGC
CTGGGCGAAC TGCAGCGCCT GGCGGGCGAG GTCGAGCGAA CCCTCGAGGG CATCGACGGC
CTGGTCGACC TGTCGATCGA GCCCGTCGCC GCGGTGCCAC AAATCCGGCT GCATGCCGAT
CGCAGCGCCC TCGCCCGCTA CGGCCTGACC GTGGCCGACG CCGCGGAGAC CCTGGAGACC
GCTTGGCGCG GCGCCCGCGT CGCCCAGATC TTCGAGGGGG AGCGGCGTTT CGACCTGGTC
GTGCAGCTGC CCGTGGAGCG GCGCGAGCTG GACCGCCTGG CCGACATCCC GATGCGCACG
CCGACCGGGC ACAACATCCG GCTGGGCGAC GTGGCCGAAC CGCGGATCGA ACGCGGCCCC
GCCCAGGTCA ACCGCGAAAA CGCCGAACGC CGCCTGGTGG TCAGCGCCAA CGTGGCAGGG
CGTGACCTGC GCGGGGCGGC CGACGAGGTC CGCGCCACCA TCGACGAACG GGTAGCACTG
CCCGAGGGCT ATCGCGTGGA ACTGGGCGGC CAGTTCGAGG CCGAGGCCCA GGCCAGCCGG
ACCATCGCCA TCGTGTCCGC CGGCGTCCTG GTCGCCATGC TGGTGGTGCT GCAACTGGCC
CTGCGCTCGC TGACCCTGGT CCTGCTGGTG ATGGTCAACA TCCCGCTGGC GCTGATCGGC
GGGGTGATCG CCGTGTTCGC CATGGGCGGC GTGCTGACCA TCGCGGCGCT GGTCGGGTTC
ATCACCCTGT TCGGCATCGC CGTGCGCAAC GGGCTGCTGC TGATCAGCCG CTACCAGAGC
CTGGCGGCCG AGGGCGTGCC GCTGGACGAG GCCGTGGAAC GCGGCGCACT GGAGCGGCTG
ACGCCGATCC TGATGACCGC GCTCACCGCC GCGCTGGCGC TCGTCCCGCT GGCGCTGGGC
CTGGGCGAGA CCGGCACCGA GATCCAGGCC CCGATGGCCA TCGTGATCCT CGGCGGCCTG
CTCGGCTCCA CCGCGCTCAA CATGATTGTG CTGCCGGCCC TGTACCACTG GGTCCGGTCG
CGCGCCGCGG CGCGGGCCTG A
 
Protein sequence
MLDRLIRWSL ANRLLVMAGA LGLLLLGAWQ ASQTPIDVFP DLTAPTVTVM TEAPGLAAEE 
VEQQVSYLIE SAVSGASGVR RVRSQSVAGF SLVWVEFEWG EEIYQARQVV NERLGRVQAQ
LPEGVAAPQL GPISSIMGEI AYLGVRDDAR ADGMRAREVA DWVVARQLEA IPGVAQVTTL
GGQVRQYQVQ VSPERLAAFG LNLAEVRDAL EQASRNAGGD RLSEGGQDYL IRFLGRAERL
DEIEQAVVAV RDGMPILVRH VAEVAMGPSI PVGDGGVNGH PGVVLAVSKQ PDADTLDLTR
SINETLARLE AGLPEGVTIE PNLFRQADFI QTAVANVGKA LRDGAIFVVL ILLVFLLSVR
ITLISALAIP LSLTVALLVL HALGITINTM TLGGMTIAIG ALVDDAIIFV ENIYRRLREN
GGDRDRASLD ETVRRACSEM RYPVVYSTVI IMLAFAPLFF LSGVEGRLLQ PLGLAYLVAI
AASLMVALTV TPVLARWLLP AVAERSAGRE PPVAAALQRA YRPLLRATLR FRRSVVVISA
ALAIATLAAT LLLGRSFLPD FNEGSLTIEM ATPPGTGLET AAEMAGQVER WLLEQDPILS
VARRTGRAEG AEHTQPPHLS ELDVRLGPLP QGKEAFLEEL REGLAAFPGQ FNIGQPISHR
IDHMLTGVRA NLSVKIFGPG LGELQRLAGE VERTLEGIDG LVDLSIEPVA AVPQIRLHAD
RSALARYGLT VADAAETLET AWRGARVAQI FEGERRFDLV VQLPVERREL DRLADIPMRT
PTGHNIRLGD VAEPRIERGP AQVNRENAER RLVVSANVAG RDLRGAADEV RATIDERVAL
PEGYRVELGG QFEAEAQASR TIAIVSAGVL VAMLVVLQLA LRSLTLVLLV MVNIPLALIG
GVIAVFAMGG VLTIAALVGF ITLFGIAVRN GLLLISRYQS LAAEGVPLDE AVERGALERL
TPILMTALTA ALALVPLALG LGETGTEIQA PMAIVILGGL LGSTALNMIV LPALYHWVRS
RAAARA
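
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not a tool provided by the source database: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input. Applying the same functions to the full gene sequence would allow a comparison with the GC content reported in the Gene Information table.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in dna that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCTGGATC GACTGATTCG ATGGTCCCTC GCCAACCGCC TGCTGGTGAT GGCCGGGGCC"

dna = clean_sequence(first_line)
print(len(dna))                    # number of bases after removing spaces
print(round(gc_fraction(dna), 2))  # GC fraction of this line only
```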