Gene Information |
Locus tag | TK90_1920 |
Symbol | |
ID | 8807693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2038418 |
End bp | 2041552 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | transporter, hydrophobe/amphiphile efflux-1 (HAE1) family |
Protein accession | YP_003461147 |
Protein GI | 289209081 |
COG category | |
COG ID | |
TIGRFAM ID | |
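The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2038418
end_bp = 2041552
gene_length_bp = 3135
protein_length_aa = 1044


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("span length matches Gene Length:", computed_length == gene_length_bp)
print("codon count matches Protein Length:", computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold: the span is 3135 bp, which is 1045 codons, or 1044 residues plus the stop codon.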
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0960887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGCGCT TCTTCATCGA CCGGCCGATC TTCTCCTCGG TCATCTCGTT CGTTATCGTG TTGGCGGGGC TGGCGGCGCT GAGCGCGTTG CCGGTAGAGC AGTACCCGGA TGTGGTGCCA CCGCAGGTCG TGGTCCAGGC CAACTATCCC GGTGCCAGTT CCGAAGTGCT GGCGGAATCG GTCGCCGCGC CGCTGGAGCA GGAAATCAAC GGCGTGGACA ACATGATCTA CATGGAGTCC ACCAGCACCG ATGCCGGCTC CCTGCAGATC GCCGTCTCGT TCGAGATGGG CACCGATCCG GACCAGGCGG CCATCAACGT CAATAACCGG GTGCAGGCCG CCCTGCCGCG CCTGCCGCAG GAGGTGCGGG ATCAGGGCGT GCGGGTGGAA GCGCGCTCCA CCAACATTCT GATGGTGCCG GTGCTGAGCT CGCCGGATGG CCGCTACGAT TCGCTGTTCA TCAGCAATTA CGCGCTGCTC AATGTGCTGG ACGAACTGGT CCGTCTGCCC GGTGTCGGCG ACGCCAGCCT GTTCGGCGCC CAGGATTACT CCATGCGCGT GTGGCTGCGG CCGGACAAGC TCGCCCAGTT CGAGCTCACA CCGTCCGATG TGACCGCCGC GATACGGGAG CAGAACGCCC AGTTCGCCGC AGGGCGCATC GGGGCGGAGC CGGCACCGGA GGGGCAGGCC TTTACCTTCA CGGTGACCAC CGATGGGCAG CTCGACAACG CCGAGGCCTT CGAGGACATC ATCCTGCGCT CCGGGTCCGA TGGCAGCACC CTGCGGCTGG GGGATGTGGC AAGGGTCAGC CTGGGCGCCC AGAACTACGA GTTTTCCGCC ACCTATAACG GTGAGCCAAC CGTGCCCCTG GGCGTCTTTC TGCAGCCCGG CGCCAACGCG CTGGACACGG CCCGGGAGGT GCGGTCGGCC CTCGACGAGT TGTCCGAGCG TTTTCCGGAC GGCCTGGAAT ACACGGTAGC CTACGACATC ACCGAGTTCG TGGAGATCTC GGTGCGGGAG GTCTTCATCA CGCTGCTGAT CGCCGTGGCG CTGGTGGTGC TGGTGACCTT CCTGTTCCTG CAGCACCTGC GGGCCACGCT GATCCCCGTG GCGGCCATTC CGGTGTCCCT GATCGGCACG TTTGCCGGCA TGCAGGCGAT GGGCTTTTCG GTCAATCTGC TGACCCTGTT CGGCCTGGTG CTGGCGATCG GCATTGTGGT CGACAACGCC ATCATCGTTA TGGAAAACGT GGAACGGCTG ATGGCCGAAA AGGGCTTGAA GGCGCGCGAG GCGTCCATTG AAACCATGCA GCAGGTGGCC GGGGCGGTGG TGGCCTCGAC CCTGGTGCTG GTGGCAGTGT TCGCGCCGGT GGCATTCCTG GGAGGCCTTT CCGGGGAGTT GTACCGTCAG TTCGCGGTGA CCATCGCCGT GTCCGTGGTG GTTTCCGGCG TGGTGGCGCT GACCTTGACG CCGGCCATGT GTGCCCTGCT GCTGGACAAG CAGAAGCACA CGGTGTCCCG GCCGTTCGCC CTGTTCAATA GCGGCTTCAA TGCGCTGACC CGTGGTTTCG TAGGAACGGT GGGCTGGCTG TTGCGCCATC GCACGGTGGG TGTGGCGCTG TTTCTGGGTT TCTGCGCGAC GACGGTGTTC CTGCTGGACC GCCTGCCATC CGGTCTGGTG CCGCAGGAAG ACCAGGGCGT CGCGCTGGTG GTGGGCCAGT TGCCGCCGGT ATCGGCTCTG GGGCGCACGG AGCAGGTGCG GGACGAACTG ACGGACCGCC TCCGGGAGAT CGAGGAGATC 
GACGAGTTCA CCGCGTTCGC CGGATTCGAC ATTATCGCCT CGTCCCTGCG TACCAACGCC CTGGTCGGCT TTGCCAACCT GAGCGACTGG TCCGAGCGCC GAGCTCCGGA CCAGCATGCC TCGGCGGTTG TGGGCAGGAT CATGGGTGTC GGAGCCGGGA TCCCGGAGGC CAATGTGTTC GCCTTCGCGC CGCCGCCTAT TCAGGGCTTG TCACTGACCG GGGGGGTGGA AGGCTACCTT CAGGTCCGCG GTCAGACCAC CACGGACGAA GTGGACGCAG CGGCCCAGCG GGTGGTGCAG GCCGCCAATG AACGGCCCGA GCTGGTTAAT GTGCGGGCCA CGCTGGATAC CAACATGCCG CGCTACAGCG CCACCGTGGA CCGGGAGAAG GCCCGCGCCA TCGGCGTGCC CATCAACACC ATCTTCGAGG CCATGCAGAG CACCTTCGGC GCCTTTTATG TCAACGATTT CACTTACCAG GGGCGTTTGT GGCGGGTGAA CGTGCAGTCC GAAGCGGAGT TCCGCAGTCG CGAAGAGGAC CTCCGGCATG TGTTCGTGCG CTCGGATGCC GGGGAGATGG TGCCCGTGGA TTCCCTCGTG AGCCTGGAGC GGGGCAGCGG CGCGGACATT ATCAACCGCT TCAATATCTA TCAGTCGGCG CGCTTGCTTG CGGATGCGGC ACCGGGTTAC ACCACGGGCC AGGCCAAGGA GGCATTGGAA GCCGTGGTGG CCGAGTTGGA CTCGGACGCC AACACCACCA TGGGCTGGAT TGGCGAGGCC TACCAGCTGG ACGTTGCCGC CGGTGCCGCT GGCGCGGCGT TCGGTCTGGG GCTGCTGATG GTGTTCCTGA TTCTGGCCGC CCAGTACGAG CGTCTGACCC TGCCCCTGGC GGTGGCCACC GCCGTACCCT TCGGTGTGCT CGGCGCGGCC CTGGCAACCA TGCTGCGCGG CTTTCCCAAT GACATCTACT TTCAGGTAGG GCTGCTGGTG CTGATCGGCC TGGCGGCGAA GAATGCCATC CTGATCGTGG AGTTCGCCGC GCAGAACCGG CGCGAAGGCA TGAGTTCTAC CGATGCCGCC ATGGCGGCGG CGCGGCAGCG TTTCCGTGCG ATCGTCATGA CCGCGATGAC CTTCATCATC GGCACGCTGC CGCTGATGTT CGCCACCGGA GCCGGCGCGG CCAGTCGACA GGAGATCGGC ACCGTGGTGG TGGGCGGCAT GATCGCGGCC AGCACCCTGG CGCTGCTGTT CGTGCCGTTG TTCTACAAAT TGCTCGAGGA TCTGGTCACT TGGCGTCAGC AACGTCGCGA GAAGAAGGAG GCCAGCCATG CGTAA
Protein sequence | MLRFFIDRPI FSSVISFVIV LAGLAALSAL PVEQYPDVVP PQVVVQANYP GASSEVLAES VAAPLEQEIN GVDNMIYMES TSTDAGSLQI AVSFEMGTDP DQAAINVNNR VQAALPRLPQ EVRDQGVRVE ARSTNILMVP VLSSPDGRYD SLFISNYALL NVLDELVRLP GVGDASLFGA QDYSMRVWLR PDKLAQFELT PSDVTAAIRE QNAQFAAGRI GAEPAPEGQA FTFTVTTDGQ LDNAEAFEDI ILRSGSDGST LRLGDVARVS LGAQNYEFSA TYNGEPTVPL GVFLQPGANA LDTAREVRSA LDELSERFPD GLEYTVAYDI TEFVEISVRE VFITLLIAVA LVVLVTFLFL QHLRATLIPV AAIPVSLIGT FAGMQAMGFS VNLLTLFGLV LAIGIVVDNA IIVMENVERL MAEKGLKARE ASIETMQQVA GAVVASTLVL VAVFAPVAFL GGLSGELYRQ FAVTIAVSVV VSGVVALTLT PAMCALLLDK QKHTVSRPFA LFNSGFNALT RGFVGTVGWL LRHRTVGVAL FLGFCATTVF LLDRLPSGLV PQEDQGVALV VGQLPPVSAL GRTEQVRDEL TDRLREIEEI DEFTAFAGFD IIASSLRTNA LVGFANLSDW SERRAPDQHA SAVVGRIMGV GAGIPEANVF AFAPPPIQGL SLTGGVEGYL QVRGQTTTDE VDAAAQRVVQ AANERPELVN VRATLDTNMP RYSATVDREK ARAIGVPINT IFEAMQSTFG AFYVNDFTYQ GRLWRVNVQS EAEFRSREED LRHVFVRSDA GEMVPVDSLV SLERGSGADI INRFNIYQSA RLLADAAPGY TTGQAKEALE AVVAELDSDA NTTMGWIGEA YQLDVAAGAA GAAFGLGLLM VFLILAAQYE RLTLPLAVAT AVPFGVLGAA LATMLRGFPN DIYFQVGLLV LIGLAAKNAI LIVEFAAQNR REGMSSTDAA MAAARQRFRA IVMTAMTFII GTLPLMFATG AGAASRQEIG TVVVGGMIAA STLALLFVPL FYKLLEDLVT WRQQRREKKE ASHA
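The sequences above are printed in space-separated blocks of ten characters, and the gene sequence is split across a line break. A minimal sketch of how such text could be normalized before checking it against the Gene Length and GC content fields follows. The helper names are hypothetical, and the example input is only the first ten bases of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten bases of the gene sequence, split the way the record prints it.
example = clean_sequence("ATGCT GCGCT\n")
print(len(example), gc_fraction(example))
```

Applying `clean_sequence` to the full gene sequence and comparing `len(...)` with the Gene Length field, and `gc_fraction(...)` with the GC content field, would be one way to confirm that the sequence was copied intact.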