Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0879 |
Symbol | |
ID | 8806634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 936104 |
End bp | 939238 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | Protein of unknown function DUF2309 |
Protein accession | YP_003460130 |
Protein GI | 289208064 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.754113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCTGA CGCTAGGACG CAAGCTCAAG ATCCGCTCGA TGGTGCACAT GGCGGCCGAG CCCATTCCCA ACTTCTGGCC CATGCGCACC TTCATCCATC ACAACCCGCT CCACGGGCTG GAAGACCTGC CGTTCCCTGA AGCCATCCAG CGCGGTGAAG TACTGTTTCA CGGGCGGGGG TTCCTGCCCC GGGCCGACTA CCAGCGACTG TTCCGCGAGG GCCACGTGGA CCGCGACACC CTCGAGACGC GCGTCGAGGC CGCCCTCGAG AACCGCCCGG CCCTCCAGGC GATCGGCGGG ATCGACCTGC TCTGCACGCT GCTGACTGGA TATCATGATC CGGTCATCAC ACCGCGCACG CTCGCCGATG TCGACGACGT GGCGGCCGCC CTGAACGGTC GCGAACACAC CCCCTCTGCG ACCGATACCG GGATACTCGC CGAACGGCTG CATGCCGCCT TTCCGGTCAT ACAGCCGCTC TACGAGGCAC TCGATTCGCT GTTCGGCACA CGTATCGGCA CCACGCTGGA CGAACAGCTC ACCAAGATCT GCCTGGATTA CTTCGACGAG GGTCAGTCAG CATGGCAGAT GCCCGGACGC GAGCACGGCC TGTTTGCCGC ATGGAAGACG ATCGTCCATC ACCACCCGCG CCTGCTCCTG CGTGGCCAGC ACACCCAAAC GATCCTGGCA GACCACGACA GCCCCGAGGC GATCATCGCC CATGTCCTGG ACGAGATCGG CATCCCCGAA GAGGCCTGGC CCGATCTGAT CGTGCGCGAA CTGACCCGCC TGCACGGCTG GGCCGGCTTC ATCCGCTGGC GCTCGACGGC AAACCACTAT CACTGGGCAC GCGAGCATCC GGCCGACCTG ATTGATTTTC TCGCCATCCG TCTAGTGCTC GGGCTGTCCC TGATTCGCGA GCACAGCCGT CGCCACAAGA TGCCGGGCGA CCGCAAGGCC CTCGAAACAC TCCTCGATGA GCGCCCCGCC GAGTGCTACC TGCGCCGCGA GTTTTTCGGC GGCGAGGTAC TGCCCGTCTA CGCCGAGACC GTTGAGGCCG TCATTGCACG CGGCAAATCC AACGAGATCG CCCGGCAGCT GGACCAATAC CTGCCGGCCA AGCGCCGCGC CGAAGGCCGC GACCAGGCCC GTGCCCTCGG CTCGCTGGCC TCGCTTACTG GCAATCCGTC GCTGTTTGCC GATCTCGATC GTAGCCAACT CGCCGCAGTG CTTGAACAGC TGGGCGACTT CGAGCAGCAC GAGGGTATGG TCTGGCTGGA AGCCATGGAG GCGACCTACC GCCGCAGCGT GCTGTCAAGC CTGCGGCTCG AACCGCCCGC GCCGCGCGCG AAACGCCCGT TCGCGCAATT GCTCTTTTGC ATCGACGTAC GCTCCGAACG CGTGCGCCGT CAACTCGAGA CCATCGGCGA CTACCAGACC TACGGCATTG CCGGCTTCTT CGGCGTCCCG GTGAGCTTCA TCGCGCTTGG CAAGGGCAGC GAGGATCACC TCTGTCCGGT CGTCGTTACA CCCAGGAATG TCGTACTGGA AGTCACCACC GGTGGCGAGC CGCTCGACCT CGACCTGTAC TCCTCGGCCG AGCATCTGCT GCACGACCTG AAGAACTCCG TTCTCTCGCC GTACTTCACC GTCGAGGCCA TTGGCCTGCT GTTCGGCTTT GACATGATCG GCAAGACCAT CGCTCCGGCG GCCTACCATC GATGGCGCGA CAGGATCGAA GCGGATCAGC CCTCGACCCG TCTGATGGTC GACAAACTCA CCCGCGAGCA GGCGGACTCG GTCGTTCGCG CCCAGCAACG GGCCATCATC GTCCAGGCCA TCCACCAGGC GTTCGACATC GAGCGTGAAG CCATCACCGA TGCCATGATC CGCGAGCTGC GCGAAACGGC ATTGGGCCAC TGCAGCGGAC AGACCCATTT CGCCCGCGAC TTTGGCCTGA ACGAGCGTGA CGAGTCGGCT TTCATCAAGA CCTTGCAAGA TGACTACCGC ATCAACCGTT CCTATGCGCA GATGCAGCTG GAGCGCCTGG GCCGGATCGG GTTTTCGCTG GACGAGCAGA CCTATTTCGT TGCCCAGGCG TTGCGCTCGA TCGGCCTGAC CGAACAGTTC TCGCGCTTCA TCATCCTCAC CGGCCATGGC AGCCGGTCGG ACAACAACCC CTACGAATCA GCGCTCGACT GTGGCGCCTG CGGAGGCAGC CACGGGATCG TCAGCGCCCG GGTACTCGCG CACATGGGCA ACAAGCCGGA GGTGCGCCGA CGGCTGCGTG CCCAGGGCAT CGACATCCCG GATGACGCCT GGTTTCTGCC GGCCATGCAC AACACCACCT CCGACGAGAT CCGGCTGCAC GATCTCGACC TGCTGCCGAC CAGCCACCTG GTCTACCTCG AGCGCCTGCG CAACGGCCTG CGCGCCGCCA CGCGCCTCGT TGCCCGCGAG CGCCTGCAGG CCCTGGACCC GGACCGCGAG CCAGCCCCGG ATGCTGTCAA GGCCGCGCGC AAGGCACAGC GCAATGCGGT CGACTGGGCG CAGGTACGCC CCGAATGGGG TCTGGCGCGC AATGCAGGCT TCATCATCGG CCGACGGCAC CTGACCGAGA CAACCGATTT GAAGGGACGC ACCTTCCTGC ATTCCTACGA CCACCGGGTC GACCCCAGGG GCCGATTGCT GGAAAGCATC CTCACCGGCC CGCTGATCGT CGCCCAATGG ATCAACATGG AACACTACTT CTCGGCGGTC GACAATGAAC ACTACGGTTC AGGCAGCAAG GTCTACCACA ACGTTGCCTG TCGCATCGGG GTCATGACCG GCAACCTGTC GGACCTGCGC ACCGGGCTGC CCGCGCAGAC CGTGCACTAC AACGGTTTGC CCTACCACGA CCCGATCCGT CTGCTGGCCC TCATCGAGGC GCCGCTCGCG CACGCGCGCC GGGCCATCGA GAACGTGCCC AAGGTGCGCA GCCTGATCGC GAATGGCTGG GTGCAGTGCG TCGTCCTGGA CCCCGAAACC GGCGGACTGC ACCGATACGT CGACGGCAAC TGGCACGACG AACCCCTGCC CAACACCGCG CAAGGCGCAA CCGATCGACC TGCTGAGGAG GACCACCCTG CATGA
|
Protein sequence | MKLTLGRKLK IRSMVHMAAE PIPNFWPMRT FIHHNPLHGL EDLPFPEAIQ RGEVLFHGRG FLPRADYQRL FREGHVDRDT LETRVEAALE NRPALQAIGG IDLLCTLLTG YHDPVITPRT LADVDDVAAA LNGREHTPSA TDTGILAERL HAAFPVIQPL YEALDSLFGT RIGTTLDEQL TKICLDYFDE GQSAWQMPGR EHGLFAAWKT IVHHHPRLLL RGQHTQTILA DHDSPEAIIA HVLDEIGIPE EAWPDLIVRE LTRLHGWAGF IRWRSTANHY HWAREHPADL IDFLAIRLVL GLSLIREHSR RHKMPGDRKA LETLLDERPA ECYLRREFFG GEVLPVYAET VEAVIARGKS NEIARQLDQY LPAKRRAEGR DQARALGSLA SLTGNPSLFA DLDRSQLAAV LEQLGDFEQH EGMVWLEAME ATYRRSVLSS LRLEPPAPRA KRPFAQLLFC IDVRSERVRR QLETIGDYQT YGIAGFFGVP VSFIALGKGS EDHLCPVVVT PRNVVLEVTT GGEPLDLDLY SSAEHLLHDL KNSVLSPYFT VEAIGLLFGF DMIGKTIAPA AYHRWRDRIE ADQPSTRLMV DKLTREQADS VVRAQQRAII VQAIHQAFDI EREAITDAMI RELRETALGH CSGQTHFARD FGLNERDESA FIKTLQDDYR INRSYAQMQL ERLGRIGFSL DEQTYFVAQA LRSIGLTEQF SRFIILTGHG SRSDNNPYES ALDCGACGGS HGIVSARVLA HMGNKPEVRR RLRAQGIDIP DDAWFLPAMH NTTSDEIRLH DLDLLPTSHL VYLERLRNGL RAATRLVARE RLQALDPDRE PAPDAVKAAR KAQRNAVDWA QVRPEWGLAR NAGFIIGRRH LTETTDLKGR TFLHSYDHRV DPRGRLLESI LTGPLIVAQW INMEHYFSAV DNEHYGSGSK VYHNVACRIG VMTGNLSDLR TGLPAQTVHY NGLPYHDPIR LLALIEAPLA HARRAIENVP KVRSLIANGW VQCVVLDPET GGLHRYVDGN WHDEPLPNTA QGATDRPAEE DHPA
|
| |