Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2492 |
Symbol | |
ID | 8808276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2614374 |
End bp | 2617574 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003461718 |
Protein GI | 289209652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAG ATAGCGCTGC CAAATCCCTG TCCAGTGGCC AATGGGTCGG CCTGCATGAA CGGTTGGATA CGCCGCGGGT ACGACAGGCG CTCGAGGCCC TGGGGTTGGA GGATCACGCC CGCTGGAGCC AGGGTCTAGA GGGCGATGAT CTGGCGATGG CCCTGGTCGC CGAGCTGGCG CAGCGGCTTG CGGAGCGCGC AGGTGAGTTG CGCGAAAAGG GACACGAGGA CTGGACGCGA GCCGTGCAGG GGCTGGCGGA TGCGTTGGCG GCGGCGGGGC ATCCGCTTTC GGGGATCGAG GAGGACCTTC CGCAGCCGCC GTTTCGGCAG TTGCTGGAGG TGCGTTCCCC GTCGGCTGAT CAGGTTGGGC AGCAGGAGAC GCCACGGCCG GATGTTCCCT TGGCTCTGTC CAGCCTGCTG ACTGGGTCCG ATCGCACACC GAGTCTGGTC ACGCAGATCG AGAAGGAGCT GGCCAGTGCG GAGCGGGCCG ATTGGTTGGT GTCCTTCATC AAGTGGAGTG GGATTCGCAC CCTGCGCGAG ACGCTGAAGC GCTTTACCGA GACCCCGACG GCCGATGGCT CGCCCAGGCT GCGGATCGCG ACGACCTCGT ACCTCGGGGC TACGGATCTC AAGGCGATCG AGTTTCTGCT GGGCCTGCCG AACACGGAGG TTCGGGTTTC GTACGACACC CACCGCACGC GTCTGCATGC CAAGGCTTAC CTGTTCCATC GCAGCACCGG CTTCGGCAGC GCCTACATCG GGTCGGCGAA TGTCTCGCGG GTGGCGATGG ATGAGGGCCT GGAGTGGACG GCCAAGGTCA GCGAGTACGA GCTGCCGCAT ATGTGGCGCC AGATCCAGAC CTCGTTCGAG GCGCATTGGG CAGACCCGGC GGAGTTCGAA CCGCTCACCA CGGACGACCT ACCCCGTCTG CAACAGGCAC TGGAATCGGA ATTCAGCTCG GGGCCTGGGG CGCAGAAAGA AGCCACCACG TTCTTCGACC TGCGGCCGTA TGCCTTTCAG CAGGAGATCC TGGAGGACAT CCGGCGCGAG CGCCGTTCGG GCATCGACCG TCATCTGGTG ATCGCGGCCA CCGGGACCGG CAAAACCATG ATCGCCGCGT TCGACTATCG CCGGTTTGCG CGCGAGCAGG CGGACAAGGG GCGCCCGTCG CTGCTGTTCA TCGCGCACCG CGAGGAGATC CTGCGTCAGG CGATGGCCAG CTTTCGGCAA GTGCTGCAGG ACCACGAGTT TGGCGACGTG CTGGTGGGCG GGGCGCGTCC GAGTCAGGAC CGCCACCTGT TCTGTACGGT GGCGAGTTGG AATGCACGGG ACCTCGATCG CCTAGACCCG GAGCATTTCG ATTACGTGGT GCTGGATGAG GCGCACCACA GTAGCGCCGG CAGCTACCAG CGCATCCTCG AGCATGTGCG GCCCAAGTGC CTGCTGGGTC TGACGGCGAC GCCGGAGCGC ACGGATGCCG ACGACATCCG CCAGCACTTC GGCCAGCGTT ACACGCATGA GATCCGCCTG CCGGATGCGA TCGAGCGCCG CCTGCTGGTG CCGTTCCACT ACTTCGGTGT AGCCGACCAT GAGTCCGTGG ATCTGAGCCG AGTGAGTTGG CGGCGCGGGG GCTATGACGC CGAAGAATTG GAAGGGGTGT TCACCGGCAA CCGGGCGCGC GCCGATTGGG TCTTGCAGCA GGCCCATGAA CACGCCACGG ATCTCGGCTC GGTCAGGGGC TTGGGTTTCT GCATCAGTCG CGCCCATGCC CGGTTCATGG CGGAGTATTT TTCGGCCAAG CAGGTGCCCA GCGCGGTGCT GACGGGCGAC AGCTCAGATC ACGAGCGGAA GTCGATACAG CGTGACCTGC GCGAGGGGCG CATCCGCTTC ATCTTTACGG TGGATCTCTA CAACGAAGGC GTGGACCTGC CGGAGGTGGA CACCGTTTTG CTGCTGCGCC CGACCGAGAG CCTGACGGTC TACCTGCAGC AGATGGGCCG CGGCCTGCGC TTGCACGAGG GCAAGCCGCA TCTGACCGTG CTGGATTTCG TCGCGCCGCA GCACCGGCGC TTTCGCTACG CCGCTCGGTT CCGGGCGCTC AGCGCGCGGC CGGAGAAGCG GGTGGATCAC CAGATTCAGG AAGGCTTCCC CTGGTTACCC AGCGGCTGCC TGATCCACCT CGACCGCCTG GCCACCCAGC ACGTGCTCGA CAACATCCGT GCGCAGATCA ACCGCCAGCG ACCGCAGATC GTGCGCGAGC TGAAGGACTG TTTCCGGGAG GACATCGACT CCGCCAGCCT GCCCAAGATG CTCGACTGCC TGCATATGGA TGAAGCGGAA GACCTGCTGC GCAAGGACCT GCCCTCCCGG CTGCGCGCGG ACTGCGGAGG CGAGTCGACC GAAGCGCTGG GACCCTATGT CAGCGGCCTC AAACGTGGGC TGCAGCGCTT AGCGCGGTGT GACGATCACC AGTTGCTGGA AACCCTGCAG ACCCATCTGG CGCAGCCGGC CGCGAACGTG GAGCAGCTTG CCGAAGAAGA TCGGCTCCGG CTGGCCCTTG TCCACAGCAC GCTATGGGGC AAGACGCGGC CGGGGGACGG GGACCTATCC GCCGTGGACG CCTTCGTCCG CGAACAGCGC CCCCTGTACC GGGATCTCCA GGAAGTCGCC GCCTGGCGAC ACGAGCGTAT CGTCCCCGCC AGCGGCATCA CCTTCCCGGA GCAGACGGGG CCACTGGAAC TGCACGCGTC CTACACCCGC GAGCAGATCC TGCTGGCGCT GGGGCTCGGT AGCCTGGAGA AGCCGATCAC CCAGCAGGAG GGTGTGCGGC ACGTACCGGA GCGGCGGGTG GATGCGTTGT TTGTGACCCT GGAGAAATCC GAGCAGGAGT TCTCGCCCAC CACCATGTAC GAGGACTACG CCCTCAACGA GCGGCGGTTC CACTGGCAGT CACAGTCCGG CACCTCACCC ACCTCGGAGA CCGGGCAGCG TTACATCCAC CACGAAGAGC TCGGCTACAC GCCCATGCTG TTCATTCGGC GGGCTAAGCG TGATGCGACG GGCCTCAGTG AGCCATTCAC CTTCTGCGGC CCGGTGCGTT ACCAGCACCA CGAGGGCAGC AAGCCGATGA GCATCATCTG GGAGCTGGAG TACGAGATGC CCGCGCGGCT CCTGCACCTC GCCCGCCGCA CGGTCCTGTA A
|
Protein sequence | MSRDSAAKSL SSGQWVGLHE RLDTPRVRQA LEALGLEDHA RWSQGLEGDD LAMALVAELA QRLAERAGEL REKGHEDWTR AVQGLADALA AAGHPLSGIE EDLPQPPFRQ LLEVRSPSAD QVGQQETPRP DVPLALSSLL TGSDRTPSLV TQIEKELASA ERADWLVSFI KWSGIRTLRE TLKRFTETPT ADGSPRLRIA TTSYLGATDL KAIEFLLGLP NTEVRVSYDT HRTRLHAKAY LFHRSTGFGS AYIGSANVSR VAMDEGLEWT AKVSEYELPH MWRQIQTSFE AHWADPAEFE PLTTDDLPRL QQALESEFSS GPGAQKEATT FFDLRPYAFQ QEILEDIRRE RRSGIDRHLV IAATGTGKTM IAAFDYRRFA REQADKGRPS LLFIAHREEI LRQAMASFRQ VLQDHEFGDV LVGGARPSQD RHLFCTVASW NARDLDRLDP EHFDYVVLDE AHHSSAGSYQ RILEHVRPKC LLGLTATPER TDADDIRQHF GQRYTHEIRL PDAIERRLLV PFHYFGVADH ESVDLSRVSW RRGGYDAEEL EGVFTGNRAR ADWVLQQAHE HATDLGSVRG LGFCISRAHA RFMAEYFSAK QVPSAVLTGD SSDHERKSIQ RDLREGRIRF IFTVDLYNEG VDLPEVDTVL LLRPTESLTV YLQQMGRGLR LHEGKPHLTV LDFVAPQHRR FRYAARFRAL SARPEKRVDH QIQEGFPWLP SGCLIHLDRL ATQHVLDNIR AQINRQRPQI VRELKDCFRE DIDSASLPKM LDCLHMDEAE DLLRKDLPSR LRADCGGEST EALGPYVSGL KRGLQRLARC DDHQLLETLQ THLAQPAANV EQLAEEDRLR LALVHSTLWG KTRPGDGDLS AVDAFVREQR PLYRDLQEVA AWRHERIVPA SGITFPEQTG PLELHASYTR EQILLALGLG SLEKPITQQE GVRHVPERRV DALFVTLEKS EQEFSPTTMY EDYALNERRF HWQSQSGTSP TSETGQRYIH HEELGYTPML FIRRAKRDAT GLSEPFTFCG PVRYQHHEGS KPMSIIWELE YEMPARLLHL ARRTVL
|
| |