Gene Information |
Locus tag | TK90_0633 |
Symbol | |
ID | 8806382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 673560 |
End bp | 674966 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003459884 |
Protein GI | 289207818 |
COG category | |
COG ID | |
TIGRFAM ID | |
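The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions inferred from the numbers in this record, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 673560
end_bp = 674966

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1407 bp) and protein length (468 aa).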
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0599243 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC AGACCCCAGG CAATCACAAC CAGAACGGCG AAGGTGCCGG TGCCGGCACC ATGAAGCCCG GCTACCACAA TACCGACGGT CAGGGCATCG CCGGTCACGG TTCCTTCTTC CAGGCGACCG ACCTGTCCAA GGAAGACGCG ATCAAGGCGA CCGACTGGGT GCGCAAGCAC GTCGACCGCC GTACGGTCGA CCTTGGTGAC CGTATGGACG ACGTCCGCGA GCACATGTAC GAGCTCGAAA AAGACGGTCA GATCATCATC CACCGCATCG AGGACCAGCA CGAGCCGATG TCGGTGAAAA CCCTGTTCGG CTGGGACAAG AAGGTCCCGA CCAAGCAGCT CTGGCACCAC AAGTCCTGTG GCCAGTGCGG GAACATCCCG GGCTACCCGA CCTCGCTGCT GTGGTTCATG AACAAGTTCG GCTTCGAGCC GGGCAAGGAC TACCTCGACG AGACCGACCA GACCTCCTGC ACCGCGTGGA ACTACCACGG TTCCGGTATC GGGAACGTCG AGTCGCTGGC CGCGGTGTTC CTGCGCAACT TCCACCAGGC CTACGTCTCC GGCAAGCAGC ACGGCCATGA GCTGGGTCAC TTCTACCCGC TGGTCCATTG CGGCACCTCC TTCGGGAACT ACAAGGAGAT CCGCAAGTAT CTGGTCGAGT CCGCCGAGCT GCGCGAGAAG GTCACCAAGA TCCTCGGCAA GCTGGGTCGC CTGGTCGACG GCAAGCTGGT CATCCCGGAA GAGGTCATCC ACTACTCCGA GTGGGTGCAC GTGATGCGCA ACCGCATCGC CTCCGAGCTG CAGACCATCG ACGTCTCCAA TATCCGTACC ACTGCCCACG TGGCCTGCCA TTACTACAAG ATGGTGCACG AGGACGCGGT GTACGACCCG TCCGTGCTGG GTGGCAACCG TACCGCGATC ATCACCTCCA CCGCCCAGGC CCTGGGTGCG CAGGTGATCG ACTACTCCAC GTGGTACGAC TGCTGCGGCT TCGGCTTCCG CCACATCATC TCCGAGCGCG AGTTCACCCG CTCGTTCACG ATGGATCGCA AGATCCGCGT GGCCCGCGAG GAAGCCAACG CCGACGTGAT GCTGGCCAAC GACACCGGCT GCGTCACGAC CATGGACAAG AACCAGTGGA TCGGTAAGGC CCACAACCAG AACTTCCAGA TTCCGATCAT GGCGGAAGTC CAGTTCGCGG CCCTGGCCTG CGGCGCGGAT CCGTTCAAGA TCGTCCAGCT GCAGTGGCAC GCATCGCCCT GCGAAGACCT GGTCGAGAAG ATGGGGATCT CCTGGGACGA GGCCAAGAAG ACCTTCCAGG AATACCTGAA GGAAGTCGAG GCCGGCAATA TCGAATACCT CTACAATCCT GAGCTCGCTT ACGGGGGCAA AGTCTGA
Protein sequence | MSEQTPGNHN QNGEGAGAGT MKPGYHNTDG QGIAGHGSFF QATDLSKEDA IKATDWVRKH VDRRTVDLGD RMDDVREHMY ELEKDGQIII HRIEDQHEPM SVKTLFGWDK KVPTKQLWHH KSCGQCGNIP GYPTSLLWFM NKFGFEPGKD YLDETDQTSC TAWNYHGSGI GNVESLAAVF LRNFHQAYVS GKQHGHELGH FYPLVHCGTS FGNYKEIRKY LVESAELREK VTKILGKLGR LVDGKLVIPE EVIHYSEWVH VMRNRIASEL QTIDVSNIRT TAHVACHYYK MVHEDAVYDP SVLGGNRTAI ITSTAQALGA QVIDYSTWYD CCGFGFRHII SEREFTRSFT MDRKIRVARE EANADVMLAN DTGCVTTMDK NQWIGKAHNQ NFQIPIMAEV QFAALACGAD PFKIVQLQWH ASPCEDLVEK MGISWDEAKK TFQEYLKEVE AGNIEYLYNP ELAYGGKV
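The sequences in this record are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the Python below strips the spacing, computes GC content, and translates codons to amino acids. To stay short it uses only the first 30 bases of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments are the same as those of translation table 11 (the two tables differ in permitted start codons).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks (30 bases) of the gene sequence in this record.
prefix = clean("ATGAGTGAAC AGACCCCAGG CAATCACAAC")
print(translate(prefix))
print(gc_fraction(prefix))
```

Translating this prefix gives `MSEQTPGNHN`, which matches the first block of the protein sequence above. Applying the same functions to the full gene sequence would be the way to check the listed protein length and GC content.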