Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1274 |
Symbol | |
ID | 8807038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1349744 |
End bp | 1350901 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Cupin 4 family protein |
Protein accession | YP_003460519 |
Protein GI | 289208453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCCC AAGACCCGAA TCAGGGCCTG GAGCTACTGG GCGGGCTCTC GCCGGCGGAG TTTCTGCGCG ACTACTGGCA ACAAAAGCCG TTGCTCGTAC GCGGCGCAGT ATCGGGCTTC GCCAACCCCA TCGAGCCCGA CGATCTGGCC GGGCTCGCCT GTGATCCGGA TGCCAGCGCC CGACTCGTAC TCGGCGACAC CGACCACGGC GACTGGGCAG TCGAATACGG CCCGTTCGAG GAAGATCGCT TCGCCTCCCT GCCCGATCGC GCCTGGACGC TGCTGATCAG CGATGTCGAG CGCTTCTGGC CCGAAGGGCA CGACTTCCTG GCCCGGTTCG ACTTTGTCCC GCGCTGGCGC CGCGACGACC TGATGATCTC CTACGCATCA CCCGACGGCT CGGTCGGGCC CCATGTCGAT GCCTACGATG TCTTTCTGTT CCAGGCGGCC GGGCGGCGGC GCTGGCAGAT CCAGTCACCA CCGGGACCGC TGGACTGCCA CGACGACCTG CCGCTGGCGA TCCTGCGCGA GTTTGAACCA ACCGAGAGCT GGGACCTTGA ACCCGGCGAC CTGCTCTACC TGCCCCCCAA CCTGCCCCAC TACGGCCTTT CACTGGATGA CCAGTGCATG ACCTGGTCGA TCGGTTTTCG GGCCCCGACC TACCTCGACC TGCTGACCGG GTTCCTGGAG GAACGGGCCA ACCGGGTTGG CGAAGCGCCC CGATACAGCG ATCCCCAGCG CCCGGTGTCC GCCTACGTGA GCGAACTGCC GTCACACGAC CGCACCCGAC TGCGTGACAT CCTGCGCGAG ATGCTCGCGG CCGACGACAC GGAACTGGAT GCCTTCCTTG GGCGCTTCCT GACCCGCCCG GCGGGGAACG TCGAACTGCA TACAGGTGAT CCCCCCGCCG AAGCGAGGGA GTGCCGTGTA CACCCGGGTA TTCGGCGCTA CTGGCTGCAG ACACCGGCCG GGCCGATCCT CTGCGCCGCC GGCCACAGCT ATCCGGCATC CAGCCTGGCT CCTGGGGATC TGGAACAGCT GTGTGCGACG GAGATCGTCA AGCCCGAGCA GTGGGAACAC CAGTGGCCGG CGGTTCACGA CATGCTTCGC GACGGGCTGG AGGAAGGCTG GCTCGAGCCC GCTGACGGCC CGAACTAG
|
Protein sequence | MPAQDPNQGL ELLGGLSPAE FLRDYWQQKP LLVRGAVSGF ANPIEPDDLA GLACDPDASA RLVLGDTDHG DWAVEYGPFE EDRFASLPDR AWTLLISDVE RFWPEGHDFL ARFDFVPRWR RDDLMISYAS PDGSVGPHVD AYDVFLFQAA GRRRWQIQSP PGPLDCHDDL PLAILREFEP TESWDLEPGD LLYLPPNLPH YGLSLDDQCM TWSIGFRAPT YLDLLTGFLE ERANRVGEAP RYSDPQRPVS AYVSELPSHD RTRLRDILRE MLAADDTELD AFLGRFLTRP AGNVELHTGD PPAEARECRV HPGIRRYWLQ TPAGPILCAA GHSYPASSLA PGDLEQLCAT EIVKPEQWEH QWPAVHDMLR DGLEEGWLEP ADGPN
|
| |