Gene Information |
Locus tag | TK90_2138 |
Symbol | |
ID | 8807913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2258154 |
End bp | 2259149 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | PhoH family protein |
Protein accession | YP_003461364 |
Protein GI | 289209298 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
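The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for TK90_2138.
length = gene_length(2258154, 2259149)
print(length, expected_protein_length(length))  # 996 331
```

Under these assumptions the results agree with the listed Gene Length (996 bp) and Protein Length (331 aa).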
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.888079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.261527 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACC GTCTGGATTT CACCCTGGAG CCGGGTGACC AGGAGACGCT GGCGCGTCTG TCCGGTCCGC TGGATGCCCA TCTGCGCCTG ATCGAGTCGC GCCTGGGTGT GGCGGTGCAC AACCGTGGAC CGCGCTTCAA CCTGACCGGG CCAAAGGCCG CGGTGCAGGC TGCCGGCGCG CTGATTCACG ACCTCCACCG CATGGCGCAG GAGGAACCGG TCAGCCTGGA ACAGGTGCAC GCCGCGTTGC AGACCGCGCA TATGGAAGGG CTGGCGGAGG CCGAGCCGGC CGCAGCCGAT CCCGCCCTGC GCCTGAAGCG CGCGGTGATC CGGGGCCGTG GTCCGCGCCA GAGCGGCTAC CTGGCCTCGA TCCAGCAGCA TGACCTGAGC TTCGGGATCG GACCCGCCGG GACCGGCAAG ACCTTTCTCG CGGTCGCCGC CGCTGTCGCG GCGCTGGAGC AGGACCGCGT GCAGCGCCTG GTGCTGGTGC GTCCGGCGGT GGAGGCGGGC GAGCGCCTGG GCTTCCTGCC CGGCGATCTC GCGCAGAAGA TCGATCCCTA CCTGCGGCCG ATGTACGACG CCCTGTACGA ACTGATGGGC TTCGACCAGA CCGCACGCCT GATGGAGCGT CAGGTGATCG AGGTGGCCCC GCTGGCCTAC ATGCGCGGGC GCACGCTGAA CGAGGCCTTC ATCATCCTCG ACGAGGCACA GAACACCACC GTCGAGCAGA TGAAGATGTT CCTCACCCGC ATCGGCTTTG GTTCCACGGC GGTCGTCAAC GGCGATATCA CCCAGGTGGA TCTGCCGCGC GGCCAGCGCT CGGGTCTGAA ACACGCGATG CAGATCCTGC AGGGTGTGGA GGGTGTGAGC ATTACGCAGT TCCAGGCTGG CGACGTGGTG CGCCACCCGC TGGTGCAGCG CATCGTCGAG GCCTACGACG CACACGAGGA TGACGCGGCG CAGGAGGGCG GACCCCGAGG CGATGGCGAC GCTTGA
|
Protein sequence | MSNRLDFTLE PGDQETLARL SGPLDAHLRL IESRLGVAVH NRGPRFNLTG PKAAVQAAGA LIHDLHRMAQ EEPVSLEQVH AALQTAHMEG LAEAEPAAAD PALRLKRAVI RGRGPRQSGY LASIQQHDLS FGIGPAGTGK TFLAVAAAVA ALEQDRVQRL VLVRPAVEAG ERLGFLPGDL AQKIDPYLRP MYDALYELMG FDQTARLMER QVIEVAPLAY MRGRTLNEAF IILDEAQNTT VEQMKMFLTR IGFGSTAVVN GDITQVDLPR GQRSGLKHAM QILQGVEGVS ITQFQAGDVV RHPLVQRIVE AYDAHEDDAA QEGGPRGDGD A
|
| |
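The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a record could be processed, the code below strips the spacing, computes GC content, and translates a coding sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG) and uses the standard codon assignments, which translation table 11 shares with table 1; the tables differ only in which codons may act as start codons. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAGCAACC GTCTGGATTT CACCCTGGAG")
print(translate(fragment))  # MSNRLDFTLE
```

The translated fragment matches the first block of the listed protein sequence. Running `gc_percent` on the full cleaned gene sequence would give a value to compare with the GC content field.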