Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0413 |
Symbol | |
ID | 8806153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 434428 |
End bp | 435846 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | PhoH family protein |
Protein accession | YP_003459664 |
Protein GI | 289207598 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.205876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA CACCGGAGGA CCCGAAACGC CTGTTTGTCC TGGACACCAA TGTCCTGATG CACGATCCCA CCGCGCTGTT CCGCTTCAAG GAACACGACA TCTTCCTGCC CATGGTCGTG CTCGAGGAGC TCGACCGCGG CAAGAAGGGG GTCTCCGAGG TCGCGCGCAA CGTGCGCCAG GTCTCGCGCT TTATCGACGA GTTGATGAGT GGCGCGACCC ACGAGCAGAT CGCCGCCGGG TTGCCGCTAC AGGCCGAATC GCTCAGCGAG TCGACGCTTG CACCCTCCGG GCGGCTGTTT TTTCAGACCC GCAATCTGAC CTCGCGCCTG CCGGACTCGC TGCCCGGCTC GACGCCGGAC AACGACATCC TGGCCACCGC CCAGGCACTC AAGACGGAGT GGTCGGATCT CCCGGTCACG CTGGTCTCCA AGGACATCAA CCTGCGCATC AAGGCGGCGG TGATCGGCCT GCACGCGGAG GACTACTACA ACGACCAGGT CCTGGACGAC GTCAGTCTGC TGTTCTCCGG CATGACCGAG CTGCCGGCGG ACTTCTGGGA GACCCACGGG GCCAAGATGG ACTCCTGGCA GGAGTCCGGC CGGACCTACT ACCGGGTCAC CGGACCCCTG GTCAGCGAGT GGCAGCCCAA CCAGTTCGTG TACCGCGAGG GGGAGAACCC GCTGTCCGCG ATCGTGCTGG AGATCGATCA GCCGAACGAG TCCGCGGTGA TCGAGCTGGT GGACGACTAC ATGACCCCGC GCCACAGCGT CTGGGGTATC AATGCCCGCA ACCGCGAGCA GAACTACGCC CTGAACCTGC TGATGGACCC GGAGGTGGAT TTCGTCTCCC TGCTGGGGGC TGCCGGCACC GGCAAGACCC TGCTGGCGCT GGCCGCGGGG CTCGCGCAGA CGCTGGAGGA CAACACCTAC AAGGAGATCA TAATGACCCG GGTCACGGTC CCGGTGGGCG AGGATATCGG CTTCCTGCCC GGGACCGAGG AGGAAAAGAT GACCCCCTGG ATGGGCGCAC TGATGGACAA CCTCGAGGTG CTCGCCAAGT CCGAGGGTGC GGGCGACTGG GGGCGGGCGG CCACGAACGA CGTGCTGACC AGCCGCATCA AGATCCGCTC GCTGAATTTC ATGCGCGGGC GCACCTTCCT CAACCGATAC ATCATCCTCG ACGAGGCGCA GAACCTGACC TCCAAGCAGA TGAAGACCCT GGTCACCCGC GCCGGGCCGG GCACCAAGGT GGTGGCGCTG GGCAACATCG CGCAGATCGA CACCCCGTAC CTGACCGAGA CCTCGTCCGG CCTGACCTAC GTGGTGGACC GCTTCCGCAA CTGGACCCAC AGCGGCCACA TCACGCTCAC CCGCGGCGAG CGCTCGCGGC TGGCGGACTA CGCCTCGAAT CATCTTTAG
|
Protein sequence | MSETPEDPKR LFVLDTNVLM HDPTALFRFK EHDIFLPMVV LEELDRGKKG VSEVARNVRQ VSRFIDELMS GATHEQIAAG LPLQAESLSE STLAPSGRLF FQTRNLTSRL PDSLPGSTPD NDILATAQAL KTEWSDLPVT LVSKDINLRI KAAVIGLHAE DYYNDQVLDD VSLLFSGMTE LPADFWETHG AKMDSWQESG RTYYRVTGPL VSEWQPNQFV YREGENPLSA IVLEIDQPNE SAVIELVDDY MTPRHSVWGI NARNREQNYA LNLLMDPEVD FVSLLGAAGT GKTLLALAAG LAQTLEDNTY KEIIMTRVTV PVGEDIGFLP GTEEEKMTPW MGALMDNLEV LAKSEGAGDW GRAATNDVLT SRIKIRSLNF MRGRTFLNRY IILDEAQNLT SKQMKTLVTR AGPGTKVVAL GNIAQIDTPY LTETSSGLTY VVDRFRNWTH SGHITLTRGE RSRLADYASN HL
|
| |