Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3749 |
Symbol | |
ID | 4069324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4421992 |
End bp | 4425048 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985771 |
Product | putative signal transduction histidine kinase |
Protein accession | YP_592823 |
Protein GI | 94970775 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3292] Predicted periplasmic ligand-binding sensor domain [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0973084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATCT GTAATCCGGG AAAAATCGCT TCCACCCAAT GGCTCTCGAT TAGTCTTCTT CTGTTCTGCA TCATCACGCC CGGTTACGCC CTCAATCCTA AGAGTCACAT CACACAATTC GGCCACACCG CTTGGCGTGT GCAAGACGGA ATCTTCACTG GCACGCCCAG AACCTTGGCG CAGACGAAGG ATGGATATCT TTGGATAGGG ACGACGGCCG GATTGGTTCG TTTCGACGGC GTGCGGTTTT CTCCCTGGAG TCCTGCGAAT GGTGAAAAGC TCCCGTCGAA GAGAATTAAC TCGTTGCTCG GATCAACGGA CGGGAGCCTC TGGATCGGAA CGAGTGTTGG CCTTAGTCGT TGGCAAGACA ACCGCCTAAT CAGCTATTCG GATTTGCACG GAGTTACAAC GGCAATTTTT GAGGATAGAG ACAAAGCGGT CTGGGCCGCC ATTTCACCTT CACCATCCAA CACGCCATTG TGTCAGATTA GCGACTCCGC GATCCACTGT TATGGAACTG CCGATGGCAT ATCGCCTCAT CGTCTCTGGC CAATTGCAAA AGATGGTGAA GGAAATTTCT GGATCGGCGG CGACGAGTTA GTTCGCTGGA ATTTGAAATC ACACAGGAGT TACGAGGTCA TCGGAGCCGA ACATAGCAAT GCGTCGGTGA GCTCAATCAT TCCCGGAAGC GACGGGTCGT TATGGGTCGG GATCGATACC AAAGGTCCCA GGTTCGGCCT GCAGAGACTT GCCGGCGGTG CGTGGAAGCC CTTCGTAACC CCCGAGTTTA ATGGCACGAC TGTCGCTGTG AACGCGTTGC TCCTTGACCG TGACAATGCC CTTTGGGTCG GTACCGCCAG CAATGGGATT TACCGAATCT ACGACGGAGC GGTTGAGCAC TTTAGCAATG CGGATGGCCT GTCCAGCGAT TTCGTTTACA AGTTTCTTGA AGATGCCGAA GGGACTGTGT GGGCCGTTAC CGCGAAAGGC ATCGATAATT TTCGAGAATT AAAGGTCACG ACCTTTTCCA CACGCGAAGG TCTCGCTGCG GAGGAAGTCG ATTCCGTTTT CGCTACCCAC GATGGCGGCA TCTGGGTGGG AGGCCCCAGC TCTTTAGAAG TCCTTCGGAA CAGCCGGATA TCTTCAGTCT TGGCGGAACT CCACCTCTCA GGAGCAGCCA CATCCTTTCT TGAAGACCGT ACCCACCGAC TCTGGATCGG CATAGATGAC ACTTTGACGG TCTACGATGG TCGCAAAGTC CAGCGGATCG CACGTAGCGA TGGCAGCCCG ATGGGAATGG TCTATTCGAT GACCGAGGAT ACTGCGGGCG ACTTATGGGT CGAGACCCGC GGGCGACTCA CGCGCATTCG AGGGTTAAAA GCGGTTGAAG AGCTTCGTCC GCCGCAGGTC CCCTCAGCAT TCCGGGTCGT TGCCGATGCG AAGGGTGGCG TATGGCTCGG GCTGGCAAAT GGCGATCTGG CGCACTATCA GAACGGCCAC GCCGAGACAT TTCACTTCAG ACACAGCAGC GACACGCGAG TGGAGCAACT GGATGTAAAT GCAGATGGCT CGGTGGTCGG AGCCACGGCA GACGGGCTTG TGGGATGGCG GAACGGGACC CTGGCGACAC TTACGACGCA GAACGGCCTT CCTTGCGACA TTGCCTACGC GTCTCATTTC GATCGTGCAG AAAACCTTTG GATCTACATG CAATGCGGGC TCGTCGAGAT CAAGAAAGAT CAGGTGCAAA GCTGGTGGCA ACACCCCGAC GCGGCGGTCA AGTATGAGTT ATTCGATGTG TTCGATGGGG CACAGCCAGG CCGAGCGCCG TTCGGAGGCG TGACGAGAGA CTCAGAAGGA CGCTTGTGGT TCGCCAGTGG CGTGGTGTTG CAAACCGTCG ATCCGGAGCA CCTTTTGTCG AATTCCGTGC TGCCTCCTGT CCAGGTTGAG TCGATCGTTG CGGACCGCCG GAATTACATA CCTCAGCTCG GACTTCGCCT GCCACCGCTC ACACGAAATC TCGAGATCGA CTACACCGCT CTGAGCTTCG TGGTGCCGCA GAAGGTGTAC TTCCGATACA AACTTGAGGG CCGCGATGAG AGCTGGCAGG AGTCAGGCAC CAGACGTCAA GCTTTCTATA CCGATTTGCG TCCTGGGAAC TATCGTTTCC GTGTAATGGC TTCGAACAAC GACGGTATCT GGAACGAGCA AGGTGCAACG GTGGCCTTCT CCGTCGCAGC TGCTTGGTAT CAAACCAACT TATTCCGTCT CTTCTTGCTC TTCACTGCGA TCTTCATCGC ATGGCTGCTT TATCAAATGC GAGTCCGCCA GATTGCGAAA GCGATCAGCG CACGATTCGA TGAGCGACTC GCCGAACGTA CTCGGTTGGC GCGGGAACTT CACGACACTT TTCTTCAAAC CTTGCAGGGC AGCAAAATGG TAGCCGAAGT TGCGCTTAAC GGGCCCGCCG ATCCTGTTCG CATGCGTAAC GCAATCCAGC GCGTACTGGA ATGGCTTGAC AAGGCAATTC ACGAGGGCCG AGCCGCTCTG CATTCTCTTC GGAGTTCCAC CGTCATGGGG AATGATTTGG CCGAGGCCTT TCAGCGTGCC ACCGAGGACT GTCGCCTGCA AGGAATCAAC GAAGTCTCAT TCGTCGCGGA AGGCATCTCG ACAGAAATGC ACCCCATCAT TCGCGATGAA ATTTATCGTA TCGGCTACGA GGCGATCCGG AATGCGTGTC AGCACTCCGA GGCCGGCCGT CTTCAAGTTC GGCTATCGTA CGGGGCAGAT CTCGCGCTGC GGGTGTCGGA CAATGGCAAG GGGATCGAGC CGAAAATCGT CACTCTGGGA AAAGACGGAC ACTACGGATT ACAGGGGATG CGAGAACGGG CACAACGCAT TGGCGCAAAG CTCCTTATCG AGAGTTTGCC AACCTCCGGC ACTACTCTCG AATTGATCGT TCCCGGCCAC GTCGTCTTTC AGAATCCAAG ATCGACTTGG TCGAGCCGCC TACAGAAACT GACGGCCTTC TTTCGCAGTC CTGATAATCC GGCTTGA
|
Protein sequence | MGICNPGKIA STQWLSISLL LFCIITPGYA LNPKSHITQF GHTAWRVQDG IFTGTPRTLA QTKDGYLWIG TTAGLVRFDG VRFSPWSPAN GEKLPSKRIN SLLGSTDGSL WIGTSVGLSR WQDNRLISYS DLHGVTTAIF EDRDKAVWAA ISPSPSNTPL CQISDSAIHC YGTADGISPH RLWPIAKDGE GNFWIGGDEL VRWNLKSHRS YEVIGAEHSN ASVSSIIPGS DGSLWVGIDT KGPRFGLQRL AGGAWKPFVT PEFNGTTVAV NALLLDRDNA LWVGTASNGI YRIYDGAVEH FSNADGLSSD FVYKFLEDAE GTVWAVTAKG IDNFRELKVT TFSTREGLAA EEVDSVFATH DGGIWVGGPS SLEVLRNSRI SSVLAELHLS GAATSFLEDR THRLWIGIDD TLTVYDGRKV QRIARSDGSP MGMVYSMTED TAGDLWVETR GRLTRIRGLK AVEELRPPQV PSAFRVVADA KGGVWLGLAN GDLAHYQNGH AETFHFRHSS DTRVEQLDVN ADGSVVGATA DGLVGWRNGT LATLTTQNGL PCDIAYASHF DRAENLWIYM QCGLVEIKKD QVQSWWQHPD AAVKYELFDV FDGAQPGRAP FGGVTRDSEG RLWFASGVVL QTVDPEHLLS NSVLPPVQVE SIVADRRNYI PQLGLRLPPL TRNLEIDYTA LSFVVPQKVY FRYKLEGRDE SWQESGTRRQ AFYTDLRPGN YRFRVMASNN DGIWNEQGAT VAFSVAAAWY QTNLFRLFLL FTAIFIAWLL YQMRVRQIAK AISARFDERL AERTRLAREL HDTFLQTLQG SKMVAEVALN GPADPVRMRN AIQRVLEWLD KAIHEGRAAL HSLRSSTVMG NDLAEAFQRA TEDCRLQGIN EVSFVAEGIS TEMHPIIRDE IYRIGYEAIR NACQHSEAGR LQVRLSYGAD LALRVSDNGK GIEPKIVTLG KDGHYGLQGM RERAQRIGAK LLIESLPTSG TTLELIVPGH VVFQNPRSTW SSRLQKLTAF FRSPDNPA
|
| |