Gene Acid345_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3749 
Symbol 
ID4069324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4421992 
End bp4425048 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content56% 
IMG OID637985771 
Productputative signal transduction histidine kinase 
Protein accessionYP_592823 
Protein GI94970775 
COG category[T] Signal transduction mechanisms 
COG ID[COG3292] Predicted periplasmic ligand-binding sensor domain
[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0973084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATCT GTAATCCGGG AAAAATCGCT TCCACCCAAT GGCTCTCGAT TAGTCTTCTT 
CTGTTCTGCA TCATCACGCC CGGTTACGCC CTCAATCCTA AGAGTCACAT CACACAATTC
GGCCACACCG CTTGGCGTGT GCAAGACGGA ATCTTCACTG GCACGCCCAG AACCTTGGCG
CAGACGAAGG ATGGATATCT TTGGATAGGG ACGACGGCCG GATTGGTTCG TTTCGACGGC
GTGCGGTTTT CTCCCTGGAG TCCTGCGAAT GGTGAAAAGC TCCCGTCGAA GAGAATTAAC
TCGTTGCTCG GATCAACGGA CGGGAGCCTC TGGATCGGAA CGAGTGTTGG CCTTAGTCGT
TGGCAAGACA ACCGCCTAAT CAGCTATTCG GATTTGCACG GAGTTACAAC GGCAATTTTT
GAGGATAGAG ACAAAGCGGT CTGGGCCGCC ATTTCACCTT CACCATCCAA CACGCCATTG
TGTCAGATTA GCGACTCCGC GATCCACTGT TATGGAACTG CCGATGGCAT ATCGCCTCAT
CGTCTCTGGC CAATTGCAAA AGATGGTGAA GGAAATTTCT GGATCGGCGG CGACGAGTTA
GTTCGCTGGA ATTTGAAATC ACACAGGAGT TACGAGGTCA TCGGAGCCGA ACATAGCAAT
GCGTCGGTGA GCTCAATCAT TCCCGGAAGC GACGGGTCGT TATGGGTCGG GATCGATACC
AAAGGTCCCA GGTTCGGCCT GCAGAGACTT GCCGGCGGTG CGTGGAAGCC CTTCGTAACC
CCCGAGTTTA ATGGCACGAC TGTCGCTGTG AACGCGTTGC TCCTTGACCG TGACAATGCC
CTTTGGGTCG GTACCGCCAG CAATGGGATT TACCGAATCT ACGACGGAGC GGTTGAGCAC
TTTAGCAATG CGGATGGCCT GTCCAGCGAT TTCGTTTACA AGTTTCTTGA AGATGCCGAA
GGGACTGTGT GGGCCGTTAC CGCGAAAGGC ATCGATAATT TTCGAGAATT AAAGGTCACG
ACCTTTTCCA CACGCGAAGG TCTCGCTGCG GAGGAAGTCG ATTCCGTTTT CGCTACCCAC
GATGGCGGCA TCTGGGTGGG AGGCCCCAGC TCTTTAGAAG TCCTTCGGAA CAGCCGGATA
TCTTCAGTCT TGGCGGAACT CCACCTCTCA GGAGCAGCCA CATCCTTTCT TGAAGACCGT
ACCCACCGAC TCTGGATCGG CATAGATGAC ACTTTGACGG TCTACGATGG TCGCAAAGTC
CAGCGGATCG CACGTAGCGA TGGCAGCCCG ATGGGAATGG TCTATTCGAT GACCGAGGAT
ACTGCGGGCG ACTTATGGGT CGAGACCCGC GGGCGACTCA CGCGCATTCG AGGGTTAAAA
GCGGTTGAAG AGCTTCGTCC GCCGCAGGTC CCCTCAGCAT TCCGGGTCGT TGCCGATGCG
AAGGGTGGCG TATGGCTCGG GCTGGCAAAT GGCGATCTGG CGCACTATCA GAACGGCCAC
GCCGAGACAT TTCACTTCAG ACACAGCAGC GACACGCGAG TGGAGCAACT GGATGTAAAT
GCAGATGGCT CGGTGGTCGG AGCCACGGCA GACGGGCTTG TGGGATGGCG GAACGGGACC
CTGGCGACAC TTACGACGCA GAACGGCCTT CCTTGCGACA TTGCCTACGC GTCTCATTTC
GATCGTGCAG AAAACCTTTG GATCTACATG CAATGCGGGC TCGTCGAGAT CAAGAAAGAT
CAGGTGCAAA GCTGGTGGCA ACACCCCGAC GCGGCGGTCA AGTATGAGTT ATTCGATGTG
TTCGATGGGG CACAGCCAGG CCGAGCGCCG TTCGGAGGCG TGACGAGAGA CTCAGAAGGA
CGCTTGTGGT TCGCCAGTGG CGTGGTGTTG CAAACCGTCG ATCCGGAGCA CCTTTTGTCG
AATTCCGTGC TGCCTCCTGT CCAGGTTGAG TCGATCGTTG CGGACCGCCG GAATTACATA
CCTCAGCTCG GACTTCGCCT GCCACCGCTC ACACGAAATC TCGAGATCGA CTACACCGCT
CTGAGCTTCG TGGTGCCGCA GAAGGTGTAC TTCCGATACA AACTTGAGGG CCGCGATGAG
AGCTGGCAGG AGTCAGGCAC CAGACGTCAA GCTTTCTATA CCGATTTGCG TCCTGGGAAC
TATCGTTTCC GTGTAATGGC TTCGAACAAC GACGGTATCT GGAACGAGCA AGGTGCAACG
GTGGCCTTCT CCGTCGCAGC TGCTTGGTAT CAAACCAACT TATTCCGTCT CTTCTTGCTC
TTCACTGCGA TCTTCATCGC ATGGCTGCTT TATCAAATGC GAGTCCGCCA GATTGCGAAA
GCGATCAGCG CACGATTCGA TGAGCGACTC GCCGAACGTA CTCGGTTGGC GCGGGAACTT
CACGACACTT TTCTTCAAAC CTTGCAGGGC AGCAAAATGG TAGCCGAAGT TGCGCTTAAC
GGGCCCGCCG ATCCTGTTCG CATGCGTAAC GCAATCCAGC GCGTACTGGA ATGGCTTGAC
AAGGCAATTC ACGAGGGCCG AGCCGCTCTG CATTCTCTTC GGAGTTCCAC CGTCATGGGG
AATGATTTGG CCGAGGCCTT TCAGCGTGCC ACCGAGGACT GTCGCCTGCA AGGAATCAAC
GAAGTCTCAT TCGTCGCGGA AGGCATCTCG ACAGAAATGC ACCCCATCAT TCGCGATGAA
ATTTATCGTA TCGGCTACGA GGCGATCCGG AATGCGTGTC AGCACTCCGA GGCCGGCCGT
CTTCAAGTTC GGCTATCGTA CGGGGCAGAT CTCGCGCTGC GGGTGTCGGA CAATGGCAAG
GGGATCGAGC CGAAAATCGT CACTCTGGGA AAAGACGGAC ACTACGGATT ACAGGGGATG
CGAGAACGGG CACAACGCAT TGGCGCAAAG CTCCTTATCG AGAGTTTGCC AACCTCCGGC
ACTACTCTCG AATTGATCGT TCCCGGCCAC GTCGTCTTTC AGAATCCAAG ATCGACTTGG
TCGAGCCGCC TACAGAAACT GACGGCCTTC TTTCGCAGTC CTGATAATCC GGCTTGA
 
Protein sequence
MGICNPGKIA STQWLSISLL LFCIITPGYA LNPKSHITQF GHTAWRVQDG IFTGTPRTLA 
QTKDGYLWIG TTAGLVRFDG VRFSPWSPAN GEKLPSKRIN SLLGSTDGSL WIGTSVGLSR
WQDNRLISYS DLHGVTTAIF EDRDKAVWAA ISPSPSNTPL CQISDSAIHC YGTADGISPH
RLWPIAKDGE GNFWIGGDEL VRWNLKSHRS YEVIGAEHSN ASVSSIIPGS DGSLWVGIDT
KGPRFGLQRL AGGAWKPFVT PEFNGTTVAV NALLLDRDNA LWVGTASNGI YRIYDGAVEH
FSNADGLSSD FVYKFLEDAE GTVWAVTAKG IDNFRELKVT TFSTREGLAA EEVDSVFATH
DGGIWVGGPS SLEVLRNSRI SSVLAELHLS GAATSFLEDR THRLWIGIDD TLTVYDGRKV
QRIARSDGSP MGMVYSMTED TAGDLWVETR GRLTRIRGLK AVEELRPPQV PSAFRVVADA
KGGVWLGLAN GDLAHYQNGH AETFHFRHSS DTRVEQLDVN ADGSVVGATA DGLVGWRNGT
LATLTTQNGL PCDIAYASHF DRAENLWIYM QCGLVEIKKD QVQSWWQHPD AAVKYELFDV
FDGAQPGRAP FGGVTRDSEG RLWFASGVVL QTVDPEHLLS NSVLPPVQVE SIVADRRNYI
PQLGLRLPPL TRNLEIDYTA LSFVVPQKVY FRYKLEGRDE SWQESGTRRQ AFYTDLRPGN
YRFRVMASNN DGIWNEQGAT VAFSVAAAWY QTNLFRLFLL FTAIFIAWLL YQMRVRQIAK
AISARFDERL AERTRLAREL HDTFLQTLQG SKMVAEVALN GPADPVRMRN AIQRVLEWLD
KAIHEGRAAL HSLRSSTVMG NDLAEAFQRA TEDCRLQGIN EVSFVAEGIS TEMHPIIRDE
IYRIGYEAIR NACQHSEAGR LQVRLSYGAD LALRVSDNGK GIEPKIVTLG KDGHYGLQGM
RERAQRIGAK LLIESLPTSG TTLELIVPGH VVFQNPRSTW SSRLQKLTAF FRSPDNPA