Gene Acid345_2399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2399 
Symbol 
ID4071397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2840288 
End bp2843683 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content61% 
IMG OID637984415 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_591474 
Protein GI94969426 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CGCGAGATTG GCGGGCCCAG CCGTTGCCCA GCAGTAAAGC GAACGAAATG 
GAAGACGCCG CAAAAATTTT CGTGCTCAGC GGCGATAGCG AACTTGCGCA GCTAATGCGC
GACCACGATT GGTCGCAAAC TCCCCTGGGT CCGGTGTCGC AATGGCCGCC GAACCTTCGC
ACCTGCGTCA ACCTCATCCT GAATTCGCAA CATCCCATGT GGATCGGCTG GGGCGACGAA
GCCACCTTCC TCTACAACGA CGCCTACATC CAGGTGCTGA GTTCCGCCAA GCATCCCTGG
GCGCTAGGGC GTCCAGCGTC GGAAGTCTGG GCGGAGATCT GGGACATCTG CGGTCCTCTC
GCCGCCAAGG TCTTCGAACG CGGTGAAGCG ACTTTCGTAG ATGAAGTGCG GCTCTTCATG
AACCGCGGCG ACTTCCTCGA AGAAACTTAT TACTCGTTCT CTTACAGCGC CATTTGCGAC
GATTCCGGCA AGGTCGTCGG ACTCTTCTGC CCGTCCGCGG AAGTAACGGC GCGCGTGCTG
AATGCCCGAC GGTTGCGCAC CCTGTCGGAG CTATCATCCT CGGCGTTCCT CGCCAAGACG
GTGGAAGATG CCTTCACATC TGCCTCGCAG ACCATCGCCG AGAATCCTGA CGACATCCCG
TTTTCGCTGC TCTACATCGT CTCGCCGGAG ACACTCGAAC TCGAACTGAA GGCGAACGCG
CGAACTCCGG AGAGTGTCGT CGAACGATTC TTCTCCCACG CGAACCTGGC GAACGCCGAT
AACGAAGCGG CGAAGGGAAT CAACGAAGTC CTCCGCCTCG GGGCTCCGCG GGTTATCTCG
ACCGACGATA TTCCTGACCT TCCGCTTGGC CTTGCTGACC AGCGCGTGAA GCAAGCGATT
GTGCTCCCGG TGAATTCGCG TGCGGAAGAC CGCGCCCTTG GCGTGCTCGT CGCCGGCATC
AATCCCACGC GGCAACTCGA TACCGAGTAT CGGACCTTCT ACTCGCTCGT CGCCGGGCAT
GTCGCGACGG CGATCGCCAA TGCGCGCGCC TATGACGAAG AGCGTCGCCG TGCCGAAGCG
TTAGCGGAAT TAGACCGCGC CAAGACCACG TTCTACAGCA ACGTGTCGCA CGAATTTCGC
ACGCCGCTCA CGCTCATGCT GGGCCCCCTC GAAGAATTAC TGGGTAAGTC CGAGCGCGCA
ATTCGTCCGG AAAACCATCA TCTCGTGGAG GTGGCGTATC GCAACGGCAC GCGGCTGCTG
AAGCTGGTGA ATACGCTGCT CGACTTCTCG CGCATTGAAG CGGGCCGCAT GAATGTCAGC
TTCCAGGCGA CCGACCTGGT GGCATTCACC GCCGAACTCA CCGCCCTGTT CCGCTCCGCC
ACAGACAAGG CCGGTCTGCG CTTGGAGATC GAGTCTGATG CCCTCCCGCA TCGCGTGTAT
GTGAACCGCG AAATGTGGGA AAAGATCGTC CTCAATCTGC TCTCCAACGC ATTTAAGTTC
ACCTTCGAAG GTGGCATCAC GATCGCGCTG CGCGATGGTG GCGACGGCGT TGCCGTATCG
GTTCGCGATA CAGGCGTTGG CATTCCACCA AATGAACTCC CGCGCGTCTT CGAGCGTTTC
CATCGAGTAG AAGGCGCGAA GGGGCGCAGT TTCGAAGGAT CGGGCATCGG GCTGTCGCTG
GTGCAGGAAC TGGTTAAGAC CCACGGCGGC ACCATCGAAG TTGAGAGCGA AGTCGGCAAA
GGAACGACAT TTCACATCCA CCTGCCCTAC GGCACCGAGC ATCTGCCTGC AAACCGCGTG
TCGGAAGAAG CCGGTTCGGG GAGTGCCCGC GCGATTGCCG TGCCGTATGT AGCAGAGGCG
TTGAGTTGGC TCGGGCCACA AGCGGCGGAA GCAGCCGCGG CGCAACTGGC CGAGGTGGGT
GCCACACTCG CAGCTCTCGC AGGCCGTCCG TCGTTGCTGC TGGCAGATGA CAACCGCGAC
ATGCGGGAGT ACATCGAGCG CCTGCTTGGG TCGCGCTATC GCATCCGCGC GGCGTCGAAT
GGTAAGCGCG CTCTGGAAAT GGCCACCGAA GACCCACCCG ACCTCGTACT CACCGATGTG
ATGATGCCGG AGATGGATGG CTTCGAACTG CTTGCGGCTC TCCGTGCCAA CAGCGCCACC
AGTACGATCC CCATCATCGT GCTCTCCGCG CGCGCAGGGG AAGAAGCGCA GATTGAAGGC
CTGCAACACG GCGCCGACGA CTACCTGGTG AAGCCGTTCA GCGCACGGGA ATTGGTCGCG
CGGGTCGAGA ACAACCTGCG GCTGTCGACC TTCCGGCGCG AAACCGAACA GCGCATCCGC
GAGAGTGAAG GACGTTTCCG CGCGCTGGTG TCGGCGACCT CCGATGTCGT CTACCGCATG
AGCGCCGACT GGAGCCAGCT GCGGCAGCTC GACGGGCGAG ACTATCTGGC GAACGTGGAG
CGCCCGCCCC AGACCTGGTT GCAGACCTAC ATTCATCCCG ATGATCAGCC GACGGTGCTC
AGCGCCATCC ACGAAGCCAT TCGCGATAAG AAGATCTTCC AGCTTGAACA TCGGATGTTG
CGGGCAGACG GCAGTCTCGG CTGGACGTTC TCGCGAGCGG TGCCGATGCT CGACAACAAA
GGTGAGATCG TCGAATGGTT CGGCGCGGCT ACGGACATCA CCCCGCGCAA AAACGCGGAG
GACGCGCTTC TGCGCAGTGA AAAGCTCGCG TCCGTCGGAC GCATGGCCGC GACCATCGCG
CACGAAATCA ACAATCCGCT GGAAGCCATC ACCAACACGC TGTTCCTCGC GCTCCACGCA
CAGGACCTTC CGGAATCAGC CCGCGAATAT CTCGAAATGG CAGACGGCGA ACTGCGGCGC
GTGGCGCACA TCACCCGCCA GGCGCTCGGC TTCTATCGCG AGTCGAATGC ACCGCGCCAG
ATCGCGTTAA ACGCCGTGAT GGATTCCACG CTCGACCTCC TGAAGAACCG AATTCGCACC
AAGAACGTCA CCGTTGAGAA GCAATGGGAC GGCGATGTTC ACGCCGAAGC TGTCGCCGGT
GAGATGCGCC AGGTGTTCTC CAACCTGCTC CAGAACAGCC TCGATGCCGT CGACGAGGGT
GGAGCGATCA AGGTGCGTAT CTCCACCATA CGGAACGGCA GTGCGGTGCG CGTGACGATC
GCCGATAGCG GGAAGGGAAT TGAAGCGAAT TTCCGCGAGC GAATTTTCGA GCCGTTCTTC
ACCACCAAGG GTGCGGTTGG AACCGGTCTT GGCCTATGGG TGACGCGCCA GATCGTGGAG
AAACACGGCG GACGAATTCG CGTGCATTCG CTGACCAGCG GCGATCAGCG GGGAACGTCG
TTCTCGATCG TGCTGCCGGT GCAAGCCGCG CACTGA
 
Protein sequence
MSTARDWRAQ PLPSSKANEM EDAAKIFVLS GDSELAQLMR DHDWSQTPLG PVSQWPPNLR 
TCVNLILNSQ HPMWIGWGDE ATFLYNDAYI QVLSSAKHPW ALGRPASEVW AEIWDICGPL
AAKVFERGEA TFVDEVRLFM NRGDFLEETY YSFSYSAICD DSGKVVGLFC PSAEVTARVL
NARRLRTLSE LSSSAFLAKT VEDAFTSASQ TIAENPDDIP FSLLYIVSPE TLELELKANA
RTPESVVERF FSHANLANAD NEAAKGINEV LRLGAPRVIS TDDIPDLPLG LADQRVKQAI
VLPVNSRAED RALGVLVAGI NPTRQLDTEY RTFYSLVAGH VATAIANARA YDEERRRAEA
LAELDRAKTT FYSNVSHEFR TPLTLMLGPL EELLGKSERA IRPENHHLVE VAYRNGTRLL
KLVNTLLDFS RIEAGRMNVS FQATDLVAFT AELTALFRSA TDKAGLRLEI ESDALPHRVY
VNREMWEKIV LNLLSNAFKF TFEGGITIAL RDGGDGVAVS VRDTGVGIPP NELPRVFERF
HRVEGAKGRS FEGSGIGLSL VQELVKTHGG TIEVESEVGK GTTFHIHLPY GTEHLPANRV
SEEAGSGSAR AIAVPYVAEA LSWLGPQAAE AAAAQLAEVG ATLAALAGRP SLLLADDNRD
MREYIERLLG SRYRIRAASN GKRALEMATE DPPDLVLTDV MMPEMDGFEL LAALRANSAT
STIPIIVLSA RAGEEAQIEG LQHGADDYLV KPFSARELVA RVENNLRLST FRRETEQRIR
ESEGRFRALV SATSDVVYRM SADWSQLRQL DGRDYLANVE RPPQTWLQTY IHPDDQPTVL
SAIHEAIRDK KIFQLEHRML RADGSLGWTF SRAVPMLDNK GEIVEWFGAA TDITPRKNAE
DALLRSEKLA SVGRMAATIA HEINNPLEAI TNTLFLALHA QDLPESAREY LEMADGELRR
VAHITRQALG FYRESNAPRQ IALNAVMDST LDLLKNRIRT KNVTVEKQWD GDVHAEAVAG
EMRQVFSNLL QNSLDAVDEG GAIKVRISTI RNGSAVRVTI ADSGKGIEAN FRERIFEPFF
TTKGAVGTGL GLWVTRQIVE KHGGRIRVHS LTSGDQRGTS FSIVLPVQAA H