Gene Acid345_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3018 
Symbol 
ID4071573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3580991 
End bp3582559 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content57% 
IMG OID637985037 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_592093 
Protein GI94970045 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0381782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAT CCAGACCAAC TCGTGACACG GTACCTCCGA GCGCCGGGAG CATCGATCCG 
TCATTGTATG AGGCGGTATT TCACAACTCG CTCGACGGCA TCCTGATTGC CGACGACGAT
GCCCGTTACA TTGACGTCAA TGCAGCGGCC TGCGAAATCG TCGGCCGCAC GCGGAAGGAA
CTCATCGGCG AGCGCGTCGG GACTTTCGTT GAACATCCCA GCGACGCGCA AGTTCTTTGG
GAAAAGGCAA AGCAATCCGG GACCATCCGA GCGGAGATCG CGGTGCTTCG ACCCGACGGA
CAGAAGCGGT ATATCGAATT CACCGCCGTC GCGAATTTTG TTCCGGGTCG GCATCTCATC
CTGGTTCGCG ACTTCACTCA CCGGCGCAAT CTGGACGAAC AGATTCGCAA TGCTTCGAAG
ATGGAGGCGA TCGGACGCCT GGCCGGCGGC GTCGCCCACG ACTTCAACAA TCTGTTGATG
GTCATCACCA GCTATACCGA GTTGATGCTC GATTCCATGG CGGACTTCGA TCCGCTACGG
AAAAAGGCAC AAGAGGTCTT GAAGGCTTCG GCTCGTGCGG CCAGCCTCAC CCGCCAGCTT
CTTGCATTCA GCCGCAAGCA GGTACTCGAT CCTCAGTTCC AGGACTTCAA TATTTTGCTT
CGCGAGATGT CGAAGCTGCT TGGGCGGGTG CTCGGTGAAA ACGTGGAAGT AAAGCTCGAC
CTGAGGGAAA ACCTCGGCAG CGTCTATGCC GACCGCGGTC AGATCGAACA GATCATGATG
CATCTCGCGG TGAACGCCCG AGACGTCATG CCCGAAGGCG GAAGGTTTAC GGTCCGAACT
GCTAACGTGG ACTTCGACGG GTCCTACTCG CGCCTGCCGG GATCGCCTCC GCCGGGTGAA
TTCGTGATGA TGTCCGTCGA AGATACAGGG AATGGCATGA GCCGCGACGT TCTCTCTCAC
CTCTTCGAAC CGTTCTTCAG CACCAAGGCA ATGGGGAAGG GGACTGGTTT GGGGCTCGCC
GCTGTCTATG GCATCGTCAA GCAAAGTGGT GGTTTCATTT GGGTAGATAG CGAAGAAGGA
CATGGATCGC GTTTCAAGAT GTACTTCCCG CGCGCCGTCC AGGGTAAAGA AGAGCAACTC
GGCAGGCGTG CGTCGTTCAT TCCAGAGACC CGTCCCGCCG TCGTATTGCT GGTTGAGGAC
GAAGAGGCGC TACGCGCGGC TGCCGGCGAT TTCCTGGAGA CCCGCGGCTA CAAAATCATG
ACCGCCCGCG ACGGCACGGA AGCGCTGAGT ATGGCTTCCA AGTTCGCCGA ACGCATTGAT
GTATTGATCA CCGATCTCGT TATGCCCGGA ATCAGCGGCA GGGTTCTGGC GCAGGAACTA
GTTAAAATCC ATCCCGAGAC TAAGGTCATG TACATGTCGG GATACGACGA CGAGACCGTT
ATGGTGAATG GGGAGATAGA TTCCTCGAGC GCATTTCTAA GGAAGCCCTT CCGCATGGAT
GCACTCTCCG CTAAGATTCG TGAAGTTCTC GGCGAAGAGT CTCGCAGTAG CGGCAGCGGA
CCAACATAA
 
Protein sequence
MAESRPTRDT VPPSAGSIDP SLYEAVFHNS LDGILIADDD ARYIDVNAAA CEIVGRTRKE 
LIGERVGTFV EHPSDAQVLW EKAKQSGTIR AEIAVLRPDG QKRYIEFTAV ANFVPGRHLI
LVRDFTHRRN LDEQIRNASK MEAIGRLAGG VAHDFNNLLM VITSYTELML DSMADFDPLR
KKAQEVLKAS ARAASLTRQL LAFSRKQVLD PQFQDFNILL REMSKLLGRV LGENVEVKLD
LRENLGSVYA DRGQIEQIMM HLAVNARDVM PEGGRFTVRT ANVDFDGSYS RLPGSPPPGE
FVMMSVEDTG NGMSRDVLSH LFEPFFSTKA MGKGTGLGLA AVYGIVKQSG GFIWVDSEEG
HGSRFKMYFP RAVQGKEEQL GRRASFIPET RPAVVLLVED EEALRAAAGD FLETRGYKIM
TARDGTEALS MASKFAERID VLITDLVMPG ISGRVLAQEL VKIHPETKVM YMSGYDDETV
MVNGEIDSSS AFLRKPFRMD ALSAKIREVL GEESRSSGSG PT