Gene Acid345_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4055 
Symbol 
ID4072477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4795109 
End bp4796869 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content60% 
IMG OID637986086 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_593129 
Protein GI94971081 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.424185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.174789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGGG TTCGGCTGCG GACCAAGTTC CTGATCGCCA TGCTGCTCAC ATCCGCGGGC 
CTCACCATTG GCACTCTGCT GGTGGTGCAA CACACGGTTG CGGTGCGAGC ACGGGAAGGC
ATCGTCGCCG ACCTCCAGAA CTCCGTAGAG AACTTTCGCG CGGAACAACT AGAGCGCGAG
AAGAACCTGC GCGCTTCCGC GCGACTCCTC GCCGATCTCC CGATCGTGAA AGCCCTGATG
ACGTCGCGCC ACGCGCCCAC CATTCAGGAT GCTTCCGAGG AACTTTTTAA GCTTTCCGAG
CGAGACCTTT TCGCCCTGGT TGATCCAACT GGCCAGGTTG TCGGCTTCCA CACCAACCCT
GCCGGAGTGC CGTCGGCGCC GATCCAGCAC GTGCTCTCGC AACGCGCACC CACTGCAAAT
TCGTTCGAAT GGTGGTACAC CGGCGGACAC TTATACGAGG TGTATCTCGA GCCGATCTAT
TTCGGTCCGA GCAGCAACGA CACGCTGCTG GGTGTCATCG CAGTTGGATA CGAAATCAAT
GGTCCACTGG CGCGCCAGGT CGGAAGGCTG GCTGACAGTG AAGTCGCCGT GCTTTATCGC
GGAAATGTCG TCGCAAGCAC GCTGAACGAC CGGCCGTCGC TCGAGTTCCT GCAGCATACG
GGATCGAGCA ATTCCTCGCC TGCAGATGTC CTGCTGGGAC AGAAGAAGTT CGTCGCTACT
TCTATCCTCC TCAGCGCTGA CGATTCCACG CCAGTGACGA TGGTCGTGAT GAAGTCCTAT
GACAACGCCA TCTCCTTCCT GCAGCGTTTG CAGCGATTGC TTCTTGTAAT TGGGATTGCC
GCGGTACTTG CGGGCAGCAT CCTTGTTTAC TTCATCGCGC GAACCTTCAC CCGTCCGCTC
GAAACCCTTG CAGTAGGCGT GAGCGCTCTC GGCACCGGCG ACTTCGCGTT CCCTCTGCCG
GGAGGTGGTG GCGGAGAAGT GGTGAAGTTG ACGCAGGCCT TCGTGGATAT GCGTGACCGC
CTTCGCGCGA CGCAACAGAG CTTGATTGAG AGCGAGCGAC TGGCGACGAT CGGACGCATG
GCTGGTTCGA TCTCGCACGA CCTGCGCCAT CCGCTGACTG CCGTGCTCGC GAATGCGGAA
TTCCTCGCCG AAGCTAATCT CAACACGACA CAACGCGAAG AGCTGTACAT GGAAATCAGG
GTGGCGGTGA ATCGCCTGAC GGATCTTGTT GACTCTCTGC TTGAACTGTC GCGTCCGGCG
CAAGCGCTAA CGCTCACTGA GGGACCGATC GAAGGCAGCA TTTTGCGCTC GATCGAACTC
ATTCGGGCTC ATCCCGAGTT CCATAAGGTG AGTATCGAAG TCGAGGGCGC AGCGGGAGTG
GACACGCGGG TGGACGGGAG AAAGATGGAG CGCGTCTTCT ACAACCTCCT GTTGAACGCA
TGCCAAGCGG TGCAGAGTCG CGCGGGCAAG GTCGTCATCA GCGTCACGGA GAGCTCCGCG
GGAGTTGAAA TCCGGGTTCG TGACAATGGA CCGGGAGTGG AACCGTCGAT CGCGACAAAG
TTGTTTCAGC CATTCGTGAG CGTCGGCAAA GAAAACGGTA CCGGACTCGG GCTGACGATC
GCGCAGAAGA TCGTGCAGGA CCACGGCGGC TCGCTCGAAG TGGAGTGGTC GTCGCCGGGC
AACACCGTGA TGCGGATCGT CCTGCCGCAA CCAACGCGAT CAGACCGATG GCGCGCGCCC
CTCGCTAAGA GTTCTTTGTA A
 
Protein sequence
MGGVRLRTKF LIAMLLTSAG LTIGTLLVVQ HTVAVRAREG IVADLQNSVE NFRAEQLERE 
KNLRASARLL ADLPIVKALM TSRHAPTIQD ASEELFKLSE RDLFALVDPT GQVVGFHTNP
AGVPSAPIQH VLSQRAPTAN SFEWWYTGGH LYEVYLEPIY FGPSSNDTLL GVIAVGYEIN
GPLARQVGRL ADSEVAVLYR GNVVASTLND RPSLEFLQHT GSSNSSPADV LLGQKKFVAT
SILLSADDST PVTMVVMKSY DNAISFLQRL QRLLLVIGIA AVLAGSILVY FIARTFTRPL
ETLAVGVSAL GTGDFAFPLP GGGGGEVVKL TQAFVDMRDR LRATQQSLIE SERLATIGRM
AGSISHDLRH PLTAVLANAE FLAEANLNTT QREELYMEIR VAVNRLTDLV DSLLELSRPA
QALTLTEGPI EGSILRSIEL IRAHPEFHKV SIEVEGAAGV DTRVDGRKME RVFYNLLLNA
CQAVQSRAGK VVISVTESSA GVEIRVRDNG PGVEPSIATK LFQPFVSVGK ENGTGLGLTI
AQKIVQDHGG SLEVEWSSPG NTVMRIVLPQ PTRSDRWRAP LAKSSL