Gene Acid345_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3062 
Symbol 
ID4071969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3637087 
End bp3639075 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content60% 
IMG OID637985081 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_592137 
Protein GI94970089 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0701955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAAACC GTCTTCAAGA ACGCGCGCTC TGGACTGCCA TTATCGTGGC GATGGTGGCG 
GTATTGGCGG CACTTGCCGT GCTTCAGTAT CGGTGGAGCA AACGAGTTAC CGACGCGACT
GAAGCACGAA TCAATGCGAG CCTGAAGGCA TCCATACTGG ACTGGCACTT AAGCCTGCTG
AGGGAGATTT CGGAGCCTGC CTTCGCGATG CAGGTCAGCT CCGAGCCCGA CAATCGAGAG
AACTGGAACA CTTATTTTGA GCGCTATCGA GCGTGGCACG GAACGGCGAC GCGTCCTGAA
CTGTTCGCGA ACCTGTACCT GATTCGCGTG CCCAGGTCCG GCGAGGAAGC TGTGTTTCGC
CGCGATATCG GGGAGAAGCG TTTCCATCGT GAGAACCCGC CGCCGCGTTT TGCAGCGATG
ATGAATGAAC TGCGCCGGCA GTCGCTTGAT CCAGAGATGC TGGAGCGGCG AGTGACCGAC
GATCCACACA CGCGGGAAGA CAATCTTCGC GGGCAGATTC CACACGATCC GTTGTTCGGC
TGGCAATTCG AGCAAAACCT TCCAGCGCTG GTACATCCGC TCATCCATAA CGACGTAGAG
ACCGATGAGC GGGAGGGACA CGGAAAGCGC TTTCGCGCAG CGGATGCGGA GTTGCGCGAA
GAACAGGAAC ACGCGCGATT GCCAGAGCGG GATTTCGATC GCGACGATAC GGAGCCGGTG
ACGTGGATCG TGATTGAACT GGACGCCAAG GTACTCGAAC AGAAATTACT GCCGGAACTG
GCGCTCCATT ACTTCGGCGG CCCACAGGGA GCGAACTACG ACAACGCGGT GATTGCCGGG
CGAACGGAGC GGGTGTTGTA CTCGTCGTCG CCGGAATTTG TGCAGCGTCC GTTCGACCAC
CCGGATGCGG TGCTTGATAT TTTTGGTCCG CCTCCGACCA ATGGCACGCA GCTCTCGCAC
GATTTTGCCG GCGACTTCCA TGCCGGAGGC GCGAAGCGCG CAACCAACGT GGAGGAGATG
CGAAACATGG CGGCGCCGAC GTGGATGCCG GTGATGCAGG ATGGCGGGCA TGAGCCGCAC
TGGAACCTGG TGGTGAGACA CCGGCGTGGT TCTCTGGAGG CGCAAATGGC GGCGATCCGG
CGGCGGGACC TTGGAATCAG CCTTGGCATC CTGTTGCTGC TGGGCGCGAC GATGACGATG
CTGATTGTGG TGACGCGCCG AGCGCAGAGG TTGGCGCGGC TGCAAATGGA GTTTGTAGCG
ACCGTATCGC ACGAATTGCG TACCCCGCTG GCCGTGATTT GTTCCGCGGC CGATAACCTG
AGCGACGGCG TGGTCAATGG GCAGCAGCAG CTACAACGCT ATGGGGAGAT GATCAAAGTC
CAGGGGCGGC AATTGATCGC GCTGGTGGAA CAGATATTGT TATTCGCCGC GACGCGTGAA
GGACGACAGA CCTACCATCT GGAGCGCCTG GACGTCAGAA AGATTATTGA AGTGGTGGTG
TCGAACACAG CCGGATTGCT CGATGTTTCC GGAGTCAAGC TCGAATCAGA GATGGAGAGC
GGATTGCCAC AGGTGATTGC GGACCTGAGT GCGGTGTCGC AGTGTGTGCA GAACCTGGTG
ATCAATGCCG TGAAGTATGG CGGCGAAGCG AAGTGGGTGC GCGTTACAGC GCGCCGGAAG
GCGAGTCCGG AGGGACGGGA GATCTTGATT GGGGTAGAAG ACCGCGGCAT GGGCATTGCA
GCGGATGAGT TGGAACATGT CTTCGAGCCG TTCTATCGCA GTCCACAGAT CGCTGCCGCG
CAAATCCGCG GCACGGGCCT GGGATTGCCG CTGGCAAAAA GCCTGGCGGA GGCGATGAAC
GGATCATTGA CCGTGAGCAG TGAAGTCGGC AAGGGGAGCA CATTTACGCT GCACCTGCCG
TTTGCGGAAG ATTCGCCGCT GCGGACTGCG GCGGAAGCAG CAACTACGGG AGTGTTGACC
AGCGCATGA
 
Protein sequence
MRNRLQERAL WTAIIVAMVA VLAALAVLQY RWSKRVTDAT EARINASLKA SILDWHLSLL 
REISEPAFAM QVSSEPDNRE NWNTYFERYR AWHGTATRPE LFANLYLIRV PRSGEEAVFR
RDIGEKRFHR ENPPPRFAAM MNELRRQSLD PEMLERRVTD DPHTREDNLR GQIPHDPLFG
WQFEQNLPAL VHPLIHNDVE TDEREGHGKR FRAADAELRE EQEHARLPER DFDRDDTEPV
TWIVIELDAK VLEQKLLPEL ALHYFGGPQG ANYDNAVIAG RTERVLYSSS PEFVQRPFDH
PDAVLDIFGP PPTNGTQLSH DFAGDFHAGG AKRATNVEEM RNMAAPTWMP VMQDGGHEPH
WNLVVRHRRG SLEAQMAAIR RRDLGISLGI LLLLGATMTM LIVVTRRAQR LARLQMEFVA
TVSHELRTPL AVICSAADNL SDGVVNGQQQ LQRYGEMIKV QGRQLIALVE QILLFAATRE
GRQTYHLERL DVRKIIEVVV SNTAGLLDVS GVKLESEMES GLPQVIADLS AVSQCVQNLV
INAVKYGGEA KWVRVTARRK ASPEGREILI GVEDRGMGIA ADELEHVFEP FYRSPQIAAA
QIRGTGLGLP LAKSLAEAMN GSLTVSSEVG KGSTFTLHLP FAEDSPLRTA AEAATTGVLT
SA