Gene Acid345_4461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4461 
Symbol 
ID4070944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5293240 
End bp5294598 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content61% 
IMG OID637986500 
Producthistidine kinase 
Protein accessionYP_593535 
Protein GI94971487 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATGT TCGTCACCCT GTCGGTGATC ACGACCATGC TCTTTGTGCG CCTCCAGACA 
TCGGATTCCG CAATGGAAAC GATGACGCGA ACCGCCGTAC AGGCGTATCA GGCTGGCGGT
CCGGCATCGC TTCACAACTA CTTCCACACG ATCGAGCGCG ACCAGCTTTT CCGCGCGATT
TTGTTTGACG ATCAGGGTCA CGAGCTGACC GGCCGTCCAG CCCCGCGGTT CCTTGGGCCG
AATGGTGAAT ATGCGCCTCC ACCGCCACAG GGCCCAATAC CAGGGCCGCC TTCATTTGAC
GAACTGATTA AGCGTAACTT CCCGCGCCAC ACCATCCAGG CTGTAGATGG CCACAAATAC
ACTCTGATCC TGCTGCCGCC ATCGCGAGCG CACCTCTGGT TCCTCACCGC GCCAACGCGA
CTGATCGGTA TTGTGATCGG CCTATGCGCC ACTGGCATCA TCTGCTTCTC CCTGGCCCGC
TATGTAACCA AGCCATTGCA GCGACTTCGT GAAGCAAGTT CGAAGCTGGC TTCGGGCGAT
CTGTCGGCAC GCGCGGGCAA TGGCATTCAC CGGCGTGATG AAATCGGCAG CCTCGTTCAT
GATTTCGACC GCATGGCCGA CCGCATCGAA AACCTCATCA CCACCCAGCG CCGGCTCCTG
AGCGACATTT CGCACGAACT GCGTTCGCCA CTGGCGCGAT TGAACGTTGC CGTGGGACTC
GCCCGTCGCC AAGCTGACGT CGAGACGCAG AAGGCCCTCG AGCGCATCGA AATCGAAGCC
GACCGCCTCA ACGACATGCT CCAGAATCTG CTGACGCTTT CCCGGTTGGA GAGTGGCGAA
CCCGTTGAAA TGCGCACTAC GGTGGACATG AGCACTCTGG TGACAGACGT CGTTGCTGAC
GCCGATTTCG AGGCACAAGC ATTTGGACGC GAAGTGCATC TCAGCACCTG CGAACCCTGC
GAGGTTGAGG GGAACATCAC CCTCCTGCGC AGCGCGGTAG AGAACGTGGT CCGCAACGCC
GCGCGTTACA CCGACGAGAA CACAAAGGTT ACGGTCGCAC TGACCACTAG CGGCAATCAT
GCCGTCGTCG AAGTGCACGA CCAGGGGCCT GGCGTACCGG ACGAGTCGCT GCCAAAGTTG
TTCCTTCCCT TCTATCGCGT GGATGCAACC CGTGATCGCA ACACCGGCGG CGTCGGACTC
GGGCTCTCGA TTGCCGAGCG CGCCGTGCGG CTCCACGGCG GTTCAGTTGT GGCGAGGAAT
GGAAGGCCAC ACGGTCTGAT CGTGCGCATC GAACTGCCGC TGCTGGCCCA CGAGTCCGCC
CCAGTGAAGT CGGAACCAGC TGTGGTAAAG ACGACGTAA
 
Protein sequence
MAMFVTLSVI TTMLFVRLQT SDSAMETMTR TAVQAYQAGG PASLHNYFHT IERDQLFRAI 
LFDDQGHELT GRPAPRFLGP NGEYAPPPPQ GPIPGPPSFD ELIKRNFPRH TIQAVDGHKY
TLILLPPSRA HLWFLTAPTR LIGIVIGLCA TGIICFSLAR YVTKPLQRLR EASSKLASGD
LSARAGNGIH RRDEIGSLVH DFDRMADRIE NLITTQRRLL SDISHELRSP LARLNVAVGL
ARRQADVETQ KALERIEIEA DRLNDMLQNL LTLSRLESGE PVEMRTTVDM STLVTDVVAD
ADFEAQAFGR EVHLSTCEPC EVEGNITLLR SAVENVVRNA ARYTDENTKV TVALTTSGNH
AVVEVHDQGP GVPDESLPKL FLPFYRVDAT RDRNTGGVGL GLSIAERAVR LHGGSVVARN
GRPHGLIVRI ELPLLAHESA PVKSEPAVVK TT