Gene Acid345_2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2431 
Symbol 
ID4072865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2870504 
End bp2871535 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content61% 
IMG OID637984447 
ProductGHMP kinase 
Protein accessionYP_591506 
Protein GI94969458 
COG category[R] General function prediction only 
COG ID[COG2605] Predicted kinase related to galactokinase and mevalonate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.696498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.378909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGTA AGAAACCCGG CTCTCCTCAA CAGGTAATTG CCGAGGCGTG CTGTCGCGTG 
GACCTCGCCG GCGGCACCCT CGATCTGTGG CCTCTTTACC TTTTTCATAA AAACTCCGTC
ACGGTGAATT TTGGGGTCAA TATCATGACC CGCTGCCAGA TCACCGCCCG CGACGACGAC
CACATTTCGC TGATCTCAAA AGACACGCTG CGTGGCGACG ACTTCGAAGA CCTGAAGACG
CTGCGTGCGG CGAAAGAACA CCGTCATGCA CTCGCCGCGC AACTGCTGCG CTTCTTCGAG
CCGGACTGCG GCTTGAACCT GGAGACGAAT TCCGAATCGC CCGCGGGCGC GGGAATCTCC
GGTTCGTCGG CGCTGATGAT CGCCATTACC GCGGCGCTGG CGCGGTTCAC CGGTCGCAAG
CTCACGCTGG AGCAGATTCG CACCATCTCG CAAAACGTTG AAGCGCAGGT GATCAACGTT
CCTACCGGAT GCCAGGATTA CTATCCGGCG CTTTATGGCG GCGTGAACGC GGTGCATCTG
CAGCCGGATG GAATAATCCG CGAGGCGATT GATGTTGCAC CCGAGGAGAT CGAGAAGCGC
TTCGTGCTGA TCTATACCGG CGCGCCGCGG CAATCGGGGA CCAACAACTG GGAGGTCTTC
AAAGCGCACA TCGACGGCGA CAGCATTGTG CAGCGCAACT TCGACCGCAT CGCCGACATC
GCCGACAGCA TGCACCACGC GCTCGCCGCC CACGATTGGG ATGAAGTCGC GCGCCTGCTG
CGCGAAGAGT GGAAGCAGCG TCGAACGAAC GCGCCGAACA TCACGACGAA GTTCATTGAT
GAACTGATCG AAGTAGCCCG GAAGAAGGGC GCCCGCGCAG CGAAAGTCTG CGGCGCCGGC
GGCGGCGGCT GCGTGATCAT CATGACCCAC GAAGATTCCC GCGATAAAGT AAGCGCGGCG
CTGGCCGAAG CGGGAGCTAC GGTGTTGCCG TTGCAGGTGG CCCGGAAGGG GCTGCAGGTT
CGGAGTAAGT AG
 
Protein sequence
MARKKPGSPQ QVIAEACCRV DLAGGTLDLW PLYLFHKNSV TVNFGVNIMT RCQITARDDD 
HISLISKDTL RGDDFEDLKT LRAAKEHRHA LAAQLLRFFE PDCGLNLETN SESPAGAGIS
GSSALMIAIT AALARFTGRK LTLEQIRTIS QNVEAQVINV PTGCQDYYPA LYGGVNAVHL
QPDGIIREAI DVAPEEIEKR FVLIYTGAPR QSGTNNWEVF KAHIDGDSIV QRNFDRIADI
ADSMHHALAA HDWDEVARLL REEWKQRRTN APNITTKFID ELIEVARKKG ARAAKVCGAG
GGGCVIIMTH EDSRDKVSAA LAEAGATVLP LQVARKGLQV RSK