Gene Acid345_4541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4541 
Symbol 
ID4070220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5385169 
End bp5386107 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content58% 
IMG OID637986581 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_593615 
Protein GI94971567 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000250612 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.557772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT TTGTCCGTTC ATACGCGAAG ATCAATCTTG GGCTGCGCAT TGGGCCGCGG 
CGCGCGGATG GGTTTCACGA TCTGCGCACG ATGTACACGA CGATCGCGTT GCACGAGCTT
GTTTCGGTTG AAGTCGAAGA CGGCGACGGG ATCGAGATCC GGTGTGACGA TCCGCGGGTG
CCTTGCGATT CGACGAATAC GTGCCACAAG GCGGCGGCGC TGGTGTTGGA GGCGCTCGGT
CTGCGCTCTA AGGTATTGAT TTCCATCGAG AAGCGGCTGC CGGTACAGGG TGGACTCGGG
GCAGCTTCTG GGAATGCGAT TGCGACGATT TTTGGGCTGG AGCGGATGCT GGGGAAGGAG
CTTTCGGCCA AAGAAAGGGC AGAGATTGCC GAGAAGATCG GATCCGACCT TAACTTGTTC
TTATACGGTG GGTTAACGTT GGGTACCGGG CGCGGCGAGG AGGTCTGGCC GCTGCCGGAT
TTGCCATCGC TGCCGCTGGT GATCGTGACG CCGGAGGTCG GAGTGTCGAC GCCGGTGGCA
TTTAAGGCGT GGGATTCATT GACCCATCCG GAGGGCTCCG TTACAATAAA TGAGTTCAAC
CATCTCGTTT ACGAGTGGTT GTCTGCATCC GGTGTTCCCG CCTTGAGCGG GGACCGGGCC
GAGACGCTGC TTCTCGACCT TGTCCGAACC GGGATTTCGA ACGACTTCGA ACGCGTTGTC
TTTCCAGAAA TTCCCGTATT ACGAGAGGTC AAGTGTGCGC TCGAGCGCGA AGGCGCTTTG
TATGCGTCGC TTTCCGGCTC AGGTTCAACC TTGTATGGGT TGTTTCGTTC GTCTGCGGAA
GCATCGACAG CGGCGGAGCG GTTGAACTCG AGCGGGCTGA AAGCGACGGC GACACAGACC
TTGCCGCGTG AGCAGTACTG GCGGGAAATG TTTCAGTAA
 
Protein sequence
MSTFVRSYAK INLGLRIGPR RADGFHDLRT MYTTIALHEL VSVEVEDGDG IEIRCDDPRV 
PCDSTNTCHK AAALVLEALG LRSKVLISIE KRLPVQGGLG AASGNAIATI FGLERMLGKE
LSAKERAEIA EKIGSDLNLF LYGGLTLGTG RGEEVWPLPD LPSLPLVIVT PEVGVSTPVA
FKAWDSLTHP EGSVTINEFN HLVYEWLSAS GVPALSGDRA ETLLLDLVRT GISNDFERVV
FPEIPVLREV KCALEREGAL YASLSGSGST LYGLFRSSAE ASTAAERLNS SGLKATATQT
LPREQYWREM FQ