Gene Francci3_3958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3958 
Symbol 
ID3906917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4735611 
End bp4736642 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content75% 
IMG OID637881285 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_483037 
Protein GI86742637 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.604239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAGCCC CCGCCAAGGT CAACCTGCAT CTCGGCGTCG GACCGCTCCG ACCGGACGGC 
TACCACGACG TCATCACAGT GCTGCAGGCC GTCTCGCTGT TCGACGACGT CTCGGCGACG
TCCGTCGATC CCCCGCGGTT CCACGACCCG GGGCCATCGA CCACCGACGG GAACGGCGGG
TCCGACGGGA ACGGCCGGTC CAGCGGGGAC ATCGTCGTGA CCGTCGAGGT GTCCGGGGAG
GGCGCCGACC CGGCGTCGCT GGGTCCGGCG ACGTCCACGC CGGAGGTCTC CATCGTCCCC
ACCGGCCGGG ACAACATCGC CGTCCGGGCC GCTCACCTGG TTGCCGAGGC CGCGGGCATC
ACCTCGGAAC GGGTTCATCT CACCCTGACG AAGGGCATCC CCGTCGCCGC GGGGATGGCC
GGGGGCAGCG CCGACGCGGC GGCGGCGCTC GTCGCCTGCG ACGCGCTCTG GCAGACCGGC
CTGGACCGGG CGACCCTGAC CCGACTCGCC GCCCAGCTCG GCAGCGACGT CCCGTTCCCC
CTGGCCGGCG GCACCGCACT CGGCACCGGG CGCGGCGAGC AGCTCACCGA CGTCCTGGCG
ACGGGCGAGT ACTACTGGGT GTTCGCGCTC GCCGACGGCG GCCTGTCCAC CCCCGCGGTC
TACAAGGAGT TCGACCGGCT GACCGAGGGC AAACTGCGGA CCGGCCCGAC CCCCGCCGAC
GACGTGCTCG CCGCGCTGCG CACCGGCGAC CCCGGCCAGC TCGGAGCCGC CCTGGTCAAC
GACCTGCAGC CGGCAGCACT GCGGCTTCGG CCGTCCCTGC GCCGCGTCCT GGAGGCGGGC
CGGGAGCTGG GAGCCGTCGG GGCGATCGTG AGCGGCTCCG GCCCGACCTG CGCCTTCCTC
ACGGCCGGAG CGCAGGAGAG CATCGCGCTC GCGGCGAGCC TCGCCGGGAT GGGGGTCGCC
CGCGCGGTAC GCCGGGCCTC CGGGCCGGCG AGCGGCGCCA GGATGGTGGA GGGAGCAGGC
GAAGCGCCGT GA
 
Protein sequence
MRAPAKVNLH LGVGPLRPDG YHDVITVLQA VSLFDDVSAT SVDPPRFHDP GPSTTDGNGG 
SDGNGRSSGD IVVTVEVSGE GADPASLGPA TSTPEVSIVP TGRDNIAVRA AHLVAEAAGI
TSERVHLTLT KGIPVAAGMA GGSADAAAAL VACDALWQTG LDRATLTRLA AQLGSDVPFP
LAGGTALGTG RGEQLTDVLA TGEYYWVFAL ADGGLSTPAV YKEFDRLTEG KLRTGPTPAD
DVLAALRTGD PGQLGAALVN DLQPAALRLR PSLRRVLEAG RELGAVGAIV SGSGPTCAFL
TAGAQESIAL AASLAGMGVA RAVRRASGPA SGARMVEGAG EAP