Gene Francci3_3757 details

Gene Information

Locus tag: Francci3_3757
Symbol:
ID: 3906041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4505353
End bp: 4506330
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 71%
IMG OID: 637881083
Product: diacylglycerol kinase, catalytic region
Protein accession: YP_482837
Protein GI: 86742437
COG category: [I] Lipid transport and metabolism
              [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID:
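
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4505353
END_BP = 4506330


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon.

    Assumption: the CDS length includes one terminal stop codon.
    """
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 978 325
```

Both results agree with the Gene Length (978 bp) and Protein Length (325 aa) fields.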


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGGC TGTTGGTCGT CAACCCCGTC GCCACTGCTA CGACCGAACG TGTTCGTGAC 
GTCCTCGCCA GCGCGCTCGC CGCGGACCTC GCTCTGGAGA CTGTGGTCAC CAAGGGGCGG
GGGCACGGGG TGGAGCTGGG GGCCAGAGCG GCCGAACTCG GCGTCGACGT GGTGCTTGCG
CTCGGTGGGG ACGGTACCGT CAACGAGATC ATCAACGGAC TGCTCCAGAA CGGTCCGGTA
CCGGGCGGCC CAGCGTTCGC CGTGGTCCCG GGTGGCAGCA CCAACGTCTT CGCTCGCGCG
CTAGGCTACT CCGCGTCACC GGTGGAGGCG ACCGGCGAGC TGCTGAGCGC GCTCCGCGAG
GGCCGCACCC GGCGGATCAG CCTCGGCCGG GCTCAGTACG GCGACGAACG GCGCTGGTTC
ACGTTCTGTT TCGGTATCGG CCTGGATGCC CGGGTCGTCG CGCGGGTCGA GGAGAAGCGT
GGCAAGGGCA GGCGCAACAC CGCCGGCCTG TTCCTGCGGA CCGCGGCGGG TCAGATCCTC
CACGGCACCG ACCGCCGCGG TTCGCCCATC ACCCTGACGA CCGCCGACGC CGGCTCGGCC
CCCGCCCCCG GGACCACCGA CCCGGCGGGC GGGCGGGCCG AGGAGCACAT CGCCTTCTGC
ATCGTCTGCA ACACCCGCCC GTGGACCTAC CTCAACTCCC GTCCCGTCCT CGCCTGCCCG
GACGCGTCCT TCGACACCGG TCTGGACCTC CTCGCGCTGC GGCGGGTACG GTTGCCCAGC
GTGCTGCGGG CCGCCTCGCG GATACTCACC GACGGCGAGG GTCCGCGCGG TCGCAACGTG
GTGCGCCGCC ACAACACGTC GCGTATCCGC TTCTCCACGG ATACGCCCCT GCCGGTGCAG
ATGGACGGCG AGTATCTCGG GATGTTCTCC GAGGTGCTGC TGACCCACCA GCCGCACGCG
CTGCGCGTCG TCGCCTGA
 
Protein sequence
MRGLLVVNPV ATATTERVRD VLASALAADL ALETVVTKGR GHGVELGARA AELGVDVVLA 
LGGDGTVNEI INGLLQNGPV PGGPAFAVVP GGSTNVFARA LGYSASPVEA TGELLSALRE
GRTRRISLGR AQYGDERRWF TFCFGIGLDA RVVARVEEKR GKGRRNTAGL FLRTAAGQIL
HGTDRRGSPI TLTTADAGSA PAPGTTDPAG GRAEEHIAFC IVCNTRPWTY LNSRPVLACP
DASFDTGLDL LALRRVRLPS VLRAASRILT DGEGPRGRNV VRRHNTSRIR FSTDTPLPVQ
MDGEYLGMFS EVLLTHQPHA LRVVA
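
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequences are written in space-separated blocks of 10 characters, as shown above, and it uses the standard codon assignments. NCBI translation table 11 shares these assignments with the standard code and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and do not come from the database.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence above.
first_blocks = "ATGCGAGGGC TGTTGGTCGT CAACCCCGTC"
last_blocks = "CTGCGCGTCG TCGCCTGA"

print(translate(first_blocks))  # MRGLLVVNPV
print(translate(last_blocks))   # LRVVA
```

The two outputs match the first and last residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.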