Gene Franean1_1093 details


Gene Information

Locus tag: Franean1_1093
Symbol:
ID: 5669507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1304976
End bp: 1306565
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 72%
IMG OID: 641240025
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_001505455
Protein GI: 158312947
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
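
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not counted."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1304976, 1306565)
print(length, protein_length_aa(length))  # prints: 1590 529
```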


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGTCG TACTCGTCGC CGAGGAACTC TCACCGGCCG GGCTGGAGGT CCTGTCCGGG 
GACTTCGAGA TCCGCCATGT GGACGGGGCC GACCGGTCCG CCCTGCTGCC GGCGCTGGCG
GACGTCGACG CGGTCCTCAT CCGCTCGGCG ACGAAGATCG ACGCCGAGGC GCTCGCGGCC
GCCCCGCGGC TGAAGGTCGT GGCCCGCGCC GGGATCGGGC TCGACAACGT CGACGTCCCC
GCCGCCACCA ACCGCGGCGT CATGGTGGTG AACGCGCCGC AGTCGAACAT CGTCAGTGCC
GCCGAGCACG CCATCGCACT GCTGCTCGCG GTCGCCCGCC GGGTCCCGGC CGCCCATGAG
TCGCTCGTCG GCGGTGAGTG GAAGCGCTCG AAGTACGTCG GCGTGGAGCT GACGGAGAAG
ACCGCGGGCG TCGTCGGCCT CGGCCGCATC GGTGTCCTGG TCGCGCAGCG GCTGGCGGCC
TTCGGCATGA AGGTCCTGGC CTACGACCCC TATGTCTCCG TCGCCCGCGC CTCGCAGCTC
GGTGTGCGCC TGGTGGACCT CGACGAGCTG CTCACGTCCA GCGACGTCAT CACGATCCAC
CTGCCGAAGA CACCCGAGAC GCTGGGGCTC ATCGGGGCCG ACGAGCTGGC CCGGGTGAAG
CCGGGCGTGA TCATCGTCAA CGCGGCGCGC GGCGGCCTGG TCGACGAGGG CGCCCTGGCC
GACGCGGTCC GGTCCGGCCG GGTCGGCGGT GTCGGGCTCG ACGTGTACGT CAAGGAGCCG
ACCACCTCCT CGCCGCTGTT CGGGCTGGAG AACGTCGTCG TCACCCCGCA CCTGGGCGCC
TCGACGCAGG AGGCGCAGGA CAAGGCCGGT CTGGCCGTGG CCCGTTCGGT GCGCCTCGCG
CTCAGCGGCG AGTTCGTCCC GGACGCGGTG AACGTGCAGG CCGGCGGGGT CGTGGCCGAG
GACGTGCGGC CCGGTCTGCC GCTGGCGGAG AAGCTGGGCC AGCTCTTCTC CGGGCTGGCC
GCGGGCGTGG CCGCCGCGAT CACCGTCGAG GTGCGCGGCG AGATCGCCGC GCACGACGTG
TCGGTGCTGC AGCTCGCCGT CCTCAAGGGT GTCTTCATCG ACATCGTCGA GGAGCAGGTC
ACCTACGTGA ACGCGCCGCT GATCGCCAAG GAGCGCGGCG TCGACGTGGC GCTGGAGACC
TCCGAGGAGA GCCCCGACTA CCGCAACCTC GTCACGGTGC GCGGTGTCCT GCCCGACGGG
ACGGCGGTGT CGGTCAGCGG GACGCTCGTC GGCTCCCGCC AGGTCGAGAA GATCACCGCG
ATCGACGGGT TCGAGGTCGA CCTGCGTCCC GAGGACCACC TGGCGTTCTT CCGTTACGAG
GATCGTCCCG GCATCGTCGG GGCCGTCGGC GCGCTGCTGG GCGAGGCCCA CATCAACATC
GCCAACGCTC AGGTCAGCCG GCTCAGCGCC GGTGGCGAGG CCCTCATGTC GCTGTCCCTG
GACGACGCGG TGGCGCCCGA CATCCTGGCC GAGATCGCCA AGATCATCGG TGCGTCGTAC
GCCCGCGCGG TGAGCATCTC CGCGGGCTGA
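
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as plain text in the space-separated blocks of ten used here. The function name `gc_content` is hypothetical, and the example call uses a toy input rather than the gene itself.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input for illustration (4 of 8 bases are G or C).
print(gc_content("GGCC AATT"))  # prints: 50.0
```

Pasting the full gene sequence into `gc_content` and rounding the result would give a figure to compare with the GC content field.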
 
Protein sequence
MPVVLVAEEL SPAGLEVLSG DFEIRHVDGA DRSALLPALA DVDAVLIRSA TKIDAEALAA 
APRLKVVARA GIGLDNVDVP AATNRGVMVV NAPQSNIVSA AEHAIALLLA VARRVPAAHE
SLVGGEWKRS KYVGVELTEK TAGVVGLGRI GVLVAQRLAA FGMKVLAYDP YVSVARASQL
GVRLVDLDEL LTSSDVITIH LPKTPETLGL IGADELARVK PGVIIVNAAR GGLVDEGALA
DAVRSGRVGG VGLDVYVKEP TTSSPLFGLE NVVVTPHLGA STQEAQDKAG LAVARSVRLA
LSGEFVPDAV NVQAGGVVAE DVRPGLPLAE KLGQLFSGLA AGVAAAITVE VRGEIAAHDV
SVLQLAVLKG VFIDIVEEQV TYVNAPLIAK ERGVDVALET SEESPDYRNL VTVRGVLPDG
TAVSVSGTLV GSRQVEKITA IDGFEVDLRP EDHLAFFRYE DRPGIVGAVG ALLGEAHINI
ANAQVSRLSA GGEALMSLSL DDAVAPDILA EIAKIIGASY ARAVSISAG
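
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates the idea with a hand-built codon table. The name `translate_cds` and the `START_CODONS` set (a subset of the table 11 initiators, chosen for this example) are assumptions, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption for this sketch: only these initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0; an initiator codon at position 0 is read as M."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCCGTCG TACTCGTCGC CGAGGAACTC"))  # prints: MPVVLVAEEL
```

The output matches the first ten residues of the protein sequence listed above.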