Gene Franean1_4592 details


Gene Information

Locus tag: Franean1_4592
Symbol:
ID: 5672937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5473599
End bp: 5474459
Gene Length: 861 bp
Protein Length: 286 aa
Translation table: 11
GC content: 73%
IMG OID: 641243453
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001508869
Protein GI: 158316361
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
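
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5473599
end_bp = 5474459
gene_length_bp = 861
protein_length_aa = 286

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codon_count, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the fields agree with each other: 861 bp is 287 codons, one of which is the stop codon, leaving 286 amino acids.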


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.338469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACGCA TGGACGGCAA GGTCGTCTTC ATCACCGGTG CGGCGCGTGG CCAGGGGCGG 
GCGCACGCCG TCCGGGTAGC GGCGGAGGGA GGCGACGTCG TGGCCGTCGA CCTGTGCGCC
GACATCGCCT CCACGCCCTA CCCGATGGCG ACCCGGGACG ACCTGGACGA GACGGCCCGC
CTGGTCAAGG AGCGCGGCGG CCGCGTCGTC GCGCAGGTCG CCGACGTCCG CGACCGGGCC
GCGCTGGCCG CCGCCGTCGC CGAGGGCATC GCCCAGTTCG GCCGGTTGGA CGGCGTGGTG
GCCCAGGCCG GTATCTGCCC GCTCGGTACG ACGGCGCCGC AGGCCTTCGT CGACGCGGTC
AGCGTCGACT TCGGTGGCGT CTTCAACGCC GTCGACGTCG CCCTGCCCCA CCTGCAGCCC
GGGGCCTCGA TCGTCGCGAC GGGAAGCCTG GCCGCGTTGA TCCCCGGCAC ATTGGACAAC
GCGGCCAAGG GGTCCGGCGG CCTGGGCTAC GCCTGGGCCA AGCGGGCGGT GGCGTCGCTG
GTCCACGACC TCGCCGTCGT CCTGGCTGGT CAGAGCATCC GGGTGAACGC CGTCCACCCG
ACCAACGTCA ACACCGACAT GCTGAACAAC GACGTCATGT ACCGGGCGTT CCGCCCGGAC
CTGGCCGAGC CCACCCTCGA GGACGTGCTG CCGTCGTTCC CGGCCATGAC CGCGACAGGC
GACCCGTACG TCGAGCCCGA GGACATCGCC GACGCGGTCC TCTTCCTGCT CTCCGACGAG
TCCCGCTTCA TCACCGGCAC CCAGCTCCGC GTCGACGCGG GAGGTTACGT CAGGCTGCGG
CCGCAGGTGC CCGCCTTCTG A
 
Protein sequence
MGRMDGKVVF ITGAARGQGR AHAVRVAAEG GDVVAVDLCA DIASTPYPMA TRDDLDETAR 
LVKERGGRVV AQVADVRDRA ALAAAVAEGI AQFGRLDGVV AQAGICPLGT TAPQAFVDAV
SVDFGGVFNA VDVALPHLQP GASIVATGSL AALIPGTLDN AAKGSGGLGY AWAKRAVASL
VHDLAVVLAG QSIRVNAVHP TNVNTDMLNN DVMYRAFRPD LAEPTLEDVL PSFPAMTATG
DPYVEPEDIA DAVLFLLSDE SRFITGTQLR VDAGGYVRLR PQVPAF
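
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for elongation codons), and the first codon is assumed to be a valid start codon and is therefore always read as M, which is how the GTG at the start of this gene yields the leading methionine.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence begins at a valid start codon, so the
    first codon is reported as M even when it is GTG or TTG.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGGGACGCA TGGACGGCAA GGTCGTCTTC ATCACCGGTG CGGCGCGTGG CCAGGGGCGG"
print(translate_cds(first_line))  # MGRMDGKVVFITGAARGQGR
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` should allow comparison with the protein sequence and GC content fields listed in the record.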