Gene Franean1_3365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3365 
Symbol 
ID5671736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3990170 
End bp3991060 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content74% 
IMG OID641242253 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001507673 
Protein GI158315165 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATGT TCTTGCCGCC CTACGCAGAC CGGCCCGTCG CCGTCGTCAC GGGCGCCAGC 
TCGGGGATAG GCCGGGCGAC CGCCCGCACT CTCGCCGAGC AGGGCTGGCA GGTGATCGGC
GTCGGCCGTG ATCCGGTACG CAGCGCCGCC ACCGAGGCCG AGATCGCCGC GGCGGCGCGC
AAGGACGGCG GCTTCACGAT GCTGCGCGGC GACTTCACCC TCATGGCGGA TGTCCGGCGC
GTCGCCGGCG AGATCCAGCA CCTCACCCCG CGCCTCGACA TCCTCATCAA CAACGCCGGC
GGCGTGCGGG ACCGGCAGAT CATCAGCGCG GAGGGTACCG AGGCGACCTT CGCCGTCAAC
CACCTCGCCC CGTTCCTGCT GACCCGGGAG CTCACGCCCC TGCTGCGGGC GAGCGCGGCC
AGCCTGCCCG CCGGGTCGAC GCGGGTGATC GCCGTGTCGT CGAGCGGCCA CCGCGCGGTC
GACGGGCTCG ACTGGGACGA CCTGCAGAGC CTCGGCGAGT TCCGTCCCGC CGTCGCCTAC
TGCCGGGCCA AGCTCGCCAA CATCCTGTTC ACCCGCGAGC TGGCCCGGCG GGCCGGGCCG
GACGGGATCG TCGCCCAGGC CATGCATCCC GGCGTCGTGG CCAGCAACTT CAGCGCGCAC
GGCGACGCGG CGATGCGGGC GCACATGGCC GCGGCCGACA CGGTGCCACC CGACGAACCG
GCGGAGACGC TGGCCTGGCT GGCCACCGAG CCCGAAGGCG GCCGCACCGG CGGGCGCTAC
TTCCACCGCA GGGCGGAGGA GCTACCCGCC GAAGCCGCGC GGGACGATGC CGCCGCCGCC
CGCCTCTGGA CCGAGAGCGA GCGGCTGCTG GACGGGCTCG GCTTCCGCTG A
 
Protein sequence
MEMFLPPYAD RPVAVVTGAS SGIGRATART LAEQGWQVIG VGRDPVRSAA TEAEIAAAAR 
KDGGFTMLRG DFTLMADVRR VAGEIQHLTP RLDILINNAG GVRDRQIISA EGTEATFAVN
HLAPFLLTRE LTPLLRASAA SLPAGSTRVI AVSSSGHRAV DGLDWDDLQS LGEFRPAVAY
CRAKLANILF TRELARRAGP DGIVAQAMHP GVVASNFSAH GDAAMRAHMA AADTVPPDEP
AETLAWLATE PEGGRTGGRY FHRRAEELPA EAARDDAAAA RLWTESERLL DGLGFR