Gene Franean1_3875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3875 
Symbol 
ID5672238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4605029 
End bp4606039 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content74% 
IMG OID641242753 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001508173 
Protein GI158315665 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.893798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0397379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGG CCACCTCCGG CGCCGGCGGC CCGGCCACCG GTGCCGGCGC GGTGGCCGGG 
CCCGGGTTCG ACCGGGAGTC GACGGCGCTG GAGGTGGCCC GCAGTGCCGA CCTGCGTGGG
CGCGTCGCGG TGCTGACGGG GGCCTCCTCC GGCATCGGCG TCGAGACGGC GCGCGCCCTG
GCAGCCACCG GCGCCGACGT CGTGCTGGGC GTTCGGGATG TCGCCGCCGG CGAGGAGCTG
GTCCGCGAGG TGCGGGCCGG CGCCACAGGC GACATCCGGG CGGAGCGGCT GGACCTGAGC
GATCTCGGTT CGGTCGTCGC GTTCGCCGCC CAAGTCACTG GCCCCGTCGA CCTGCTGATC
GCCAACGCGG GGGTCTCCAG GACCCCGGAG TCACACCTGC CCAACGGGCT CGACGTCCGC
TTCGCGACGA ACCACCTGGG CCACTTCCTG CTGGCGCTGC GCCTGAGCGA ACAGCTCGCC
GAGCGTGGAG CGCGGATCGT CGTGGTCAGC TCGGGCGCGC ACAGGAGCAT CCCCGTCCGC
CTCGACGACC TGCAGTGGAC CGCCCGGCGG CACAACCCGG GGATGGCCTA CGCCGAGTCG
AAGACCGCGA ACATCCTCTT CGCCCAGGAG GCGACCCGCC GGTGGGGACC CGACGGGATC
TTCGCGAACG CGGTGCTGCC CGGCTCGGCG CTGACCGGCC TGCAACGCTT CCACGGGGAC
GAGATGAAAC GCCGGATCGG CTTCCTCAAC GAGGACGGAT CTCCCAACCC GGTGCTTAAA
TCCCCTGCCC AGGCCGCGGC CACGACCCTC TGGGCCGCCA CGGCCCCGGA ACTGGCCGGG
CGTGGCGGCC TCGTCCTCGA GGACTGCGCA GAGGCGCTAC CCCCCGGCCC GCCCGGCTCG
GACGTCCTGG TCCGCTCGGG CTTTGACCCC TCGGTCGCCG ACCCCGACAC GGCCCGCCGC
CTGTGGGACC GCTCCATCGA GCTGCTCCGG GTCCTCGGCC GACCAGAATG A
 
Protein sequence
MTAATSGAGG PATGAGAVAG PGFDRESTAL EVARSADLRG RVAVLTGASS GIGVETARAL 
AATGADVVLG VRDVAAGEEL VREVRAGATG DIRAERLDLS DLGSVVAFAA QVTGPVDLLI
ANAGVSRTPE SHLPNGLDVR FATNHLGHFL LALRLSEQLA ERGARIVVVS SGAHRSIPVR
LDDLQWTARR HNPGMAYAES KTANILFAQE ATRRWGPDGI FANAVLPGSA LTGLQRFHGD
EMKRRIGFLN EDGSPNPVLK SPAQAAATTL WAATAPELAG RGGLVLEDCA EALPPGPPGS
DVLVRSGFDP SVADPDTARR LWDRSIELLR VLGRPE