Gene Franean1_4846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4846 
Symbol 
ID5673187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5811722 
End bp5812600 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content68% 
IMG OID641243702 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001509118 
Protein GI158316610 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.145119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCTCCA ACGCACCGCC GAACGCCCGG AAACCTGTGG GCCGGGTCCT CGTCGTCAGT 
GGCGGGACAG ACGGGATGGG CCGCGCCCTC GCACTCGCTC GCGCCGATCG CGGCGACCAG
GTCGTCGCGA TCGGCAGCAA CCCGACAAAG GGCGAGGCGT TGCTGGCCGA GGCCGCCCGG
CGTGGCGTGG CCGGGCGAGT CCGCTTCGTC CGGGCTGACC TCGCGACTGT CGCGGGCAAC
CGCCGTGTCC TCGAAGACAC TCTCGGCCAG CACGACAGGA TCGACGTGCT GGCACTGTTC
GCCAACCGGC AAGCACCCAA ACGAACCCTG ACAGCGGACG GGCTCGAAAG CACCTTCGCG
CTCTACTACC TCAGCCGCTA CGTCCTCAGC CATGGCTTCC GCGACGCGCT GGAGGCCAGT
GACGCTCCGG TCATCGTGAA TGTCGCCGGC GTCGGCATCA CCAAGGGATC GATCCACTGG
GACGACCTCC AACTGGAACG TGGCTACAGC ATGATCGCCG CGCAGCTGCA AGCAGCCCGA
GCCAACGACC TACTCGGCGT CGCCTACACC GAGCACGCCA ACAGCAAGGC GCGCTATGTG
CTCTACCACC CCGGATTCAC CAGGAGCGGA GACCTCAGCC CCCTGCCCGC GGCGCTGCGC
GCCAGCATCC GGGCCGCCGC GAGGATCTCG GCGCGCCCCA TCGCCGAATC GATCGGCGCC
ATCCACCACT TCATTGATGC GCCCCCTGCC GCAGGGTTGA CCGCGATCGA TCGGAACAAG
CACCTACCGC TGACGCTCGA AACCCTGAAT CCGCAGAACG CGGAACGCCT CGCGCGAGCA
ACCGAAGCGC TGGTCGCCGC GCTACCCAGC ACCCCGTAG
 
Protein sequence
MFSNAPPNAR KPVGRVLVVS GGTDGMGRAL ALARADRGDQ VVAIGSNPTK GEALLAEAAR 
RGVAGRVRFV RADLATVAGN RRVLEDTLGQ HDRIDVLALF ANRQAPKRTL TADGLESTFA
LYYLSRYVLS HGFRDALEAS DAPVIVNVAG VGITKGSIHW DDLQLERGYS MIAAQLQAAR
ANDLLGVAYT EHANSKARYV LYHPGFTRSG DLSPLPAALR ASIRAAARIS ARPIAESIGA
IHHFIDAPPA AGLTAIDRNK HLPLTLETLN PQNAERLARA TEALVAALPS TP