Gene Franean1_4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4809 
Symbol 
ID5673150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5740854 
End bp5741846 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content70% 
IMG OID641243665 
Productaldo/keto reductase 
Protein accessionYP_001509081 
Protein GI158316573 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.419234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.447394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTACG TCAAGCTGGG GTCGACCGGC CTGGAGGTCT CCCGGGTCTG CCTGGGCTGC 
ATGAGCTACG GCACCCCGGG GGAGGGGAAC TGGCCGTGGT CTCTTGACGA GGACGCGTCG
CGGCCGTTCT TCCGGCGTGC GATCGAGGCG GGGATCAACT TCTTCGACAC CGCGAACGTC
TACTCGCTGG GCCGCAGCGA GGAGATCACC GGCCGGGCGC TGAAGGACTT CGCCCGCCGG
GACGAGATCG TGCTGGCCAC CAAGGTCCAC TCCCGGATGC GGTCCGGCCC GAACGGGGCC
GGCCTCTCCC GCAAGGTGAT CATGCACGAG ATCGACGCGA GCCTGCGGCG CCTCGGCACG
GACTACGTCG ACCTCTTCCA GATCCACCGC TGGGACGAGA CGACGCCGAT CGAGGAGACG
CTCGAGGCGC TGCACGACGT CGTGAAGGCC GGCAAGGCCC GTTACATCGG CGCCTCGTCG
ATGTACGCCT GGCAGTTCAC CAAGGCGTTG TTCATCTCCG AGCGGCACGG CTGGACCCGT
TTCGCGACGA TGCAGAACCA CTACAACCTG CTCTACAGGG AGGAGGAGCG GGAGATGCTC
CCGCTGTGCG CGGACCAGGG GATCGGCGTG ATCCCGTGGA GCCCGCTGGC CCGCGGCCGC
CTCACCCGCG ACTGGGACGC CACCACGACC CGCGCCGAGT CCGACCCCTT CGCCCGCGCC
TTCTACCAGG ACGACGACCG GCTGATCGTC GAGGAGGTCG CCCGCATCGC CGACGAGCGC
GGCGTGAGCC GGGCCCAGGT GGCGCTGGCC TGGGTGTCAC GCAATCCCGT CGTCACCGCG
CCGATCGTCG GCGCCACAAA GCCCGGGCAC CTCGACGACG CGCTCGCCTC CCTGGAGCTG
ACCCTCACCG ACGACGAGGC CGCCCGGCTG GAGGCCCCGT ACCGCCCACG GCCCGTCGCC
GGTATCCAGG TGCCGCAGCG GCGACGGCTC TGA
 
Protein sequence
MEYVKLGSTG LEVSRVCLGC MSYGTPGEGN WPWSLDEDAS RPFFRRAIEA GINFFDTANV 
YSLGRSEEIT GRALKDFARR DEIVLATKVH SRMRSGPNGA GLSRKVIMHE IDASLRRLGT
DYVDLFQIHR WDETTPIEET LEALHDVVKA GKARYIGASS MYAWQFTKAL FISERHGWTR
FATMQNHYNL LYREEEREML PLCADQGIGV IPWSPLARGR LTRDWDATTT RAESDPFARA
FYQDDDRLIV EEVARIADER GVSRAQVALA WVSRNPVVTA PIVGATKPGH LDDALASLEL
TLTDDEAARL EAPYRPRPVA GIQVPQRRRL