Gene Franean1_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5041 
Symbol 
ID5673377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6041357 
End bp6042328 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content69% 
IMG OID641243892 
Productalcohol dehydrogenase 
Protein accessionYP_001509307 
Protein GI158316799 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.750804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCTG CGGTGCATGC CAGGTACGGC CCACCGGAGG TTGTGGGAAT TCAGGAGGTC 
GACAAGCCCA CGGCCGGCGT CGGTGAGGTG CTGGTCAAGG TGCATGCGGC GACGGTGAAC
CGCACCGACT GTGGCTACCG GGCCGCCAGA CCGTTTATCG TGCGGTTCTT CGCGGGGCTG
GCCAGGCCGA AGGCGATGAT CCTCGGGAAC GAGTTCGCCG GGGTGGTCGA GGCCGTCGGC
ACCGATGTCA CCACCTTCCT GGTCGGGGAC GGCGTCTTCG GCTACAACGA GGGTCCCTTC
GGAGCCCACG CGGAGTACTT GGCCGTCCGT GCCGACGGCC TGCTCGCGCA CGTACCGGCG
GGGGTGGCCT TCGAGCAGGC CGCCGCTGCC ACCGAGGGCG CGCACTACGC CCTGTCGTTC
ATCACCAAGA TCCCGGCCTG GGACGGGGCG CGGATCCTGG TCAACGGGGC GACTGGGGCC
ATCGGTTCGG CGGCGGTCCA GCTCCTGAAG TGCCGCGGCG CCCAGGTGAC CGCGGTATGC
GGCCCGGACG GCGTCGACCA GGTGCGAGAG CTGGGCGCCG ACCGGGTCAT CGACCGCACG
ACGTGCGACT TCACCAGGGA CGAGCATGTC TACGACGCCG TCTTCGACGC GGTCGGCAAG
AGCTCGTTCG GCCGGTGCAG ACGGCTGCTG CGTCCCGGCG GGGTGTACTC CTCGACCGAG
CCCGGCCGGT TCGCGCAGAA CCTGGTGCTG GCGATGCTCA CCCCGCTGCT GCGTGGCAGG
AAGGTGCTGT TCCCGCTCCC GTCGATCGAC AGGAAGACGG TGGAATACAT CCGGGACCTG
CTCGCTTCGG GACGGTTCCG GCCGCTTCTC GACCGGCGGT ACCCGCTGGA GCAGATCGTG
GAGGCCTACC GGTACGTCGA GTCCGGGCAG AAGATCGGCA ACGTTGTGAT CGCGGTCCGG
CCCTCGGAAT GA
 
Protein sequence
MRAAVHARYG PPEVVGIQEV DKPTAGVGEV LVKVHAATVN RTDCGYRAAR PFIVRFFAGL 
ARPKAMILGN EFAGVVEAVG TDVTTFLVGD GVFGYNEGPF GAHAEYLAVR ADGLLAHVPA
GVAFEQAAAA TEGAHYALSF ITKIPAWDGA RILVNGATGA IGSAAVQLLK CRGAQVTAVC
GPDGVDQVRE LGADRVIDRT TCDFTRDEHV YDAVFDAVGK SSFGRCRRLL RPGGVYSSTE
PGRFAQNLVL AMLTPLLRGR KVLFPLPSID RKTVEYIRDL LASGRFRPLL DRRYPLEQIV
EAYRYVESGQ KIGNVVIAVR PSE