Gene Franean1_6998 details

Gene Information

Locus tag: Franean1_6998
Symbol:
ID: 5675309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8525789
End bp: 8527429
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID: 641245844
Product: 2-alkenal reductase
Protein accession: YP_001511235
Protein GI: 158318727
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
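
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(8525789, 8527429)
print(length, protein_length(length))  # prints: 1641 546
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.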


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACCT ACGGTATCGA CCTGGGGACG ACCTATTCGT GTGTCGCCTA CATCGACGAC 
ACAGGGCGGC CCGCCGTCAC GAAGAACATG GTCGGCGAGG ACACCACGCC CTCGGTGGTC
TACTTCGAGA CTCCTGACAA CGTGGTCGTC GGGCGGGACG CGAAGAACTC GGCGAAGCTC
GAGCCCGACC TCGTGGTGTC GCTCATCAAG CGGCAGATGG GCCAGGACGT CGAACTCTCC
TTCCACGGGA GCACGCACAC TCCGGAGAGC ATCTCCGCAC TGATCCTCAG CGAGCTGGCG
CGAGCGGCGA CGGAGTTCAC CGGCGAACCA GTCCGGGACG TCGTGGTAAC GGTGCCCGCG
TACTTCGGCG TGGCCGAGCG GGAGGCCACC CGGAACGCGG GGCGGATCGC CGGCCTGAAC
GTCCTGAACG TCGTGCCGGA ACCGGTCGCC GCCGCCCTGC ACTACGAGGT CGTCGCCCCC
GCCGGCGAAC GCACGATCCT GGTCTACGAC CTGGGCGGTG GAACGTTCGA CACGACCGTC
ATCCGAGTCG CCGGCAACGA GATCAACGTC GTCTGCACCG ACGGCGACCA CCATCTCGGT
GGCGTGGACT GGGACGAGAG GATCGTCCGG TACCTGTTGG AGGGTTTTCT CGCCGAGCAC
CCCGATTCCG AGGCAGGCGA CAACGAGGAC TTCCTCCAGG AGCTCACGAT CGCCGCCGAG
GAGATGAAGA AGGCGCTGAG CAGTACCACG TCGCGCCGGC ACAACATGCG TTTCGGCGGC
GACACCGCAC GGCTCGAGCT GACCCGCGAG GAGTTCGAGC AGATCACCGG CGAGCTGCTC
GAACGGACCC TCGACATCAC CGAGCGGACC GTCACCACCG CCCGGGAGAA GGGTGTGACG
TCGTTCGACG ATGTCCTCCT CGTCGGCGGC GCCACCCGGA TGCCGGCCGT CGCGGCCCGC
CTGAACAAGC GATTCGGCTT CGAGACGAAG CTCCACGACC CCGACCTCGC GGTCGCGAAA
GGGGCAGCCC GGTTCGCGCT CATCGAGTCG GTGAAGCTCC AGCTCCCCGA GGACTCGGAC
GCGGCACCGG GAACGCAGGG CGACGCCCGT ACGTCCGCCT CGCCGGCGGC CGTGCAGCGG
GTCGCCGACC AGCTGGGCAT CACGACCGAA GCCGTCCGCC AGCTCGCGGA GAAGAAGGTC
CGGACGGTCG TGCCCCGCGC GTTCGGCATC AAGGTCGTGG ACAGCGCGGA CCCCGCTCTG
AAGCGGCTGA AGGTCGAGCA CATCCTGCCG CCCAACACAC CGCTGCCCGC CTCGCCCGAC
ACCGAGCGGT TCGGGACGGT CGAGGACAAC CAGACCGGGA TCGAGATCGA GATCTGGGAA
CAGGCCGGAG CCACGGTCTC CCCGGAACTC ACCGACAACG CGGCGATCGG CCGGGGTCTC
ATCAGCGGCC TGCCACCGCT GCCGCGTAAC TCCCCGATCG ACGTCACCTT CACCATGAAC
GAGACCGGCG TGCTGCGGGT ACACGCGGTT GAGCTGAAGA CGGGAAAGGA CCTGCACATC
GAGCTGCAGA TTCAGGGCCT CACCGAGGAA CAGGTCGAGA AGGCGCGCAA CGCCGTGGCA
CGGTACACGC TCAGCGAATA G
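
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a short demonstration.
first_blocks = clean_sequence("ATGGCAACCT ACGGTATCGA")
print(len(first_blocks), gc_percent(first_blocks))  # prints: 20 50.0
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field.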
 
Protein sequence
MATYGIDLGT TYSCVAYIDD TGRPAVTKNM VGEDTTPSVV YFETPDNVVV GRDAKNSAKL 
EPDLVVSLIK RQMGQDVELS FHGSTHTPES ISALILSELA RAATEFTGEP VRDVVVTVPA
YFGVAEREAT RNAGRIAGLN VLNVVPEPVA AALHYEVVAP AGERTILVYD LGGGTFDTTV
IRVAGNEINV VCTDGDHHLG GVDWDERIVR YLLEGFLAEH PDSEAGDNED FLQELTIAAE
EMKKALSSTT SRRHNMRFGG DTARLELTRE EFEQITGELL ERTLDITERT VTTAREKGVT
SFDDVLLVGG ATRMPAVAAR LNKRFGFETK LHDPDLAVAK GAARFALIES VKLQLPEDSD
AAPGTQGDAR TSASPAAVQR VADQLGITTE AVRQLAEKKV RTVVPRAFGI KVVDSADPAL
KRLKVEHILP PNTPLPASPD TERFGTVEDN QTGIEIEIWE QAGATVSPEL TDNAAIGRGL
ISGLPPLPRN SPIDVTFTMN ETGVLRVHAV ELKTGKDLHI ELQIQGLTEE QVEKARNAVA
RYTLSE
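
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted. This sketch does not handle alternative start codons; that is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order for each codon position.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGCAACCT ACGGTATCGA CCTGGGGACG"))  # first 30 bases: MATYGIDLGT
print(translate("CGGTACACGC TCAGCGAATA G"))           # last 21 bases: RYTLSE
```

The two outputs match the first ten and last six residues of the protein sequence listed above.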