Gene Franean1_4576 details


Gene Information

Locus tag: Franean1_4576
Symbol:
ID: 5672923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5458891
End bp: 5459958
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 72%
IMG OID: 641243439
Product: putative oxidoreductase
Protein accession: YP_001508855
Protein GI: 158316347
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03559] probable F420-dependent oxidoreductase, Rv3520c family
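
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5458891
end_bp = 5459958
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons = gene_length_bp // 3

print(span == gene_length_bp)            # coordinates agree with gene length
print(gene_length_bp % 3 == 0)           # length is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop equals protein length
```

Under these assumptions all three checks hold for this record.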


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.248726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.45285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGGTCG GGATGCCGCT GAAGTACTCC GGCGGCTTCA CCGAGACCGT CGCGGACCTT 
CGCGACTTCG AGGCGGCCGG GCTCGACCTG GTCATGCTCC CGGAGGCCTA CAGCTTCGAC
TCGGTGAGCC AGCTGGGCTA CCTGGCGGCC CGGACCTCGA CGGTGCTGCT GGCCACGAGC
ATTCTGAACA TCTACTCGCG CACCCCGGCC CTGCTGGCCA TGACGGCGGC CGGGCTGGAC
TACGTGTCCG ACGGCCGCTT CGTGCTCGGC CTGGGCGCGT CGGGGCCGCA GGTGATTGAG
GGGTTCCACG GCGTGCGCTA CGACGCCCCG CTCGGGCGCA CCCGCGAGGT CGTCGAGATC
TGCCGGGCGG TCTGGCGGCG CGAGCGGCTC AGCTACGAGG GCCGGCACTA CCACCTGCCG
CTGGATGCCG CGCACGGCGG CAGCGGCCTG GGGAAGCCGC TGAAGCTGAT CAACCACCCG
GTCCGGTCCG CGGTGCCGAT CGTGCTCGCC GCGCTGGGAC CCCGCAACGT CGAGCTGGCC
GCCGAGATCG GTGACGGGTG GGAGCCGATC TTCTACCTCC CCGAGGCGGC GCCGGCCGCC
TTCGGTGAGC CGCTGGGCCC GCTCGACATC GTGGTGCCCA CCCAGCTGCT GATCAGCGAC
GACGCCGACG AGATCGAGGC CGCGGTCCAG GCCGTGCGCG AGCACCTCGC GCTCTACGTC
GGCGGCATGG GCGCCCGGGG CCGGAACTTC TACAACGAGC TCGCCGGCCG CTACGGGTTC
GCGGCGGCGG CCGCCGAGGT GCAGGACCAC TACCTCGCCG GGCGCAAGGC GCAGGCCGCT
GCCGCGGTGC CGGAGCGGCT GGTGCGCGGT GTCTCGCTGA TCGGACCGCC CGGGTATGTG
CGGGAGCGGG TGGCGGCGTT CGCCGAGAGC GGGGTGACGA CGCTGAACGG GCTGCCGCTG
GCCGGCACCC ACCGCCGGCG GCTCGCCGAC GTCGAGCGGC TCAAGGAGTA CGTGTCGTCG
ACGCTTCCCG GAACTTATCG TGATAGGTAT AATGAGCTAA CTAATTGA
 
Protein sequence
MRVGMPLKYS GGFTETVADL RDFEAAGLDL VMLPEAYSFD SVSQLGYLAA RTSTVLLATS 
ILNIYSRTPA LLAMTAAGLD YVSDGRFVLG LGASGPQVIE GFHGVRYDAP LGRTREVVEI
CRAVWRRERL SYEGRHYHLP LDAAHGGSGL GKPLKLINHP VRSAVPIVLA ALGPRNVELA
AEIGDGWEPI FYLPEAAPAA FGEPLGPLDI VVPTQLLISD DADEIEAAVQ AVREHLALYV
GGMGARGRNF YNELAGRYGF AAAAAEVQDH YLAGRKAQAA AAVPERLVRG VSLIGPPGYV
RERVAAFAES GVTTLNGLPL AGTHRRRLAD VERLKEYVSS TLPGTYRDRY NELTN
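
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds the codon table from the standard NCBI amino-acid string (which translation table 11 shares for elongation codons) and assumes that an ATG, GTG, or TTG codon in the first position is read as methionine. The function name `translate` and the `is_start` parameter are hypothetical helpers, not part of the record or of any library.

```python
# Codon table built from the NCBI amino-acid string (base order T, C, A, G).
BASES = "TCAG"
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: these alternative start codons are read as Met when first.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_start=False):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if is_start and codons and codons[0] in START_CODONS:
        protein = "M" + protein[1:]
    return protein


# First 30 bases and last 48 bases of the gene sequence above.
head = "GTGAGGGTCG GGATGCCGCT GAAGTACTCC"
tail = "ACGCTTCCCG GAACTTATCG TGATAGGTAT AATGAGCTAA CTAATTGA"

print(translate(head, is_start=True))  # MRVGMPLKYS
print(translate(tail))                 # TLPGTYRDRYNELTN*
```

The two outputs correspond to the first ten and last fifteen residues of the protein sequence, followed by the TGA stop codon.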