Gene Franean1_3465 details


Gene Information

Locus tag: Franean1_3465
Symbol:
ID: 5671836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4095632
End bp: 4096792
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 76%
IMG OID: 641242353
Product: luciferase family protein
Protein accession: YP_001507773
Protein GI: 158315265
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03558] luciferase family oxidoreductase, group 1
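
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Minimal sketch: check that the reported lengths agree with the coordinates.
# Assumptions: 1-based inclusive coordinates; the CDS includes its stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(4095632, 4096792)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```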


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.469803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGGTA CGGCCCCCCA ACCCCGGCCT GCGAGCGACG TGCCACTGGG CGTGCTCGAC 
CTCGCCCCGA TCAGTGCGGG CGAGAGCGTC GCGGACGCGC TGCGCAACAG CCTCGGCCTG
GCCCGGCACA CCGACCGGCT CGGCTACCAC CGGTACTGGG TCACCGAGCA CCATGTGAAC
CCGAGCACGG CCGGGCTCTC CTCGACCCTG CTGACCGCCC TGGTGGCCGA CGCCACCACC
CGCATCCGGG TCGGCTCCGG CTCGTTGCAG CTGGGCCACC GCACCGCGCT GTCCGTCGCG
GAGGAGTGGG GCCTGCTCGA CGCGCTCCAC CCGGGCCGGC TGGACCTGGG CCTGGGCCGC
GCCGGCTGGC ACCCCCCGGC GCCCGTCGCC GGCGACGCGG AGGCCACCGG CGACGCCCCG
GCCGGACCGG CCGGACCCGT CGAGCCGGAC GAGCCGCCCG CGCCGTACCC CCGGCTGCGC
GGCTCGGACC TGTTCGCCCT GCACCGGCTG CTGCTGCCGC TCAACCCGCT CAACAGCCCC
GACTACCTCG ACCAGGTGAC CGAGGTGGTG GCCCTGCTCG ACGGCACCCG CCGCGCCGGT
GACCTGCGGG TCCACGCCGT CCCCGGCCAC CAGGCCGAGG TGGCGGTCTG GGTTCTCGGA
CGCACCGCCG GCGAGAGCGC GGAGGTCGCC GGCCGGCTGG GCCTGCCCTA CGCGACGAAC
TACCACGCGA GCCCGTCCAC CACGGCGGAC TCCGTCGCCG CCTACCGGAA GGCGTTCCGC
GCGTCGGAGA CCCTGGCCCG CCCCTACGTG ATCGTCACGG CGGATGTGGT GGTCGGACCC
GACGACGAGT CCGCCCGGCG GGCGGCCGTC GGCCACGACC AGTGGAGCCT GGCCAACCGC
GTCGGCGAGC CGACCGTGTT CCCCTCGCCC GAGGAGGCGT CCGCCTTCCC GTGGGAGCCG
GCGGACCGCG AGCTGGTCGC CGACCTGGGA CGGAGCCGGC TGGTCGGGAC GGCGGAGCAC
GTCGCCACCG GGCTGCGCGA GCTGCGCGAC CGCTTCGACG CCGACGAGCT GCTGGTCACG
ACCACGACCT TCGCCCAGGC CGACCGGCTC CGCTCCTACG AGCTGCTCGC CGAACAGTGG
CGCCTCCCGC GAGCCGACTG A
 
Protein sequence
MTGTAPQPRP ASDVPLGVLD LAPISAGESV ADALRNSLGL ARHTDRLGYH RYWVTEHHVN 
PSTAGLSSTL LTALVADATT RIRVGSGSLQ LGHRTALSVA EEWGLLDALH PGRLDLGLGR
AGWHPPAPVA GDAEATGDAP AGPAGPVEPD EPPAPYPRLR GSDLFALHRL LLPLNPLNSP
DYLDQVTEVV ALLDGTRRAG DLRVHAVPGH QAEVAVWVLG RTAGESAEVA GRLGLPYATN
YHASPSTTAD SVAAYRKAFR ASETLARPYV IVTADVVVGP DDESARRAAV GHDQWSLANR
VGEPTVFPSP EEASAFPWEP ADRELVADLG RSRLVGTAEH VATGLRELRD RFDADELLVT
TTTFAQADRL RSYELLAEQW RLPRAD
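
The protein sequence corresponds to the gene sequence translated with translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal standard-library sketch of that translation, together with a GC-fraction helper. The function names are hypothetical, and the start-codon set is an assumption restricted to the three most common bacterial start codons rather than the full table 11 list.

```python
# Minimal sketch of CDS translation (table 11 style) and GC fraction.
# Table 11 assigns the same amino acids as the standard code; the difference
# handled here is that an alternative start codon is translated as M.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Assumption: only the most common start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the display and upper-case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 nt of the gene sequence shown above.
first_line = "GTGACTGGTA CGGCCCCCCA ACCCCGGCCT GCGAGCGACG TGCCACTGGG CGTGCTCGAC"
print(translate_cds(first_line))  # MTGTAPQPRPASDVPLGVLD
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be compared against the record.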