Gene Franean1_5730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5730 
Symbol 
ID5674056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6963956 
End bp6965017 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID641244583 
Productluciferase family protein 
Protein accessionYP_001509986 
Protein GI158317478 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.859618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCG GAGTGGTGCT GCAGACCAAC CCGCCCGCGT CTCGGGTGGT CGAACTCGCC 
CGGCAGGCCG AGACGCTCGG CTTCAGCCAC GTGTGGACCT TCGACTCTCA CCTCCTGTGG
GAGGAGCCGT TCGTCATCTA CAGCCAGATC CTCGCGGCCA CCCGCAAGGT CAAGGTCGGC
CCGATGGTCA CCAACCCGGC GACCCGCGAC TGGACGGTCA CCGCCTCCCT GTTCGCCACG
CTCAACGAGA TGTTCGGCAA CCGCACGATC TGCGGGATCG GCCGCGGCGA CAGCGCCGTC
CGGGTCCTCA ACGGCCGGCC GACGACGCTG GCCACCCTGC GCGAGTGCGT CGCCGTCGTC
CGCGCCCTCG CCAACGGGCG GGAGGCGGAG GTGAACGGCG CGAAGCTGCG CTTCCCATGG
GGCACCGACA GCCGGCTGGA CGTCTGGATC GCCGCCTACG GCCCGAAAGC CCTGGCCCTG
GCCGGCGAGA TCGGCGACGG ATTCATCCTG CAGCTAGCCG ACCCCGACAT CGCCGCGTGG
ACCATCCGGG TGGTGCGCGA GGCCGCCGAG AAGGCCGGCC GTGACCCGGC GTCGGTGCGG
TTCTGCGTCG CCGCGCCCGC CTACGTCGGC GACGCCGACC CGCTGTCCCT CGCCCACCAG
CGCGACCAGT GCCGCTGGTT CGGCGGGATG GTCGGCAACC ACGTCGCCGA CCTCGTCGCC
CGCTACGGCA CCCCGACCGC GGCCGGCCCT GTTCCGGCAG GCGGCACGGC GCTGCCGTCA
GCCCTGACCA GTTACATCAC CGGCCGCCAC GGCTACGACT ACAACGAGCA CGGTCGCGCC
GGGAACACCC ACACCGACTT CGTCCCCGAC GAGGTCATCG ACCGGTTCTG CCTGCTCGGC
CCGCCGGCAG CGCACATCGA GCGGCTCACC GAGCTCGCTG GCCTGGGCGT CGACCAGTTC
GCGGTCTACC TCCAGCACGA CGCCAAGCGC GCCACCCTGG AGGCCTACGG CGAGACCGTC
ATCCCGGCGG TCAGCGCCAC CATCCAGGCG AAAACACGCT GA
 
Protein sequence
MDVGVVLQTN PPASRVVELA RQAETLGFSH VWTFDSHLLW EEPFVIYSQI LAATRKVKVG 
PMVTNPATRD WTVTASLFAT LNEMFGNRTI CGIGRGDSAV RVLNGRPTTL ATLRECVAVV
RALANGREAE VNGAKLRFPW GTDSRLDVWI AAYGPKALAL AGEIGDGFIL QLADPDIAAW
TIRVVREAAE KAGRDPASVR FCVAAPAYVG DADPLSLAHQ RDQCRWFGGM VGNHVADLVA
RYGTPTAAGP VPAGGTALPS ALTSYITGRH GYDYNEHGRA GNTHTDFVPD EVIDRFCLLG
PPAAHIERLT ELAGLGVDQF AVYLQHDAKR ATLEAYGETV IPAVSATIQA KTR