Gene Franean1_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0517 
Symbol 
ID5668936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp602210 
End bp603124 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content72% 
IMG OID641239446 
Productluciferase family protein 
Protein accessionYP_001504884 
Protein GI158312376 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03564] F420-dependent oxidoreductase, MSMEG_4879 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG GACTCACCGG CGGCGCCTCG ACCCCCGAGA AGATCATCGA CCAGGCGAGA 
AAGGCGGAAG CCGATGGCTT CCACCACCTG TGGTACGCGA GCGTCGTCCA AGGCGACCCG
TTAGCGTCCA TGGCGCTCGC GGGCCGCCAG ACATCGACGA TCGAGCTCGG CACCGCGGTG
CTGCAGACCT ACCCGTGTCA TCCGCTGCTC CAGGCCCAGC GGGCGGCGTC GGTCACGGCG
GCGATGGGCC GGCCGGGCTT CACGCTTGGT CTCGGCCCAT CGCACGCGGG ACACATCCAC
GCCGAGTACG GCCTGTCCTA CGACCGGCCG GGCAAGAACA CCGAGGACTA CCTCCGCATC
GTCACCACCC TGCTGCGCGA TGGCGGAGCG GACATCACCG GCGAGGAATG GAGCACGCAC
GTGCGTTCCG GGACCGTACA GCCGGCGCAC CCGGTGCCCG TGCTGCTCGC CGCGCTCTCA
CCGCGGCTGC TGCGGGTCGC GGGCGAGCAC GCCGACGGCG TGATCCTCTG GATGGCGCCC
GCGACCGCGA TCGAGAAGCA CATCGAGCCG AAGCTGTCGG CCGCGGCGAA GGCGGCGGGC
CGCGGCGCGT TGCGGATCGT CGCCGGGCTC CCGGTCGCCG TGCACGACGA CGAGTCCGAA
GCCCGCGCGG CCGTCGCGAA GAACTCGACG ATGTACGCCA GCTCGCCGGC CTACCAGCGG
ATCATGGCAA TCGGCGGCGC CGATAGCCCC GCGCAGGCCG CGATCGCCGG CGACGAGGCG
TCGGTCGAAC GGCAGCTGCA CGCGCTGCTC GACGCGGGCG CGACCGACAT CTGGGCACAG
CCGATCCCGG TCGGCGACGA CCCGCGCGCC TCGCTGCGCC GCACCCGCAA CCTGCTCGCC
GCTCTGGCCC GGTGA
 
Protein sequence
MRIGLTGGAS TPEKIIDQAR KAEADGFHHL WYASVVQGDP LASMALAGRQ TSTIELGTAV 
LQTYPCHPLL QAQRAASVTA AMGRPGFTLG LGPSHAGHIH AEYGLSYDRP GKNTEDYLRI
VTTLLRDGGA DITGEEWSTH VRSGTVQPAH PVPVLLAALS PRLLRVAGEH ADGVILWMAP
ATAIEKHIEP KLSAAAKAAG RGALRIVAGL PVAVHDDESE ARAAVAKNST MYASSPAYQR
IMAIGGADSP AQAAIAGDEA SVERQLHALL DAGATDIWAQ PIPVGDDPRA SLRRTRNLLA
ALAR