Gene Franean1_5124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5124 
Symbol 
ID5673458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6138483 
End bp6139841 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content77% 
IMG OID641243974 
Productzeta-phytoene desaturase 
Protein accessionYP_001509388 
Protein GI158316880 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.229738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0794369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAGG TGTTCGAGGA GCTGTTCGCC GCCACCGGCG GGCCGCTCGG CGCGCAGCTC 
ACGCTGCGCC GCCTCGACCC GATCGCCGCC TACCGGTTCG CCGACGGGAC GGCGGTGACC
GCGCACGCCG ACGACGCCGC GTTCCACGCG GAGCTGGACG CGAGGCTGGG CGCCGGGGCC
GGGGCCCAGT GGCGGCGCCT CGACGAGCGC GCCCGCCGGG TCTGGGCGGT GTCCGAACAG
CCCTTCCTGC GCAGCCCGGT CTCGGCCGCG GCGCTGGCAC GGACCGCCGC GCGCCGCCCG
CTGGGGCTGG TCACCGTCGC GCCCGGCACC ACCCTGCGCG GTATCGGGCG CCGTCACCTC
ACCGATCCGC GGCTGCGCAT GATGCTGGAC CGGTACGCGA CCTACACCGG CTCGGATCCG
CGCCGGGCGC CGGCCGCGCT GGTCACGGTC CCGCACGTGG AGCGACGGTT CGGCGGCTGG
TACGTCCCGG GCGGGCTGCG GCTGCTCGGG CAGGCGATCG CCGAGCGGGC GGCCGAGCGC
GGCGCGGTGA TCCGCATCGG CGCGCCGGTC GCGCGGATCA CCCGCACGCC GGGGGGGTGG
GTGGACGGCG TCCGCCTGGC CGACGGAACG CTGCTGGGTG CCGACCTGGT CGTGTCCGAC
GTCGACGCGG CCCGGCTCTA CGACGGCGCA CCTCCCGCAG CGCCCGGGCG GGCCGGCCCG
CGCCCGCTCG TCGACCATCC GGCCAGCCGG CGGCGGATCC GGCGGCTGGC GCCGTCGCTG
TCCGGTTTCG TCCTGCTGCT GGCACTGCGC GGGCGGACGC CGGGCCTGGC CCACCACACC
GTGCTGTTCC CCGCCGACTA CGAAGACGAG TTCAACGCGG TCTTCGGTGG CCGGCTGGCC
TGGGATCCGA CCGTCTACAT CGCCGCGCCG GACGACCCGG CGACCGCGCC GCCGGGCGAC
GAGGCGTGGT TCGTCCTGGT GAACGCCAGC CCGCACGCCG CCGCCACTGC CGCTGCCGGC
CCGCGGGGTC CCGGTGTGGA CTGGGACCGG CCCGGGCTCG CCGACGCCTA CGCCCGCCGC
ATCCTCGAGG TGCTGGCCTC ACGTGGGCTC GACGTCCGCG CCCGGGTGCG CTGGTACCGG
ACGATCTCGC CGGCCGACCT CGCACGCGCG ACCGGCGCCG TGGGCGGCTC GATCTACGGC
GTGTCCTCCA ACGGGCCGCG GTCGGCGTTC CTACGCCCGC GCAACCGCTC GCCGGTACCC
GGGTTGTTCC TGGTCGGCGG CTCGGCCCAT CCGGGCGGCG GGCTGCCGCT GGTCACGCTC
TCCGCGAAGA TCGTCGCCGA TCTCATCGGC CCGGCCTGA
 
Protein sequence
MPEVFEELFA ATGGPLGAQL TLRRLDPIAA YRFADGTAVT AHADDAAFHA ELDARLGAGA 
GAQWRRLDER ARRVWAVSEQ PFLRSPVSAA ALARTAARRP LGLVTVAPGT TLRGIGRRHL
TDPRLRMMLD RYATYTGSDP RRAPAALVTV PHVERRFGGW YVPGGLRLLG QAIAERAAER
GAVIRIGAPV ARITRTPGGW VDGVRLADGT LLGADLVVSD VDAARLYDGA PPAAPGRAGP
RPLVDHPASR RRIRRLAPSL SGFVLLLALR GRTPGLAHHT VLFPADYEDE FNAVFGGRLA
WDPTVYIAAP DDPATAPPGD EAWFVLVNAS PHAAATAAAG PRGPGVDWDR PGLADAYARR
ILEVLASRGL DVRARVRWYR TISPADLARA TGAVGGSIYG VSSNGPRSAF LRPRNRSPVP
GLFLVGGSAH PGGGLPLVTL SAKIVADLIG PA