Gene Franean1_5128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5128 
Symbol 
ID5673462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6142910 
End bp6144430 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content75% 
IMG OID641243978 
Productzeta-phytoene desaturase 
Protein accessionYP_001509392 
Protein GI158316884 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.146602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAG TGACCGGGCC GACGGACCAC GTCGTCATCG TCGGAGCCGG GCTGGGCGGA 
CTTTCGGCCG CGCTCCGGCT GACCGGCGCG GGCCGGCGGG TCACCGTCCT GGAACGTGAC
GACACCCCCG GTGGGCGCGC GGGGTCGCTG CGGCTGGGCG GCTACCGGTT CGACACCGGG
CCGACCGTGC TCACCATGCC GGACCTGGTG GCGGACGCGC TCGACTGCGT GGGCGAGGAC
CTCGGCCGGT GGCTGCCACT GCGCCGCCTC GACCCGATGT ACCGGGCCCG CTTCGCGGAC
GGGTCCGTGC TGGACGTCCG CGCCGACCCG GAGCACACCG AGCAGGGGGT GCGGGAGCTG
TGCGGGCCGG CCGAGGCGGC CGGATTCCGT GACTTCACCC GGTTCGCGAC CCGCATGTTC
CAGGCGCAGA TGCGCGACTT CATCGACGCC CAGGTCGACT CGCCGCTGGC GCTGATGCGG
CCCTCGCTGG CCCGGGTGGC GGCGCTCGGC GGGTTCCGCC GCCTGGACAC CGTGGTCGGA
CGGTACCTTC GCGACCCGCG GACCCGGCGG CTGTTCTCCT TCCAGGCGAT GTACGCCGGG
CTCGACCCGC ACGAGGCGCT CGGGCTGTAC GCCGTGATTG CCTACATGGA CTCGGTGGCC
GGCGTCTTCC ACGCCGACGG CGGCCCGCAC GCGGTCGCCG CGGCCATGGC CGGGGCGGCG
GCCCGGCACG GGGCCACCTT CCGCTATGGC ACCGAGGTGA ACCGGGTGGA GGTCCGCGGC
GGGCGGGCGG TCGCCGTGCA CACGACCGCC GGCGAGCGGG TGCCGGCGGA CGTCGTGATC
CTCAACCCGG ACCTCCCGGT GGCCTACCGT GACCTGCTGC CCGCCTCCGC CACCCCGGCG
CGGCTGGACC GCCTGCGGCA CTCGCCGTCC TGCTTCCTGC TGCTCGCCGG TGCCCGGGGC
GCCCACCCGT CCGGCGCGCA CCACACGATC CACTTCGGCG GCGCGTGGCG GCGCACCTTC
GACGAGATCA TTCGACGCGG GGAGCTGATG AGCGACCCGT CGTTCCTGGT GAGCACGCCG
TCGGTCACCG AGCCGGCGGC CGCGCCGGCG GGCGGGCACA GCTACTACGT CCTGTTCCCC
ACCCCGAACC TGACCGCGCC GCTGGACTGG TCGGTCCTCG GCCTGCGGTA CCGCGACGAG
GTCGTCGCCA CGCTGGAGCG TGCCGGCTAC CCCGGGTTCG GGACGTCGAT CGACGTCGAG
CAGGTGACCA CCCCGGCGGA CTGGCGGGCC CGTGGGATGG CCGCCGGCGC GCCGTTCGCG
GCGGCGCACA CCTTCCGCCA GACCGGCCCG TTCCGGCCGT CCAACCTGGC GCCGGGACTG
GCCAACGTGG TGTTCGTCGG CAGCGGAACC CGGCCGGGGG TGGGAGTTCC GATGGTGCTG
ATATCGGGCC GGCTGGCCGC CGAGCGGGTT CTCGGCCGTG ATCGTGGCTA CCGCACGCGG
ACACTGAGGG CTATTCCCTG A
 
Protein sequence
MRTVTGPTDH VVIVGAGLGG LSAALRLTGA GRRVTVLERD DTPGGRAGSL RLGGYRFDTG 
PTVLTMPDLV ADALDCVGED LGRWLPLRRL DPMYRARFAD GSVLDVRADP EHTEQGVREL
CGPAEAAGFR DFTRFATRMF QAQMRDFIDA QVDSPLALMR PSLARVAALG GFRRLDTVVG
RYLRDPRTRR LFSFQAMYAG LDPHEALGLY AVIAYMDSVA GVFHADGGPH AVAAAMAGAA
ARHGATFRYG TEVNRVEVRG GRAVAVHTTA GERVPADVVI LNPDLPVAYR DLLPASATPA
RLDRLRHSPS CFLLLAGARG AHPSGAHHTI HFGGAWRRTF DEIIRRGELM SDPSFLVSTP
SVTEPAAAPA GGHSYYVLFP TPNLTAPLDW SVLGLRYRDE VVATLERAGY PGFGTSIDVE
QVTTPADWRA RGMAAGAPFA AAHTFRQTGP FRPSNLAPGL ANVVFVGSGT RPGVGVPMVL
ISGRLAAERV LGRDRGYRTR TLRAIP