Gene Franean1_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3971 
Symbol 
ID5672332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4753155 
End bp4754561 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content75% 
IMG OID641242850 
Productputative phytoene dehydrogenase (phytoene desaturase) 
Protein accessionYP_001508267 
Protein GI158315759 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.262975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGCG GCGGGAACGT GCCCGGGGCC GGGAGCGCCA CGGTCGGTGG GGGAGTGGGC 
CGGCGAGGGC ACGCCGTCGT GCTCGGCGCC AGCGTCGCGG GCCTGTTGGC GGCCCGGGTG
CTGGCCGAGC ACGTCGGCCA GGTGACGGTC ATCGACCGCG ACGACGTCAC GCCCACCGCT
GCCGTCGCCC ACCGCGGCGG GGCGGCGCAG GGCCGTCACC TGCATGCCCT GATGGAACGG
GGCCGCCAGA TCCTCGACGA GCTGTACCCG GACTTCACCG CCAAGATCGC CGCAGACGGG
GTGCCGACGG CGGAGACCCT CGTCGGTACC CGCTGGTACT TCGACGGCGC CCGGGTCACC
CCGGTGCCGA CCGGCCTCAC CTCGGTGCTC GCCAGCCGGC CGGCGCTGGA GGCCGCCCTG
CGGGCCGCGA CACTCGGCCA CGACCGGATC CGGCTGCTCC CCGGGCTCCG GGCGGTCGGC
CTCGTCCGCG GATCGGGCTC CACCCGAGCC GCCGGGGTCG TCGGGGTGCG CGTCGAGCCG
CCCGCCGGTG ACGCCCCGGC CCGCACCATC GAGGCCGACC TCGTCGTCGA CGCCACCGGC
CGCGGGTCCC GCGCCTCGGA GTGGCTCGCC GACCTGGGCT TCGACGTCCC ACGGGAGGAG
ACCGTCGGCG TCGACCTCGC CTACGCCTCC CGGACCTACC GCCGGCGGCC CACGGACCTC
GGCGGCGACC TCGGCGTCAT CATCTCGACG CTGCCGGGCC GGCGCGGCGG TGGCGCCGTC
ACCCAGGAGG GCGACCGCTG GATCGTGACC CTCGCCGGCA TGCTCGGCGA CCACCCGCCG
GTCGACGTCC CCGGCTACGA ACGGTTCGCG GCCTCCCTGC CCGCCCCCGA CATCAACCGC
CTCATCCAGG ACGCCGAGCC GCTCGACGAC CCGGTTCGCT ACCGGTTCCG GGCGTCACGG
CGGCTGCGCT ACGACCTCCT GCGCACCCCG GCCGCCGGTT TCGTCGCGAT CGGCGACGCC
CTGTGCACCC TCAACCCGCT CTACGCGCAG GGCATGACCG TCGCCGCGCA GCAGGCTCTC
GAGCTGCGGG CATGTCTGCG TTCGGGTGGC CTGGACGACC TCGCCGCACG ATACTTCACG
GCGGCCGCCC GGCCGACCTC CCGGGCGTGG TCGATCGCCA CCGACTCCGA CCTGCGCTAC
CGGGAGGTCG AGGGCCGCCG CGGGCCCCGC ACCCGGATCA CCAACGCTTA CATCCCCCGA
GTCCAGGCGG CGACCCGATC CGACCCTGTC CTCGCCCGCA ACCTGCTGCG CGTGGTCAAC
CTTGTCGAAC CCCCATCAGT CCTGCTCACC CCAGCGGCCG TGCTGCGGAC CGCCCGACAC
GCACTTACCC GGCGTGGTCA ATCATGA
 
Protein sequence
MPGGGNVPGA GSATVGGGVG RRGHAVVLGA SVAGLLAARV LAEHVGQVTV IDRDDVTPTA 
AVAHRGGAAQ GRHLHALMER GRQILDELYP DFTAKIAADG VPTAETLVGT RWYFDGARVT
PVPTGLTSVL ASRPALEAAL RAATLGHDRI RLLPGLRAVG LVRGSGSTRA AGVVGVRVEP
PAGDAPARTI EADLVVDATG RGSRASEWLA DLGFDVPREE TVGVDLAYAS RTYRRRPTDL
GGDLGVIIST LPGRRGGGAV TQEGDRWIVT LAGMLGDHPP VDVPGYERFA ASLPAPDINR
LIQDAEPLDD PVRYRFRASR RLRYDLLRTP AAGFVAIGDA LCTLNPLYAQ GMTVAAQQAL
ELRACLRSGG LDDLAARYFT AAARPTSRAW SIATDSDLRY REVEGRRGPR TRITNAYIPR
VQAATRSDPV LARNLLRVVN LVEPPSVLLT PAAVLRTARH ALTRRGQS