Gene Francci3_4412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4412 
Symbol 
ID3907387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5275757 
End bp5277037 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content74% 
IMG OID637881743 
Productgeranylgeranyl reductase 
Protein accessionYP_483487 
Protein GI86743087 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR02032] geranylgeranyl reductase family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0475548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGGG TTGACGACCC GGAGAACACC TTTGACGTCA TTGTCGTCGG GGCGGGTCCG 
GCCGGTGGCA GCGCGGCGCG GGCGGCGGCG AACGCCGGGG CCCGGGTGTG CGTGCTCGAA
CGCTCGGCGG TACCTCGTTA CAAGACCTGC GGTGGCGGCC TGGTCGGCCT TTCGGCGGAT
CATCTCGGCA TCGAACTGGC CGGTCTGGTT CGAGCCAGCG TCAGCTCCCT GACGATGACC
TGGGACGGCC GCTTCGAGGT CACCCGCGGC CGGTCCGGCG GCCGTCCCTG GATGTCGATG
GTGATGCGAT CCGATCTGGA CGCCGCTCTC CTGGCCGCGG CGGCCGCGGC CGGCGCGACC
GTGCGGACCG GTGCGCACGT GGTGGGGCTG ACCGAGAGCG GGCCGGGCGG GACCGGGATC
GCCGAGGACG CCGGGGATCA GGACGACGGT TGGGTGACCG TGCGTCTGCG TGCGGGGGCG
GACCTGCGCG CTCGGGTGGT GGTGGGTGCG GACGGCACCT CGGGTCGTTG CGGCACCTAT
GTCGGTGTCC GCTGTGACCA GGTCGACGTC GGCCTGGAGG GCGAGTTTGT CGCCGACCCG
GCGCGTGCCG CGCGATGGGC CGGACGCATG CTCATCGACT GGGGGCCGGT GCCGGGTTCC
TACGGGTGGG TGTTCCCCAA GGGACGGGTG CTCACGGTCG GCGTGATCGG TGATCGTGCC
GAGGGCGCCG CGCTGCGCGC CTATTACACC CGGTTCGTCG ACCGTCTCGG GCTCACCGAC
CTGCCGCGCC AGCATGACTC CGGACATCTC ACCCGGGTGC GGGCCGTCGC CTCCCCGCTG
CGGCGGGGCC GGGTGTTGGT CGCGGGGGAC GCGGCCGGGC TCCTGGATCC GTGGACACGC
GAAGGCATCT CGTTCGCATT ACGGTCGGGT CGGCTCGCCG GGGAGGCGGC CGCCGCGGCG
AGTGTGGGCG TCGGTCGCGC GAGGGCGGGC GTGACCCGGG CCGGGGGGGT AACGGATACA
AGGCCCACTG GCCCGGGCGC CACGGCGCTC GACGCCTTGG CGGCCTATCC CGCGCGCGTC
GAGGCGATCC TGGGCCCCGA GATGGCGGCG GGACGAGAGG TCCTGGCCGC GTTCACCCGC
CGGCCGGGCG CGATGCACAC GGTCATGGCG GCTCCGGGCA CCTTCGGGCT GTTCACCCGG
CTCATCGAGG GACGGACCAC CATCGCCCGG CAACTGCGTC GTCCCGGCGT GCGGGCGGCG
GTGGCGGCAC TGAGCCACTG A
 
Protein sequence
MSRVDDPENT FDVIVVGAGP AGGSAARAAA NAGARVCVLE RSAVPRYKTC GGGLVGLSAD 
HLGIELAGLV RASVSSLTMT WDGRFEVTRG RSGGRPWMSM VMRSDLDAAL LAAAAAAGAT
VRTGAHVVGL TESGPGGTGI AEDAGDQDDG WVTVRLRAGA DLRARVVVGA DGTSGRCGTY
VGVRCDQVDV GLEGEFVADP ARAARWAGRM LIDWGPVPGS YGWVFPKGRV LTVGVIGDRA
EGAALRAYYT RFVDRLGLTD LPRQHDSGHL TRVRAVASPL RRGRVLVAGD AAGLLDPWTR
EGISFALRSG RLAGEAAAAA SVGVGRARAG VTRAGGVTDT RPTGPGATAL DALAAYPARV
EAILGPEMAA GREVLAAFTR RPGAMHTVMA APGTFGLFTR LIEGRTTIAR QLRRPGVRAA
VAALSH