Gene Francci3_0819 details

Gene Information

Locus tag: Francci3_0819
Symbol:
ID: 3906446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 954966
End bp: 956039
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 73%
IMG OID: 637878152
Product: squalene/phytoene synthase
Protein accession: YP_479932
Protein GI: 86739532
COG category: [I] Lipid transport and metabolism
COG ID: [COG1562] Phytoene/squalene synthetase
TIGRFAM ID: [TIGR03464] squalene synthase HpnC
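
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 954966
END_BP = 956039
GENE_LENGTH_BP = 1074
PROTEIN_LENGTH_AA = 357


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1074, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 357, matches Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.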


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.224139
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCCA TCGACGCCCG AACCAGCCCA TCACGCACGC CTGCCCAGCC GGCCGCGACG 
GGCCTGGCAA CTCCCGCCGA GCAGCTGCTG CGCGCCGCCC CGGCGGAGAA CTTCCCGGTC
TCGCCGTTCG TTCTGCCCGC GGCGGTTCGT TTCCACTTCA ACGCGTTGTA CGCCTTCAGC
CGGCTCGTCG ACAACCTCGG CGACGAGGCG GCCGGCGACC GGCTCGCCCT GCTCGACCGG
TTGTCGGCGG ACCTCGAGGT GATCTGGACA GGTGGGCAGC CGGAGCTGCC GGTCCTGCGC
CTGCTCGCGC GGACCGTGCG GGACTGTGAC CTGCCCGCCG AACCGTTCCA GCGCCTCGTC
GAGGCCAACC GGCAGGACCA GCGGGTCACC CGTTACGAGA CCTTCGACGA CCTGGTCCGC
TACTGCACGC TCTCGGCCGA TCCGATCGGA CGGATGGTGC TGGGCGTCTT CGGGCTGGCG
ACCCCCGACC GGGTCGTGCT GTCGGACCGG GTGTGCACCG CCTTGCAGCT CGCCGAGCAC
TGGCAGGACG TGGCCGAGGA CCTCGCCGCC GGCCGGATCT ACCTGCCGCT GGAGGATCTG
GACACCTTCG GGGTGACCGA GGCCGATCTG CGGGCTTCCG TCGCGAGTCC GGCCGTGCGC
CATCTGATGG CCTTCGAGGT CGCTCGGGCC CGTACGGTGA TCGACCAGGG CGCTCCCCTG
GTGTCGATGG TGCCCGGGCG GCTGCGGCTG GCCCTGGCCG GTTTCGTGGG CGGGGGCCGG
GCGGCGTTGG ACGCGATCCG GCGCGCCGAC TACGACGTGC TCGGTGGGCC ACCGAAGGCG
ACGAAGCCAC GGGTCGCCGA GTTCGCGCTG GCGGCGCTGG CCCGGTCGCT GGCTCCCGGA
GCGTCGGCGG TGGCGCACAC GGCGGCCGCC GTCGCCACCG CGACCAGCGC GGCCGGGGCC
TGGCCGGGTT CCGGTTCCGG TTCCACAGCC CACGGCGGCA CCGCCGCGAC GAGTACCCAG
GCCGGCGTCC CCGCAGCACA GTCCGTTCTT CCGGAGATGG GTGAGGTTCG ATGA
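
A GC content figure such as the one reported for this gene is conventionally the share of G and C bases among all bases in the sequence. The following is a minimal sketch of that calculation, not the database's own method: the function name `gc_percent` is hypothetical, and skipping whitespace and any non-ACGT characters is an assumption made here so that the space-separated blocks can be pasted in directly. Only the first row of the gene sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other characters are ignored (an assumption
    of this sketch, so that block-formatted sequence text can be used).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied with its block spacing.
FIRST_ROW = (
    "ATGACAGCCA TCGACGCCCG AACCAGCCCA "
    "TCACGCACGC CTGCCCAGCC GGCCGCGACG"
)

print(round(gc_percent(FIRST_ROW), 1))
```

Passing the full 1074 bp sequence instead of the first row would give the gene-level value to compare against the reported GC content.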
 
Protein sequence
MTAIDARTSP SRTPAQPAAT GLATPAEQLL RAAPAENFPV SPFVLPAAVR FHFNALYAFS 
RLVDNLGDEA AGDRLALLDR LSADLEVIWT GGQPELPVLR LLARTVRDCD LPAEPFQRLV
EANRQDQRVT RYETFDDLVR YCTLSADPIG RMVLGVFGLA TPDRVVLSDR VCTALQLAEH
WQDVAEDLAA GRIYLPLEDL DTFGVTEADL RASVASPAVR HLMAFEVARA RTVIDQGAPL
VSMVPGRLRL ALAGFVGGGR AALDAIRRAD YDVLGGPPKA TKPRVAEFAL AALARSLAPG
ASAVAHTAAA VATATSAAGA WPGSGSGSTA HGGTAATSTQ AGVPAAQSVL PEMGEVR