Gene Francci3_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2068 
Symbol 
ID3904641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2432596 
End bp2433753 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID637879404 
Productsalicylate 1-monooxygenase 
Protein accessionYP_481170 
Protein GI86740770 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.341777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA GTGGGCCTCG AGTCGGAATT CTTGGTGCCG GGATAGCAGG CGTGACCCTG 
GCGTTGATGT TGGCGGATAG TGGCGTACCT AGCGTCCTGT TCGAGCAGGC GGAACTGTCC
GGCGAGATCG GCGCGGGAGT GCAGCTCGCG CCGAGCGCGG TGCGGCTGTT GCACCGATTG
GGCTTGGCGG ATGCCCTGCG GGAGATCGCG GTCGAGGTCG AGTCCGCACT GATGTGTCAG
TGGGACGACG GAACGGTGGT CGCCCGGACA CCGTTCGGGA AGGACTGCGC GGACAGGTAC
GGCGCGCCGT ACTACACCGT GCACCGCGCC GACCTACACG CGCTGTTGAC CTCCAGGCTG
GAGACGCCCG TCGCCACCGG GCGGCGGTGC GTCGAGGTGA CGGAGGACGA CGAGTCCGTT
CGACTTCTCT TCGCGGACGG CTCGACCGAG GAAGTCGGAC TGCTGATCGG TGCGGACGGG
ATACGTTCGG TTGTCCGTTC ACGCATATCG GCCGACGTCC CGTGTTTCTC AGGCGAGCTC
ATCTATCGGG GGCTTGTCGA CGCACGGCGG CTGCCCGGAG CGTTCGGGAA CTCGATCCGG
GTCTGGAAAG GCACGGACGG CCACGCCGTG ATGTATCCCG TCCGACGCGG CCAGCTGATC
AGTGTCGCCG CGACGGTCCC GGCCGAAGAG GGCGGCCACG AGTCGTGGTC GCGGCGCGGC
GACCTCACCG CGATGCGGGC CAGATACGAC TGCTGGCACG ACTCCGTCCG CTCGGTGCTG
GCCGCCCTCG ACGAGGTCAC CGTGTGGGCG CTGCATGACC GTGAGCCCAT CGAATGCTGG
GCGACCGACC GCACCGTACT GATCGGCGAC GCCGCGCATC CGATGCTGCC GTTCCTCGCT
CAGGGCGCGA ACCAGGCGAT TGAGGACGCG ACGGTCCTGG CCCTGTGCCT GGCGGACGCC
GCTGACTCGC ACCGGGCCGC GTTCGACCGC TACCAACGGC TACGGGTTCC GCGGGCGAGA
GAGATCCTTC TTAGCTCCCG GCAGAACGCG GAGCATCTGC GGGCCGAGGA GCCTGCGTCG
CGCCAGGACG CCGCGGTGAC CGAGCTGCCC CTCGAGGACC ACGACTGGCT CTTCGGCCAT
CACGCGGAGG AGGTCTGA
 
Protein sequence
MKNSGPRVGI LGAGIAGVTL ALMLADSGVP SVLFEQAELS GEIGAGVQLA PSAVRLLHRL 
GLADALREIA VEVESALMCQ WDDGTVVART PFGKDCADRY GAPYYTVHRA DLHALLTSRL
ETPVATGRRC VEVTEDDESV RLLFADGSTE EVGLLIGADG IRSVVRSRIS ADVPCFSGEL
IYRGLVDARR LPGAFGNSIR VWKGTDGHAV MYPVRRGQLI SVAATVPAEE GGHESWSRRG
DLTAMRARYD CWHDSVRSVL AALDEVTVWA LHDREPIECW ATDRTVLIGD AAHPMLPFLA
QGANQAIEDA TVLALCLADA ADSHRAAFDR YQRLRVPRAR EILLSSRQNA EHLRAEEPAS
RQDAAVTELP LEDHDWLFGH HAEEV