Gene Francci3_4207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4207 
Symbol 
ID3907172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5023181 
End bp5024401 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID637881535 
Productsalicylate 1-monooxygenase 
Protein accessionYP_483284 
Protein GI86742884 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.880254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCAGG GAAGAAACAC GCCCGCTGTC GCGGTGATCG GGGCCGGCAT CGGCGGCCTG 
ACCCTGGCGC TGGCCCTGGC GAGGGCCGGC GTTCCGTGCC GGGTGTACGA ACAGGCGGAG
AACCTCTCGG AGGTCGGGGC CGGGATCCAG CTCGCGCCGA ACGCGGTGCG GCTGCTGAAC
CGGCTCGGCC TGACGGACAG CCTGCGCGTG ATTGCGGTCG CGCCCCAGGC CATAGAAATC
CGGCGCTGGC ACGACGACCA GTTGCTGTCC CTGACCAGTC TGGGATCCCT GTGCCAGGAG
TTGTACCGCG CGCCCTACTA CACGCTGCAC CGCGCCCATC TGCACGATGT GCTGAAGCGG
GCCGTCGGCA TGGAAAGGGT GTCGCTGGGG AGTCGGCTTG TCCGCGTGGT TGAACAGGAG
CACGGTGTCG AGCTTCACTT CGCGGACAGT ACCGTCCGAA CGGCCGACCT GGTGATCGGC
GCGGACGGAA TCCATTCGGC GGTACGAGAC GCGTTGATCC GCGATGAGCA GGTGTACTCG
GGTAACGTGG TCTACCGTGG CCTGATACCA GCGGAGCGGC TCTCCGGACT GGGCCGAATC
CCCAAGGTGC GCATATGGAT CGGACCGGGC AAGCACTGCG TGTCCTACCC CGTGGCAGGC
GGGCGACTGA TCAGCTTCGC TGCGACCGCA CCGCGTCCCC ACGTGTCGGA ATCATGGTCA
GCCGACGGGG ATCAAGAAGA ACTGCTCGCT GAGTATGCGG GCTGGAACGG CACCACACGA
CGGATCCTGG AGGCTGGGGA CAGCGTTCGG TGCTGGGCAC TGCATGACCG GGATCCGCTA
CGTACCTGGT GTTCGCAGCG GATCGCCGTC CTGGGTGATG CGGCCCATTC CATGCTGCCG
TTCCTGGCGC AGGGTGCCAA TCAGGCCATC GAGGACGCAG CGGCTCTTGC GGTCTGCCTG
GCCCAGGCCG ACGACATCCC GGATGCGCTG GGCCGGTACC AGCAACTACG CGTTCCACGC
ACCACGCTCA TCCAGCGCGA ATCCCGGCAC AACGCACGCG TCATGCATCT GGCTGACGGC
CCGGAGCAGC ACCGAAGGGA CCCCGCGTGG CTGGGCAACG TCCAACTGCG GCGGATGGCC
TGGCTCTACG GCTACGACGT CCTGCAAGAA GCCCGTCAGG CCGGTGGACC AAGGATCAAC
GGGACCCCGG CCTCCGCCTG A
 
Protein sequence
MVQGRNTPAV AVIGAGIGGL TLALALARAG VPCRVYEQAE NLSEVGAGIQ LAPNAVRLLN 
RLGLTDSLRV IAVAPQAIEI RRWHDDQLLS LTSLGSLCQE LYRAPYYTLH RAHLHDVLKR
AVGMERVSLG SRLVRVVEQE HGVELHFADS TVRTADLVIG ADGIHSAVRD ALIRDEQVYS
GNVVYRGLIP AERLSGLGRI PKVRIWIGPG KHCVSYPVAG GRLISFAATA PRPHVSESWS
ADGDQEELLA EYAGWNGTTR RILEAGDSVR CWALHDRDPL RTWCSQRIAV LGDAAHSMLP
FLAQGANQAI EDAAALAVCL AQADDIPDAL GRYQQLRVPR TTLIQRESRH NARVMHLADG
PEQHRRDPAW LGNVQLRRMA WLYGYDVLQE ARQAGGPRIN GTPASA