Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4207 |
Symbol | |
ID | 3907172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5023181 |
End bp | 5024401 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881535 |
Product | salicylate 1-monooxygenase |
Protein accession | YP_483284 |
Protein GI | 86742884 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.880254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCAGG GAAGAAACAC GCCCGCTGTC GCGGTGATCG GGGCCGGCAT CGGCGGCCTG ACCCTGGCGC TGGCCCTGGC GAGGGCCGGC GTTCCGTGCC GGGTGTACGA ACAGGCGGAG AACCTCTCGG AGGTCGGGGC CGGGATCCAG CTCGCGCCGA ACGCGGTGCG GCTGCTGAAC CGGCTCGGCC TGACGGACAG CCTGCGCGTG ATTGCGGTCG CGCCCCAGGC CATAGAAATC CGGCGCTGGC ACGACGACCA GTTGCTGTCC CTGACCAGTC TGGGATCCCT GTGCCAGGAG TTGTACCGCG CGCCCTACTA CACGCTGCAC CGCGCCCATC TGCACGATGT GCTGAAGCGG GCCGTCGGCA TGGAAAGGGT GTCGCTGGGG AGTCGGCTTG TCCGCGTGGT TGAACAGGAG CACGGTGTCG AGCTTCACTT CGCGGACAGT ACCGTCCGAA CGGCCGACCT GGTGATCGGC GCGGACGGAA TCCATTCGGC GGTACGAGAC GCGTTGATCC GCGATGAGCA GGTGTACTCG GGTAACGTGG TCTACCGTGG CCTGATACCA GCGGAGCGGC TCTCCGGACT GGGCCGAATC CCCAAGGTGC GCATATGGAT CGGACCGGGC AAGCACTGCG TGTCCTACCC CGTGGCAGGC GGGCGACTGA TCAGCTTCGC TGCGACCGCA CCGCGTCCCC ACGTGTCGGA ATCATGGTCA GCCGACGGGG ATCAAGAAGA ACTGCTCGCT GAGTATGCGG GCTGGAACGG CACCACACGA CGGATCCTGG AGGCTGGGGA CAGCGTTCGG TGCTGGGCAC TGCATGACCG GGATCCGCTA CGTACCTGGT GTTCGCAGCG GATCGCCGTC CTGGGTGATG CGGCCCATTC CATGCTGCCG TTCCTGGCGC AGGGTGCCAA TCAGGCCATC GAGGACGCAG CGGCTCTTGC GGTCTGCCTG GCCCAGGCCG ACGACATCCC GGATGCGCTG GGCCGGTACC AGCAACTACG CGTTCCACGC ACCACGCTCA TCCAGCGCGA ATCCCGGCAC AACGCACGCG TCATGCATCT GGCTGACGGC CCGGAGCAGC ACCGAAGGGA CCCCGCGTGG CTGGGCAACG TCCAACTGCG GCGGATGGCC TGGCTCTACG GCTACGACGT CCTGCAAGAA GCCCGTCAGG CCGGTGGACC AAGGATCAAC GGGACCCCGG CCTCCGCCTG A
|
Protein sequence | MVQGRNTPAV AVIGAGIGGL TLALALARAG VPCRVYEQAE NLSEVGAGIQ LAPNAVRLLN RLGLTDSLRV IAVAPQAIEI RRWHDDQLLS LTSLGSLCQE LYRAPYYTLH RAHLHDVLKR AVGMERVSLG SRLVRVVEQE HGVELHFADS TVRTADLVIG ADGIHSAVRD ALIRDEQVYS GNVVYRGLIP AERLSGLGRI PKVRIWIGPG KHCVSYPVAG GRLISFAATA PRPHVSESWS ADGDQEELLA EYAGWNGTTR RILEAGDSVR CWALHDRDPL RTWCSQRIAV LGDAAHSMLP FLAQGANQAI EDAAALAVCL AQADDIPDAL GRYQQLRVPR TTLIQRESRH NARVMHLADG PEQHRRDPAW LGNVQLRRMA WLYGYDVLQE ARQAGGPRIN GTPASA
|
| |