Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6383 |
Symbol | |
ID | 5674699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7750573 |
End bp | 7751712 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245232 |
Product | cytochrome P450-like protein |
Protein accession | YP_001510627 |
Protein GI | 158318119 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.158835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.229787 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTTC GGCCAGCTGT GTCAGCTCGA CCAGCCTTGT CAGTTCGGCC AGCCATGTCG GTTAGGGAAA GCCTGCGCGA CAACGCGCTC GGCATGGTCC TCGACATGTC GGCGTTGCCC GCGCCACTGC GGCTGATGGC CCGGCTCTCC CCCGACCCGG CGGTGAACGG GCCGCTGGAT CCGCCGTCCC TGCTGGCTAT CGACCCGCCG GACCACACCC GTCTGCGCCG GCTCGTGTCG AAGGTGTTCA CGCCCCGCGC GGTCGACGGG CTGCGCGGGG AGGTGGAACG GGCCGCCGAT GCGCTGCTCG ACGCCATGGC CGACGAGGGC ACCGTCGATC TCGTCGGCCG CTGCGCGAGC CGGCTGCCGG TGACCGTGAT CGCCACGATT CTCGGCGTGC CCGCGGCGAT GCACGCGCAG TTCCTCGCCT GGGGCACCGC GGCCGCGGCC ACGCTCGACT TCGGCCTTCC CTACGGCGCA TTCCGGCGGG TGGAACGGGC GCAGCAGGCC ATCAACACCT GGCTGCGCGG CCATTTCGAG GGGCTGCGCC GCGATCCCGG CGAGGACATT CTCAGCCAGC TCGTCAGGCT CGTCGACGAC GGTGACCGGC TGACCGAGAC GGAGCTCATC GCGACCGCCC AGCTCCTGCT CGCCGCCGGC TTCGAGACGA CCGTCAACCT GCTGGGCAAC GGCGCGGCGT TGTTGATGGA ACATCCCGAC CAGCTGGAGT ATCTGCGGTC CGGGCCCGAC CGGTGGCGGG TGCTGCTCGA CCCGGCCGGC CATCCGTTCT GCCGGCTGAA TTCGGTGGCT CCGGTGGCTC CAGCGGCAGA GTCGCCCAGC GCCGGCGCGA CCCACCACCA GCACAGCCCC AGCGCCGGCG CGGCCCCACC ACTGGCGCGA CCGCCGGCAC GACGATCCGT ACGACGACCG GCGCGCGGGC CGGCCCGGTG CCACAGTCGG CCCGGCCCGG CAGTCGGCCC GGCACCGCGA GGGCGCCGGG CGCGGCCACG GGCGCCGGCG CCGGAGGAGA CACATGCCAC GCCGGCTCCC CCGGGCCGCG CCCGGAGCGG TAGAAGGACT TCATGGGCGC GACGCACGAC CCCGCGAGGA CACAGCTGGC GAACTACCCG TTCGTCATGA
|
Protein sequence | MSVRPAVSAR PALSVRPAMS VRESLRDNAL GMVLDMSALP APLRLMARLS PDPAVNGPLD PPSLLAIDPP DHTRLRRLVS KVFTPRAVDG LRGEVERAAD ALLDAMADEG TVDLVGRCAS RLPVTVIATI LGVPAAMHAQ FLAWGTAAAA TLDFGLPYGA FRRVERAQQA INTWLRGHFE GLRRDPGEDI LSQLVRLVDD GDRLTETELI ATAQLLLAAG FETTVNLLGN GAALLMEHPD QLEYLRSGPD RWRVLLDPAG HPFCRLNSVA PVAPAAESPS AGATHHQHSP SAGAAPPLAR PPARRSVRRP ARGPARCHSR PGPAVGPAPR GRRARPRAPA PEETHATPAP PGRARSGRRT SWARRTTPRG HSWRTTRSS
|
| |