Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3475 |
Symbol | |
ID | 5671846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4131608 |
End bp | 4132804 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242363 |
Product | cytochrome P450 |
Protein accession | YP_001507783 |
Protein GI | 158315275 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000451038 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00686408 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGGGA CCGAGATCCC CGAGTACCCG CTGGCGCGGA CGTGCCCGTT CCACCCGCCC GCCGGTTACG CCCGCTACCG CGAGCACGGG CCGGTGAACC CGGTGCGGCT CTACGACGGC CGGCGGGTGT GGGCCGTCAC CGGGCACGCC GAGGCCCGCG AGGTGCTGCT GAACACGCGG CTGTTCTCCT CGGAGCGCGC CGACCCGCGC TATCCGGCGA CCAGCCCCCG TTTCGAGGCG GCCCGCAAGG TCCGCAACTT CATCGGCATG GACCCGCCGG ACCACACCGC GCAGCGGCGG ATGCTGCAGT CCAGCTTCAC CATGCGCCGG ATCAACGGCC TGCGCCCCGG CATCCAGCGG CTCGTCGACG AACTGCTCGA CGCGATCGTC GCCAAGGGCC CGGTGGTCGA CCTGGTGCCC GAGTTCGCGC TGCCGATCCC GTCGATCGTC ATCTCCGAGC TGCTCGGGGT GCCCTACGGG GACCACGCTT TCTTCGAGCA GCAGTCCCGG CGGGTGGCCA GCGGCACCTC GACGCTGGAG GAGAGCGCGG ACGCGTTCAC CCAGCTGCTC CAGTACCTCG ACGGGCTCAT CCAGGACAAG GAGCGCTCCG CCGGCGACGG CCTGCTCGAC GTCCTCATCG CCGAGCAGGT GCGCCCCGGG GTCCTGACCC GGCGCGAGCT CGTCGACATC TCGCTGCTGC TGCTCGTGGC CGGCCATGAG ACGACGGCCA GCGCCATCGC GCTCGGCGTG GTCGCCCTGC TCGAGCACCC CGACCAGCTC GCCGCGCTGC GCGCCGACCC CGCGCTGCTG CCCAGCGCCG TCGAGGAGCT GCTGCGCTTC ACCACGATCG CCGACAGCGT GGCCCGGTTC GCGACGGCCG ACACCGAACT GGCCGGCCAG CCCGTCGCCG CCGGGGACGG TGTTCTCGTC GTGCTCTCCG CCGCGAACCG CGACGGCACG GTCTTCCCCG ACCCGGACCT CCTCGACCTG GCCCGCCGCG CCCGCAGCCA CGTGGCCTTC GGCCACGGCG CGCACCAGTG CATCGGGCAC AACATCGCCC GCGCCGAGCT GGAGATCGCG TTCTCCACGC TGTTCGCCCG CCTCCCCGGC CTGCGGCTCG CGGTGCCGCT CGACCGGCTG CCCGGCAAGG ACGCCGGCGG GGTGCAGGGC GTCTTCGAGC TGCCCGTCGC CTGGTGA
|
Protein sequence | MTGTEIPEYP LARTCPFHPP AGYARYREHG PVNPVRLYDG RRVWAVTGHA EAREVLLNTR LFSSERADPR YPATSPRFEA ARKVRNFIGM DPPDHTAQRR MLQSSFTMRR INGLRPGIQR LVDELLDAIV AKGPVVDLVP EFALPIPSIV ISELLGVPYG DHAFFEQQSR RVASGTSTLE ESADAFTQLL QYLDGLIQDK ERSAGDGLLD VLIAEQVRPG VLTRRELVDI SLLLLVAGHE TTASAIALGV VALLEHPDQL AALRADPALL PSAVEELLRF TTIADSVARF ATADTELAGQ PVAAGDGVLV VLSAANRDGT VFPDPDLLDL ARRARSHVAF GHGAHQCIGH NIARAELEIA FSTLFARLPG LRLAVPLDRL PGKDAGGVQG VFELPVAW
|
| |