Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0622 |
Symbol | |
ID | 5669039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 722500 |
End bp | 723747 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239549 |
Product | cytochrome P450 |
Protein accession | YP_001504987 |
Protein GI | 158312479 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCA CCGCCACCCT CGACCGCGAA CGGGTCCGGC AGCTGTTCGA CCTGCGCAGC AGCTACAACG TGCACATGGG CGGCGGCTAC CGGCAGGACC CGTACCCGGT CTGGCACCGG CTGCGCGAGC AGGCCCCGGT GCACCCGGGG ATCGTCCACG AGCTGACCGG CTTCGACGGC CCGGCGATGT TCTCCGGTCT GCCGTACCCC GACCGCCCGC ACTTCTCGGC CTTCAGCTAC GCCGCGTGCG ACGCCGCCTA CCGCGATCCC GAGGTGTTCG CGTCGGCGGC CGGGCCGGTG GATCCCAACA ACGGCCCGTA CGGCGCGACG AACAGCATGC TGTCGATGGG CGGGAGGCAG CACCGCCGGT ACCGCGGGCT CGTACAGCCC TCGTTCGTGC CGGGCAAGGC GAAATGGTGG ATCAGCAACT GGATCGAGGA GACCGTCGAC CTGCTCATCG ACGGGTTCGT CGACGCGGGG CGGGCCGAGC TGAACGTGGA CTTCTGCGCG GCCATCCCGG TGCTCACGAT CACCAGCAGC TTCGGCGTGC CCGTCGACCG GGCGCTCGAC ATCCGTGCCG CGCTGACCAG GCCCGACGAG ATCGTGCGGA TGCTCGAGCC GATCGTCGCC GCCCGGCGTG CGGACCCGCA GGACGACCTG ATCAGCATCC TCGTGCAGGC CGAGATGACC GACGAGAACG GGGTCACCCA CCGGCTCACC GACGCCGAGA TCTACTCCTT CTCGGTGCTG CTGCTGACCG CCGGCTCCGG CACCACCTGG AAGCAGATGG GCATCACGCT CACCGCGCTG CTGCAGCACC CCGAGGCGCT GGCCGCGGTA CGCGCCGACC GCTCGCTGCT CCGGCTGGCG ATCGAGGAGT CGCTGCGGTG GTCACCCACC GACCCGATGT TCTCCCGCTG GGTGACCGAG GACGTCGACT TCTTCGGCGT GCACATGCCG GCCGGGTCGG TGCTGCACCT GTGCATCGGC GCGGCCAACC GCGACCCGGC CCGCTGGGAC CGCCCGGACG AGTACGACAT CACCCGCGCC CTGCGGCCGT CCTTCGCCTT CGGCGGGGGC GCCCACGTCT GCCTGGGCAT GCACGTGGCC CGCGCCGAGA TGCGGGTCGG GATCGGGGCG CTGCTCGACC GGCTCCCCGA CCTGCGCCTC GACCCCGACC GCGAGGCACC CCGCTTCATC GGCATGTACG AGCGAGGCGC GACCGAGATC CCCGTCGTCT TCGGGTGA
|
Protein sequence | MSTTATLDRE RVRQLFDLRS SYNVHMGGGY RQDPYPVWHR LREQAPVHPG IVHELTGFDG PAMFSGLPYP DRPHFSAFSY AACDAAYRDP EVFASAAGPV DPNNGPYGAT NSMLSMGGRQ HRRYRGLVQP SFVPGKAKWW ISNWIEETVD LLIDGFVDAG RAELNVDFCA AIPVLTITSS FGVPVDRALD IRAALTRPDE IVRMLEPIVA ARRADPQDDL ISILVQAEMT DENGVTHRLT DAEIYSFSVL LLTAGSGTTW KQMGITLTAL LQHPEALAAV RADRSLLRLA IEESLRWSPT DPMFSRWVTE DVDFFGVHMP AGSVLHLCIG AANRDPARWD RPDEYDITRA LRPSFAFGGG AHVCLGMHVA RAEMRVGIGA LLDRLPDLRL DPDREAPRFI GMYERGATEI PVVFG
|
| |