Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4256 |
Symbol | |
ID | 5672611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5076867 |
End bp | 5078087 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243129 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_001508546 |
Protein GI | 158316038 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.798604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.385246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACT CCCCCGCTCG TCCGCGCGCC GCGAGCGCGG GTGCCGCGCG GCTCCCCGTC CTCGACCTGC GCGACTACAC CGCGCCGGAC GGCCGTGCCG ACCGCCGGGA CTTCCTCGAC GCGCTGCGCG CGGCCTGCCG CGATCCAGGC TTTCTCCAGC TGACCGGGCA CGGCGTACCT TCCGGCCTCA CGGAGCGGAT CATCGCGGTG AGCCGCGCGT TCTTCGACCT GCCGCTCGCC GCCAAACTCG AGATCGAGAA CGTCCACTCC CCGCACTTCC GCGGCTACAC CTGCCTCGGG CACGAGATCA CCCGGGGCAG GCCGGACCTC CGCGAGCAGA TCGATATCTC CGACGAGCGG CCGGCCCGCG TCCTCGGGCC GGACGACCCG CCCTACCTGC GTCTGGACGG GCCCAACCAG TGGCCGGCGG CGCTGCCGGA GCTGCGCGTG GCCGCCCTCG TCTACCTGGC CGAGCTTGGG CGCGTCGCGC GGGTGCTGGT GCGCGCGCTC GCCGAATCGC TGGGGCTGCC GCCCGACCAC CTGGACCCGA CCTTCTCCGC CGAGCCCCGC TCCCACCTGA AGCTGCTGCG CTACCTGCCG ACCCCTGCCG GCAGCGCCAC CGGACACGAC GCTGCCGGAC ACGACGCCGC CGAGCCCAAC GCCGCCGCGG ACGGCCAGAA CGTCGACCAA GGCATCGGCC AGGGCGTCGG CGCGCACAAG GACGGCGGCT TCCTGACCTT CGTCCTGCAG GACGGCGTCC GCTCAGCCCG CCAGGACGGC GCCCACCCGG CCCCCCGCTC CCCCACGGGC CCGTCCGGCC TGCAGGTCGC CGACGGCGCG GGCGGGTGGA TCGAGGCCGC CGCAGTGCCT GGCGCGTTCG TGGTGAACAT CGGCGAGATG TTCGAGCTGG CCACCCGGCG TTACTACCGG GCAACCGTCC ACCGGGTCGT GAGCCCACCA CCAGGCCACG AGCGGGTGTC CGTGGCGTTC TTCTTCGGGC CACGGCTGTC GGCCACCCTC GAACCCATGC CGCTGCCCGA TGCCCTCCTC GCGGAGATCC CCGACGCCGA ACCACCCGAC CCGGAGAACC CGATCTTCGC CCAGCACGGG ACGAACACCC TGAAGAGCTG GCTGCGCAGC CATCCCGAGG TGGCCCGCCG CCACTACGCC GACGTGGCAC CCCCGGCGGG CGTGGCGCCG ACGGCCGGAG GCGGCGCGTG A
|
Protein sequence | MPDSPARPRA ASAGAARLPV LDLRDYTAPD GRADRRDFLD ALRAACRDPG FLQLTGHGVP SGLTERIIAV SRAFFDLPLA AKLEIENVHS PHFRGYTCLG HEITRGRPDL REQIDISDER PARVLGPDDP PYLRLDGPNQ WPAALPELRV AALVYLAELG RVARVLVRAL AESLGLPPDH LDPTFSAEPR SHLKLLRYLP TPAGSATGHD AAGHDAAEPN AAADGQNVDQ GIGQGVGAHK DGGFLTFVLQ DGVRSARQDG AHPAPRSPTG PSGLQVADGA GGWIEAAAVP GAFVVNIGEM FELATRRYYR ATVHRVVSPP PGHERVSVAF FFGPRLSATL EPMPLPDALL AEIPDAEPPD PENPIFAQHG TNTLKSWLRS HPEVARRHYA DVAPPAGVAP TAGGGA
|
| |