Gene Information |
Locus tag | Franean1_5441 |
Symbol | |
ID | 5673772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6582962 |
End bp | 6583867 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244296 |
Product | Dyp-type peroxidase family protein |
Protein accession | YP_001509702 |
Protein GI | 158317194 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01413] Dyp-type peroxidase family |
|
|
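The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the annotated CDS includes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6582962
end_bp = 6583867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 906 301
```

Under these assumptions the result agrees with the listed Gene Length (906 bp) and Protein Length (301 aa).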
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.129694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCAGA CGGGAATCTT CGGGCTCGGC ACGCCGGAGC ACTGCTACCT CGAGTTCGAC CTTCGTGACG GCGCGTCAGC CGCGGAGCTG GTGAAGGCGG TCGCGGCGCT GACCGGTCCG CTGTCGACCG GCGGTGGGGT CAACCTCGTC GTGGGCTTCC GCCCGGAGCT GTGGGCGCTC GTCGAGCCCG ATGGAATGCC ACCGGGTGTC CGTAGCTTCA CCGAGCCGAT CGTGGGCATC GAGGGGTTCC AGATGCCCAC CACCCAGCAT GACGCGTGGG TCTGGGTCGC GGGAGGTGAC CGCACCGCCG TCTTCAACAA CACGCGGGAT GTCGTCGCCG CGCTGGCCAC GATCGCGACG GTCGCCAGTG AGGTCACCGG CTGGCTGTAC GAGCACGACC GTGATCTCAC GGGGTTCATC GACGGGACGG AGAACCCGTC ACTGCTCGAG GCGGCCGGCG TGGCCGTGGT GGCGGAGGGC GCGGGCGCCG GCAGCAGTGT CGTCCTGGTC CAGCAGTGGC GGCACGACTC CGACAGCTTC CAGGAGCTCC CCGTCGAGGA GCAGGAGCGG GTGATCGGCC GCACCAAGGC CGACAGCGTC GAGCTCGACG AGGACGTGAT GCCGCCGACC TCGCACGTCT CCCGGACAGT CGTCGAGGAG GACGGCGCCG AACTGAAGAT CTTCCGCCGG AACACCGCCT TCGGCACGGT CACCGACCAC GGCACCATGT TCGTTGGTTT CAGCAGCGAG CAACGACGCC TGGAGATCAT GCTGCGGCGG ATGGCCGGCA GCGACGACGG CCTGCGCGAC GCGCTCACCC GCTACACGAC GCCCGTCAGC GGCGCCTACT ACTTCATTCC GGCGGTGCCG GCCCTGGCGA GGTACGCCCC TGAGGAGGAC GACTGA
|
Protein sequence | MTQTGIFGLG TPEHCYLEFD LRDGASAAEL VKAVAALTGP LSTGGGVNLV VGFRPELWAL VEPDGMPPGV RSFTEPIVGI EGFQMPTTQH DAWVWVAGGD RTAVFNNTRD VVAALATIAT VASEVTGWLY EHDRDLTGFI DGTENPSLLE AAGVAVVAEG AGAGSSVVLV QQWRHDSDSF QELPVEEQER VIGRTKADSV ELDEDVMPPT SHVSRTVVEE DGAELKIFRR NTAFGTVTDH GTMFVGFSSE QRRLEIMLRR MAGSDDGLRD ALTRYTTPVS GAYYFIPAVP ALARYAPEED D
|
| |
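The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which GTG is one of the alternative initiation codons and is read as methionine at the start position. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline: the helper name `translate_cds` is hypothetical, and only the first three and last two 10-base blocks of the sequence are used for brevity.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs
# in which codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator codon at position 0 as M.

    Hypothetical helper for illustration; stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator is always methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate_cds("GTGACGCAGA CGGGAATCTT CGGGCTCGGC"))  # MTQTGIFGLG

# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GAGGAGGAC GACTGA"))  # EEDD
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MTQTGIFGLG ... EEDD). Translating the full 906 bp sequence the same way should give the 301-residue protein, although that full run is not shown here.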