Gene Information |
Locus tag | Franean1_7007 |
Symbol | |
ID | 5675318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8544478 |
End bp | 8545932 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245853 |
Product | Dyp-type peroxidase family protein |
Protein accession | YP_001511244 |
Protein GI | 158318736 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family |
|
|
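The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(8544478, 8545932)
print(length)                  # 1455
print(protein_length(length))  # 484
```

Under these assumptions the computed values agree with the Gene Length (1455 bp) and Protein Length (484 aa) fields listed in the record.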
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.54132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGAC TGCCCGATCC CAGCGGCCCG GACGCCGTCG CCGCCGAGGA CCCCACCACC GCCGTCACCT CCAGCGGCGC CGGCGTGAAC GACGCCGGCC CCACCACCGC CGTCACCTCC AGCGACGCCG GCGCGGGCGA CGCCGACGGC GACGGCCCGC GCAGCGGCCG CGGGAGCCAG CCGGCGCCCG CGGCACGCGG TTTCAGCCGT CGCCGCATGG TCGCGTTCCT CGGCGGCGCC GCGGCTGTCG GCGCGGGCGG CACCGCCGCC GGCTTCGTCG CCTCCAACTC GGAGGAGGAG TCGCCGGGGT CGGGCCAGAG GGTTCCGTTC TTCGGTTCGA ACCAGGCCGG AATCGTCACC CCGGTACAGG ACAGGCTGCA CTTCGCCGCC TTCGACCTCA GCCCGGCGGC CACCCGCGAC GACCTGATCG CGCTGCTCAC CGCTTGGACC AACGCGGCGT CCAGGATGAC CGCCGGGCTG GACGTCGGCA CCGGCGCTGT CACCGGTGCG CCCGGCTCCC CGCCGGACGA CACCGGTGAG GCGCTGGGGC TGTCGCCCGC CCGGCTGACC CTCACACTCG GCTTCGGCAC CAGCCTGTTC ACCGACGCCA GCGGCAAGGA CCGTTTCGGG ATCGCCGCTT CGCGACCGGC GCAGCTCGCC GACCTGCCCG CCTTCCCCGG CGACGCCCTC GACCCAGCGT CCAGCGACGG CGACCTGTGC GTGCAGGCCT GTGCCGACGA CCCGCAGGTG GCCGTGCACG CCATCCGCAA CCTGGCCCGC CTCGCCCGGG GCGCCGCCTC GGTGCGCTAC TCCCAGCTCG GGTTCGGCCG CACCAGCTCC ACCTCGACCG GCCAGGCCAC CCCACGCAAC ATGATGGGTT TCAAGGACGG CACCGCCAAC ATCAAGGCCG AGGACGCGGC CACGATGAAC ACCCACGTCT GGGCCCAGCC CGGTGACGGG CCGGACTGGA TGACCGGCGG CAGCTATCTC GTCAGCCGCC GCATCCGCAT GCTCATCGAG CCCTGGGACA GCACCCCGCT CACCGAACAG GAACGGGTCA TCGGCCGCGC CAAGGGAAGC GGAGCTCCGC TCGGCCAACG GGACGAGTTC GACCCGTTGG ACTTCGCGGC GAAGGACTCC GCCGGAGAGC TGGTCGTCGA CACCAAGGCC CACGTACGGC TCGCCCACCC GACCCAGAAC AACGGCGCCG TGATCCTGCG CCGTGGCTAC TCCTTCACCA ACGGCACCGA CAACCTCGGC CGCCTCGACG CCGGGCTGTT CTTCATCGCC TATCAGCGGG ACCCGCGGAC CCAGTTCGTC ACAATTCAGA AATCACTGGC CGGCAGGTCC AACGACGCGC TCAACGAATA CATTCAGCAC GTCGGCAGCG GCCTGTACGC CTGCCCGCCG GGCGTCCAGC CAGGACAGTA CTGGGGCCAG AAGCTCTTCG CCTGA
|
Protein sequence | MSRLPDPSGP DAVAAEDPTT AVTSSGAGVN DAGPTTAVTS SDAGAGDADG DGPRSGRGSQ PAPAARGFSR RRMVAFLGGA AAVGAGGTAA GFVASNSEEE SPGSGQRVPF FGSNQAGIVT PVQDRLHFAA FDLSPAATRD DLIALLTAWT NAASRMTAGL DVGTGAVTGA PGSPPDDTGE ALGLSPARLT LTLGFGTSLF TDASGKDRFG IAASRPAQLA DLPAFPGDAL DPASSDGDLC VQACADDPQV AVHAIRNLAR LARGAASVRY SQLGFGRTSS TSTGQATPRN MMGFKDGTAN IKAEDAATMN THVWAQPGDG PDWMTGGSYL VSRRIRMLIE PWDSTPLTEQ ERVIGRAKGS GAPLGQRDEF DPLDFAAKDS AGELVVDTKA HVRLAHPTQN NGAVILRRGY SFTNGTDNLG RLDAGLFFIA YQRDPRTQFV TIQKSLAGRS NDALNEYIQH VGSGLYACPP GVQPGQYWGQ KLFA
|
| |
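The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that spacing and translate the gene sequence under translation table 11, which shares its amino acid assignments with the standard code but accepts alternative start codons such as GTG. It assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA, so no reverse complement is applied despite the minus strand). The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        # An alternative start codon in first position is read as Met.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
prefix = clean_sequence("GTGAGCCGAC TGCCCGATCC C")
print(translate(prefix))  # MSRLPDP
```

The translated prefix matches the first seven residues of the listed protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be checked in the same way.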