Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4840 |
Symbol | |
ID | 5673181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5803125 |
End bp | 5804375 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243696 |
Product | oxidoreductase domain-containing protein |
Protein accession | YP_001509112 |
Protein GI | 158316604 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.364458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0440629 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCCC CGCTGACGGG CGCACTGGGC GCGCAGCTGG TGCCTGTTCA GTCGCGGGTG ACCCAGCCGC AGCTCCAGCC CCAGCCGCCG CAGTCGCCGC GGCCGGTTCG GCTGGCGCCG CCGCGGCCCG CGACCGGCGC CGTCCGGGTC GCCGTCATCG GGCTGGGCTG GGCCGGCAGG TCGATCTGGC TGCCCCGGCT GCGTGACCAC CCGCGGTTCG CGGTCGCCGC GGCGGTCGAC CTCGACGCGG ACGCCCGCGC CGCGGTTGCG GCGGACGGCG TCGACGCCCC GCTGCTTGCC AGCCCGGATC TGCTGAGCCC CGACGAGATC GACCTGGCGG TCGTCGCCGT GCCGAACCAC CTGCACAGTG TGGTCGGCGG CCGGTTGCTC GCCGCGGGCC TGCCGGTCTT CCTGGAGAAG CCGGTCTGCC TCAGCGGCGC GCAGGCGGAG GAACTGGCCC GTGCTGAGCG GGCAGGTGGG GCGGTCCTGC TCGCGGGCAG CGCCGCTCGC TGCCGGGCGG ACGTCCGGGC GCTCTACACC CTCGCGCGGG CCTGCGGGAT GATCCGGCAC GTCGATCTCG CCTGGGTACG TGCCCGGGGC GTGCCCGACG CGGGCGGCTG GTTCACCGAC ACCACGCGCG CCGGGGGCGG GGCGCTGCTC GACCTCGGCT GGCACCTGCT CGACACGGTC GCTCCACTGG TCGGGACGGC GGACTTCACC CAGGTCGTAG GCACGGTCTC CGCGGACTTC GTCCGCAGCG GGGCGGGGGG AGCCACCTGG CGCCATGCCG GCGGCCCCGG CTCGGGCCGG AGCGAAGCGG ACCGCGGCCG GCGCGGCGAC GTCGAGGACA CGGCCCGGGG GTTCCTGGTC ACCGAACGCG GTGTGTCGGT GTCACTGCGG GCGAGCTGGG CCTCCCACGA GGCGCTCGAC AGCACGGTGA TCAGAGTGGA GGGCAGCGCG GGAACCGCCA CGCTGACCTG CACCTTCGGC TTCAGCCCGA ACCGGCGCGA CGGATCGGTG CTGACGTACA CGCGGGACGG CGACACCGTC CGGGTGCCGG TGCCGTCCGA GCCGGTGGGC GCCGAGTACC GGCGCCAGCT CGACGAGCTC CCGGCCCTGC TCGCCGACCC GGGAGCGCGG GGACGCGCGG TCGCGGAGGC GCGGCGTGCC GTCGACGCTG TGGAGCGGTT CTACCGTTCG GCGCGCCCGC CAGGTGCGGC GGCGGGCGAC CACCGGTCGG GACGGATCTG A
|
Protein sequence | MSAPLTGALG AQLVPVQSRV TQPQLQPQPP QSPRPVRLAP PRPATGAVRV AVIGLGWAGR SIWLPRLRDH PRFAVAAAVD LDADARAAVA ADGVDAPLLA SPDLLSPDEI DLAVVAVPNH LHSVVGGRLL AAGLPVFLEK PVCLSGAQAE ELARAERAGG AVLLAGSAAR CRADVRALYT LARACGMIRH VDLAWVRARG VPDAGGWFTD TTRAGGGALL DLGWHLLDTV APLVGTADFT QVVGTVSADF VRSGAGGATW RHAGGPGSGR SEADRGRRGD VEDTARGFLV TERGVSVSLR ASWASHEALD STVIRVEGSA GTATLTCTFG FSPNRRDGSV LTYTRDGDTV RVPVPSEPVG AEYRRQLDEL PALLADPGAR GRAVAEARRA VDAVERFYRS ARPPGAAAGD HRSGRI
|
| |