Gene Information |
Locus tag | Franean1_2007 |
Symbol | |
ID | 5670408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2412929 |
End bp | 2413975 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240928 |
Product | hypothetical protein |
Protein accession | YP_001506350 |
Protein GI | 158313842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
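The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2412929
END_BP = 2413975


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 1047 348
```

Under those assumptions the result matches the listed Gene Length (1047 bp) and Protein Length (348 aa).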
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0837693 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCGA GTTGGGAACT TGTGCCCCCA CCGCCTGCGG AGCCGCCGCT GCACCCGCGG CGCGGCATAC ACGAGCCGAC CGCAGGCAGC CCCCCGCGCC GTCCCGGTTC GGTGCGGCGA ACGATCACAC TCGACTCGCT GCGCCCGGAC GGCCCGACCG GAAACCTGAT CCTGATCGGC CGCGGTCGGG ACCTGGCCAC CGGCCCCGAG GGCGCCGCCT CCGTGATCGC CGAGGCGATG TTCCGCTTCG AGATCGCCTT CCAGAGCCGT CGCGAAATCC TGCGCGTCGA GACTGCTCCA GAGGTCCCCG AACTGGCCGG CCTCATCGGC GTGAGCTCCA CATCCGGCTT CCGTCGTCAC CTGGCCGAGA TCGTGCCGGA GCTCCGGGCC ACCCATCCTC TGCTTGCCGC GTTGTTCGAC GACGCCCCGG TGACCTCGCT CGTCTCCGGC TACGCGACAA CTCGCATGGG CCTCGTTCAC GGCGATGGAC GGATCATGCT GGCGCAGGCC GACCAATGCG CAGGCTGGGC GCGCGGTGCG ACGATCATCA ACTCGATCGA GGACGGGAGC GGCCCACCCA TGGTCACCGG CCCACGGGCG CCGTCCCTGC TGGTGCCCGA TGATCCGATG GCATGGCATG CGCTGACCGG ACTGCCTCCG CACGGCATGC GGCGGGCCCG CCGCCTTGAC CTGCTGCCCG CCGGCGTCGA CGCACTGCAG GTGGACGTGC TTTTCCGGGA CAGCTACCAG TCCGACAGCG GACCTGAGGC CGTCATCCAC GAGTACACGG TGAACGCGCT GCTCGACCCG GAGACAATGA CGTTCACCGA GATCGTGTCG ACTCCGCGCG CGCTGCCGTG GGTCGAGTGC CCGAGCGCGG CCGCGAGCGC CGCCCGGCTG GCTGGCGCCG CAGCCGCAGA CGCCCGGCAG GCCGTGGGTA CAACCTTCCG CGGTACGTCC ACCTGCACGC ACCTGAACGA CACACTGCGA TCGCTCGGCG ATGTCCCCGC TCTGGCGGCC ATGCTGCCAG GGCGCTCGGC ACCGTAG
|
Protein sequence | MASSWELVPP PPAEPPLHPR RGIHEPTAGS PPRRPGSVRR TITLDSLRPD GPTGNLILIG RGRDLATGPE GAASVIAEAM FRFEIAFQSR REILRVETAP EVPELAGLIG VSSTSGFRRH LAEIVPELRA THPLLAALFD DAPVTSLVSG YATTRMGLVH GDGRIMLAQA DQCAGWARGA TIINSIEDGS GPPMVTGPRA PSLLVPDDPM AWHALTGLPP HGMRRARRLD LLPAGVDALQ VDVLFRDSYQ SDSGPEAVIH EYTVNALLDP ETMTFTEIVS TPRALPWVEC PSAAASAARL AGAAAADARQ AVGTTFRGTS TCTHLNDTLR SLGDVPALAA MLPGRSAP
|
| |
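The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used. It assumes the codon-to-amino-acid assignments of translation table 11 (which match the standard code; table 11 differs only in which codons may act as start codons), and it strips the spaces that separate the 10-base blocks shown above. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G order.
# Assumption: translation table 11 shares these assignments with the
# standard code and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    first_blocks = "ATGGCCTCGA GTTGGGAACT TGTGCCCCCA"
    print(translate(first_blocks))  # prints: MASSWELVPP
```

The first ten residues produced this way agree with the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.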