Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5996 |
Symbol | |
ID | 5674317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7313081 |
End bp | 7314448 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244844 |
Product | hypothetical protein |
Protein accession | YP_001510246 |
Protein GI | 158317738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0958307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0425607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCG ACGAGAACGG CCGCTGGGTC AGCGACGACG GCGCCTATGT CTGGGATGAC GCGGCCCAGA CCTGGCGGCT GGCGAATGCC ACGTCGCCGT CCGCCATGCC CGCGACCGGC TCGGGCCGGA CGGCCTCGGG CCAGACCGGT CCGGGCCACA CCGGTTCGGG CCAGGCGGGC CCGGGGCAGG CGGGCTCCGG CCAGACGGGT TGGACACCGC CGCACACGGG TCCGCTCGCC CCGAGCTCCG GCAGTTCCGC CGGCACCGGG GCGACCGGGC CGATCTACAC CGGGCAGGCG TACGCGGGGC CGGCCGACAC AGGCTCCTCC TACGGCGGCC GCCCCGGCTC CGGCTACCCG GACCCGGCAC GGTCAGATCC GGCACGGTCA GATCCGGCAT GGACGGGCCC GGTGCGGGCG GGCCAGGCGC CGGCCGCGCG GGAATCGGGC CGGCCCACAA CGACCGGCCC GTTTTCGACC GGCCCGTTTC CAGCTGGGCA GCTTCAGAGC GGGCCGATGG CCACCGGGCC GATGCCGACG GGGCCGATGC CGACCGCCCG GACAGCCACG GGCGCGTCCG GGGGCCGGGC AGACTCCGCC GTGGCGCCGA GCGATCCTGC GGAGAACACG ACGGGCATCC CTCGCCGGAT GACGCGGCCG CGGCGGCCCC GGCCGGCCGG GCAGCTCGAC GTGGCCGACA CCGATCTCGA CGACACCGAT CTTGACGACG CCGATCCCGA CTACACCGAC CTGGACGGCC CGCACTTGGA CGACGCCGAC CCGGACCGCT GGGCCGGTGA CGATGACGAG GATGAGGCGG GCCTCCGGGG CCGCGGTGCC AGGCCGGGCG ACGGCCACGG CCCCGAGGGC GGGGGAGCGC GGCGGTTCGC CAGCGGGCTG CGGGCGTCCG CGCTGGATCC CCGGGCGCGC TGGGGCGCGG CGGGGCCGGG CGGGACGTCC GACAACGGCG GCGGGCCCGA CGGCTACCCG GCGGCCGAGG ACCCCGCCCG CGGGATCGCC GCGATGCTGC GCCGGCCGAC CGTGCTCGCG GTGGCCACGG CGTTCGCTCT GGTAGTCGCG CTCGGCGTGG CCGGGTTCCT CGTTCTCGGC GGGGGTGACG ACAGCGGTTC GACCGCCTCC GCGTCGTCCG CCGGGGTGGG CAGGTACGAC GCGGCCGTCC GGAAGGACTA CATCGACGCC TGCCTCGGCG TGAGCGACGG CAACGAGCGC TACTGCACCT GCACGCTGGA GAAACTCGAG GCTGACTACA CCCAGGAGCA GTATCTGGCC TTCAACAGCG ACGTCGAGTC CGACAACTCC CAGCGCATCG TGCGCGAGAT TTACGCGGCC TGCCGCAATC TCCGGTGA
|
Protein sequence | MRLDENGRWV SDDGAYVWDD AAQTWRLANA TSPSAMPATG SGRTASGQTG PGHTGSGQAG PGQAGSGQTG WTPPHTGPLA PSSGSSAGTG ATGPIYTGQA YAGPADTGSS YGGRPGSGYP DPARSDPARS DPAWTGPVRA GQAPAARESG RPTTTGPFST GPFPAGQLQS GPMATGPMPT GPMPTARTAT GASGGRADSA VAPSDPAENT TGIPRRMTRP RRPRPAGQLD VADTDLDDTD LDDADPDYTD LDGPHLDDAD PDRWAGDDDE DEAGLRGRGA RPGDGHGPEG GGARRFASGL RASALDPRAR WGAAGPGGTS DNGGGPDGYP AAEDPARGIA AMLRRPTVLA VATAFALVVA LGVAGFLVLG GGDDSGSTAS ASSAGVGRYD AAVRKDYIDA CLGVSDGNER YCTCTLEKLE ADYTQEQYLA FNSDVESDNS QRIVREIYAA CRNLR
|
| |