Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0795 |
Symbol | |
ID | 5669211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 927208 |
End bp | 928461 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239723 |
Product | hypothetical protein |
Protein accession | YP_001505159 |
Protein GI | 158312651 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.578649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0307748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACATC GCCGTGGGGA TGCCGGCGAC CCAGGTTCTT CGTCCGGCGA GGGGCCACCG CGGGACTCGT CCGACGACCA CGCCGGCGGC GGCGCCACCG TCCCGACGCC GCGGGCCGGT GGCCGTCGCG GCCCGGGCTC CAAGTGGGCG GCGGATCCCG CCGGCGGCCA GCCGGGGCGC CGGCGCCGCG TCCTGCTGCT CGGGGGCATC GGCGCGGCGA TGGCCGTGGT CCTGCTGGCG GTCGGGATCG TCGTGCTGAC CGGTGGGGAG GACGAGCCCC CGACCACGCA TCCGACCAGC GGCGGCGTGA GCTGGGTCTC CGGCGCGAAC GCCAACCCGC CGAACGACCT CGCCGGCTGG GAGAAGTGGA TCGGTCGGCC CACCGACATC GCGATGGTCT TCACCGCCCG GACGAACTGG CAGACCATCA CCCAGGCCGA CTGGCCGATG TCCGACTTCC GCCCCGAGAC CTACCCGGGC AAGCTCTCCG TCGCCCAGCC GCTGTTCCCG CAGAACGGCA GCGAGGCGGC GTGCGCCCGC GGGGACTACG ACAGCCACTG GCGCGACTTC GGCACCACGA TGATCCGTAA CGGCCGGCCG GACGCCATCG TCCGCCTCGG CTGGGAGTTC AACGGCGACT GGTTCTGGTG GCATCCCCGC GACACCGTGA CCTGGAAGAC CTGCTTCCGG CGGGCGGTCA CCGCGATGCG GTCGACCGAC CCCCAGGTGA GGATCGACTG GAACATGACC GCGCACCGGG ACACCATGGT CAACGGCGAC AACGTCTGGG ACGCCTACCC CGGCGACGAC GTCGTCGACA TCATCAGCAT CGACGCCTAC GACTCCTACC CGGCGTCGCC CACGCAGAAG ATCTTCAACA GCCAGTGCAA CCGGGCCTCC GGCGCGTGCT CGGTGGCGGC GGCCGCGCGC AAGCGCGGCA AGCAGTTCGC CGTCCCTGAG TGGGGCCTGG TCCGCTCCAC CGGCGGCGGC GGCGACAACC CGTTCTACGT GGAGAAGATG TACGAGCTGT TCGTCGCCAA CCGCGACATC CTGGCCTACG AGGGCTACTA CAGCGCCTCA CCGGCCGAGA CCGAGAACGT CGGCTCGTCG CTGCACAACC CGACCGCGAA CCCGGAGAGC GCCAAGCGGT ACCTCGAGCT GTTCGGGCCA CGGGTGAGCG CCGCGCAGGC CGGCGCCGGG ACCTCGACCC CGGCCGGAGC CGCCCCCCTG GCCGGCGACC CCGGCTCCTC CTGA
|
Protein sequence | MEHRRGDAGD PGSSSGEGPP RDSSDDHAGG GATVPTPRAG GRRGPGSKWA ADPAGGQPGR RRRVLLLGGI GAAMAVVLLA VGIVVLTGGE DEPPTTHPTS GGVSWVSGAN ANPPNDLAGW EKWIGRPTDI AMVFTARTNW QTITQADWPM SDFRPETYPG KLSVAQPLFP QNGSEAACAR GDYDSHWRDF GTTMIRNGRP DAIVRLGWEF NGDWFWWHPR DTVTWKTCFR RAVTAMRSTD PQVRIDWNMT AHRDTMVNGD NVWDAYPGDD VVDIISIDAY DSYPASPTQK IFNSQCNRAS GACSVAAAAR KRGKQFAVPE WGLVRSTGGG GDNPFYVEKM YELFVANRDI LAYEGYYSAS PAETENVGSS LHNPTANPES AKRYLELFGP RVSAAQAGAG TSTPAGAAPL AGDPGSS
|
| |