Gene Information |
Locus tag | Franean1_0002 |
Symbol | |
ID | 5668429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2937 |
End bp | 3959 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641238930 |
Product | hypothetical protein |
Protein accession | YP_001504377 |
Protein GI | 158311869 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
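The length fields above follow from the coordinates. As a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon (the function names here are hypothetical, not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminating stop codon


length = gene_length(2937, 3959)
print(length, protein_length(length))  # prints "1023 340"
```

Both values agree with the Gene Length and Protein Length rows of this record.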
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.284088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGAAC GTGTGTACAG CCTGTGTGCA TCTGAGGGTC ACCTGTGGAA TCGATCAAGC CCTGGGGATA AACCCCTTGA TCATCCACAT CGGCACCCCC AGGATGCGCA CAGGGCCTCA GTGGGTCTGA CCTGGGAAAA CGGCCGTTTT CCACAGTTTC CACAGCCCCT ACTACTACTA CGAACTAGAT CTTCTTCAGA GAGATCAAAA CCTCGAAGCA GTAGGAGCTG GGGACAACCT CGAATTTCGA TCATGGTGTG GCCTGGCTGC CCGAAGCGAG CGCACGGAGC GATGCGGTCG CTGCCCTCGG TCAGCCGCCT CCCCGTCGGG GCATCCCGCG AAACCTCGAC CGCCGTCCGG GAGTCGTCCC TCCGCTGCGC GGCACGTGGA CGGATCACCG ACGCGGGTCC GCACGACCAG CGCCACCGAT GCGACCGGTG GCATCAGCGC GACCGGTGTG GCCCCTGGCT CCGTCGGTGC GACCGGCGCA CGCCAGATCC CCCGCGCGGC TGGCGCTCGT CTAGTCACCG ATGCGGCAGG CGCCTCCAAC GCGGCACGCG CCAGCCGGTG ATCAGGCGAG GACGGCGATC ACGACGCATG CGAAGCGAAG TCCTCATGCG GTCCTGCGGC ACGGTGGTTT CGGGCTCCTG CGGCGGCGCG GTGGTCCCTG GCGAGATGCG GAAGATCACC CCGGACACCC CGGACGCGGG TGACCTTCAG GCGCCACCGC GACAGTCAGC CGTCCGGTCT GCTCGCCACG GATCTTCGAT CGGCCGGTGC AGGGGGTGTG CCCGGTCGTC GAGCCCGGTG TGCCACCCTC GGAACACAGA CCACCCGCCG CCCGCTCGTG GCGCTCTGGG ATGCTGGATC GGGTCAGGCT GGTGCTCGAC TGCCCGCTCC CTGCCGACGA GGCGGGTGGC GCGCGTCAGG TGTTCGGCTC GGCGTAGCTA TGCCGGTGAT GTCAAGTTCC GCGAGCGGCC GTGGGAACGG CCACCGCGGG ACGACGGGAG GACATGTCCA TGA
|
Protein sequence | MDERVYSLCA SEGHLWNRSS PGDKPLDHPH RHPQDAHRAS VGLTWENGRF PQFPQPLLLL RTRSSSERSK PRSSRSWGQP RISIMVWPGC PKRAHGAMRS LPSVSRLPVG ASRETSTAVR ESSLRCAARG RITDAGPHDQ RHRCDRWHQR DRCGPWLRRC DRRTPDPPRG WRSSSHRCGR RLQRGTRQPV IRRGRRSRRM RSEVLMRSCG TVVSGSCGGA VVPGEMRKIT PDTPDAGDLQ APPRQSAVRS ARHGSSIGRC RGCARSSSPV CHPRNTDHPP PARGALGCWI GSGWCSTARS LPTRRVARVR CSARRSYAGD VKFRERPWER PPRDDGRTCP
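The sequence fields can be cross-checked against the rows above with a short script. The following is a sketch under stated assumptions: the helper names are hypothetical, the 10-character grouping is treated as display-only whitespace, and the first codon is always read as M. Translation table 11 uses the same amino-acid assignments as the standard code but permits alternative start codons such as GTG, which is how this gene begins.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
# Table 11 shares these assignments with the standard code.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading the start codon as M (assumption) and
    stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
prefix = "GTGGATGAAC GTGTGTACAG CCTGTGTGCA"
print(translate(prefix))  # prints "MDERVYSLCA"
```

Passing the full Gene sequence value to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content rows.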