Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0235 |
Symbol | |
ID | 5668660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 287881 |
End bp | 289170 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239164 |
Product | hypothetical protein |
Protein accession | YP_001504608 |
Protein GI | 158312100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.27533 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA GCACCAGGCC GGCGACGCCC CGGGCCGGGC GGCCCGGGTT CTCCCAGCAC ACGGAATACC TGCGCGCGCG AGAGAACCAG GCGATGGTCC GACGCGACCG CCTGGCGCGC GAGGTGCCGC GCGCGGCCAT CCGCGCCGCG GTGGTCGGCA TGGGCCTCGG CCTGGTCATC GGGTTCGGGC TCGGGCTGCC CGGTCTCGGC GTGGCCGTCT TCGTGGTCCT GCTCATCGTC TGGCCCGGTG GCATCGCCGT GGCCGCCTTC GGTGTCTCAC CGGACGTCGA GACCCTGCGC GAGGCGGCCG AGGCTGAGCG CAAGACCGCC CGCGCGATCT CCCGGCTCCG CCGACACGGC TATGTGATCA TGCACGACCG GGCCGTCCCC TACTCGCAGG CCACAATCGG GCACCTGCTG ATCGGTCCCG GCGGCGTCAT GATCCTCGGC AGCGACACCA ACAAGGGCAT CGTCCGCTAC GCCAAGGGCG GCGCCATGGT GGACGGCGAG TCGCTCAAGC CCGCGATCGA CAAGACCTCA TGGCTCGGCG GCGAGGTGCG CAACCAGGTC CGCGCCGCCC TGCCCACCAC GAAGATCCCG GTCTACCCGG TCCTCGTGAT GGTCGAGGCG AGCGTCCTGT GGAGCGACGG CGCGCTGGAC GGCGTCACGA TCATCAGCGT CAAGGATGTC GTCAAGTACG TCCGGAGCAA GCCCGGGCGG CTCAACCCCG GGCAGGTCCA GCAGGTCCTC GCCGCCGCCC AGCGGCTCTT CCCGCCGTAC TCCTCCAACC GGCTCGCCGA GCACGTCGTC GTCGACCGCG ACCAGTGGCT CACCCTGATG GACGCCCTGC GCACAATCCG CGAGCGCGGC GGCGACGCCT CCGAGATGCT CGAGCGCCTC GCCCAGATCG AAGCCGACCT CGGCCGCCAG GCCGATCTCA TCGACCGCGC CGGCATGCCC CTCGCCCGGG CCGCCGACCA GCCCGACGGC CCGACCGACA GCCCACCGCC CGCTTCCGGG ACGGACACGG CGACCGACGC GATCGGCCTG CTGGACGTGG ACGGAACAGG CACCGCCAAG TCCCTGGAGG GACCCCCGCG GGCCCGCCCC GGCGAGGGCC GGCGCGGCCG CATCCTGGCC GCCGTCCGCC AGCCACGGGG CAGCGAGTCC ATCAGCACGT CGAGCCGGCC CCCCGGGGGC GACGGCCCGA CGACGGCAAA GGGCGACCAG CCCCCCGCCC CCGGCGACGA CCGCGCCCAC CCGACCTCCG GGCCCGGATC CGGGTCGTAG
|
Protein sequence | MATSTRPATP RAGRPGFSQH TEYLRARENQ AMVRRDRLAR EVPRAAIRAA VVGMGLGLVI GFGLGLPGLG VAVFVVLLIV WPGGIAVAAF GVSPDVETLR EAAEAERKTA RAISRLRRHG YVIMHDRAVP YSQATIGHLL IGPGGVMILG SDTNKGIVRY AKGGAMVDGE SLKPAIDKTS WLGGEVRNQV RAALPTTKIP VYPVLVMVEA SVLWSDGALD GVTIISVKDV VKYVRSKPGR LNPGQVQQVL AAAQRLFPPY SSNRLAEHVV VDRDQWLTLM DALRTIRERG GDASEMLERL AQIEADLGRQ ADLIDRAGMP LARAADQPDG PTDSPPPASG TDTATDAIGL LDVDGTGTAK SLEGPPRARP GEGRRGRILA AVRQPRGSES ISTSSRPPGG DGPTTAKGDQ PPAPGDDRAH PTSGPGSGS
|
| |