Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5852 |
Symbol | |
ID | 5674175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7099123 |
End bp | 7100133 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244702 |
Product | hypothetical protein |
Protein accession | YP_001510104 |
Protein GI | 158317596 |
COG category | [S] Function unknown |
COG ID | [COG1300] Uncharacterized membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.699479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCG ATGCGTACGT CGCCGTCCAC CGTCCGGAGT GGATCCGGCT GGGTCAGCTC GTCGACCGCG CCGGCCGGCC CCGGCGGATG TCTGCGGACG AGCTCGACGA GCTGGTCGAG CTGTATCAGC GCGCCGCCAC TCACCTGTCG GTGATCCGTG ACCGTTCCCA CGATCCGAAT CTGGTCGATG ATCTCGCGAG TCTGGTCACC CGGGGCCGGG CCGCCGTCGT CGGCGCGCCG GACCAGGGCT GGCATGTCGT GGGCAGGTTC TTCGCCGTCA CGTTCCCCGC CGCGGTCTAC GCGCGCCGTT ACTGGGTGAT CGCCACGACG GTGGTGTCGC TGCTGGTCGC GCTCGCGTTC GCACTCTGGA TCATGAACAG TGCGGACGCC CGGGCGAATC TCGTCCCGCC CGACGAGGTG GCCGACCTGT GCCGGTCGGA CTTCGCCAGC TACTACACCG AGAACCCGGC CTCGTCGTTC GCCAGCCAGG TCTGGACCAA CAACGCCTGG GTGTCGGCGC AGGCGGTGGC GTTCGGTGTC CTGTTCGGCG TGCCGACGCT GTTCGTCCTG CTGCTGAACT CGGTGAACCT GGGCGTCGTC GGCGGCTACA TGGGCAGCTG CGGTGAGGGC GGGCAGTTTT TCTCTCTCAT CCTGCCGCAC GGGATGCTGG AGCTCACCGT GGTCTTCGTG GCCGGCGCGG TCGGCCTGCG GCTGGGATGG TCGATCATCG CTCCGGGGCC TCGGCGCCGG GTCGAAGCAC TGGCCGCCGA GGGGAGGGCC GCGATCGGGA TCGTCCTCGG GATGGCGGTC GTGCTCGCCG TCTCCGGGGT CATCGAGGCG TTCGTGACGC CCTCGTCGCT GCCCACCGCG ATCCGGATCG GGATCGGCGC GCTGGCCTGG GCCGGTTTCG TCGCCTACGT GTGGATCTAT GGGAGCCGTG CGGTCGCCGC CGGAGAACGC GGCGATCTCG ACGAGTCCCT GGCCGCGGAT CTCGTGCCCG TAGCGCCTTG A
|
Protein sequence | MDLDAYVAVH RPEWIRLGQL VDRAGRPRRM SADELDELVE LYQRAATHLS VIRDRSHDPN LVDDLASLVT RGRAAVVGAP DQGWHVVGRF FAVTFPAAVY ARRYWVIATT VVSLLVALAF ALWIMNSADA RANLVPPDEV ADLCRSDFAS YYTENPASSF ASQVWTNNAW VSAQAVAFGV LFGVPTLFVL LLNSVNLGVV GGYMGSCGEG GQFFSLILPH GMLELTVVFV AGAVGLRLGW SIIAPGPRRR VEALAAEGRA AIGIVLGMAV VLAVSGVIEA FVTPSSLPTA IRIGIGALAW AGFVAYVWIY GSRAVAAGER GDLDESLAAD LVPVAP
|
| |