Gene Information |
Locus tag | Franean1_1953 |
Symbol | |
ID | 5670354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2348991 |
End bp | 2350037 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240874 |
Product | hypothetical protein |
Protein accession | YP_001506296 |
Protein GI | 158313788 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
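The two length fields above can be rechecked from the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (the reading that agrees with the listed 1047 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Coordinates copied from the record above.
start_bp = 2348991
end_bp = 2350037

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1047 348
```

Both values agree with the Gene Length and Protein Length fields listed in the record.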
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.098143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.528904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACG TCCTCGCGCT GGCCGTAACC GTGCTGCTGC TCGCCGGCAA CGCCTTCTTC GTCGGGGCCG AGTTCGCGAT CATCTCCGCC CGCCGGGACA CCATCGAGCC GATGGCGCTC GCCGGTTCCC GGGCGGCGAA GGTGACCCTG AAGGCGATGG AGAACGTGTC GCTGATGCTG GCCGGAGCCC AGCTCGGCAT CACGGTCTGC ACGCTCGGTC TGGGCGCGCT GAGCGAGCCG GCCATCGCGC ACCTGTTGGA GGGGCCGTTC GAGGCCGTGG GCCTGCCGCT GTCGCTGCGC CATCCGGTGG CGTTCGCGAT CGCGCTCGCC GCCGTCACCT ACCTGCACGT GGTGATCGGT GAGATGGTCC CGAAGAACAT CGCGCTGGCC ATGCCGGACC GGGCGGTCCT GCTGATGGCC CCGCCGCTGG TCGCGGTCGT CCGGGTGGTG AAGCCGGTGA TCTCGATCCT CAACCGGATC GCGAACCTCT CCCTGCGGGC GGCTCGGGTC GAGCCCAAGG ATGAGGTGAC CAACGTCTAC ACGCGCGACG AGGTGGCCGG GCTCATCGAG GAATCACACC GCGAGGGCCT GCTGGCGGAG GACGAGCACG ACCTGCTGAC CGGCGCGCTG TCGTTCGACG AGCGCACCGC GCGCAGCGTC CTGCTCCGCC CGGACAGCCT GGTCACCGTG CCGCCGTCCA TCACGCCCCG CGAGGTCGAG CGGCTCGCGG CCGACACGGG CTTCACCCGG TTCCCGGTCC GCGGGGACGA CGGTGACCTC GCCGGCTACC TGCACCTCAA GGATGTCCTG GAGAACCGCG AGGACCGACG TTCGGCCCCG GTGGCGGCCA AGTGGATCCG GCCGCTGGTC CGCGTCGGGG CGGACGACAG CCTGCGCACG GCGCTGGCCA CCATGCAGCA CTCGGGATCG CACCTGGCCC GGCTCTCCGA CGGCGAGGGC CGGATCCTCG GCCTGGTGGC GCTGGAGGAC ATCCTCGAGG AGCTGGTGGG CGAGATCCGC GACGAGGCGA CCCGTCAGCG CGCCTGA
|
Protein sequence | MNDVLALAVT VLLLAGNAFF VGAEFAIISA RRDTIEPMAL AGSRAAKVTL KAMENVSLML AGAQLGITVC TLGLGALSEP AIAHLLEGPF EAVGLPLSLR HPVAFAIALA AVTYLHVVIG EMVPKNIALA MPDRAVLLMA PPLVAVVRVV KPVISILNRI ANLSLRAARV EPKDEVTNVY TRDEVAGLIE ESHREGLLAE DEHDLLTGAL SFDERTARSV LLRPDSLVTV PPSITPREVE RLAADTGFTR FPVRGDDGDL AGYLHLKDVL ENREDRRSAP VAAKWIRPLV RVGADDSLRT ALATMQHSGS HLARLSDGEG RILGLVALED ILEELVGEIR DEATRQRA
|
| |
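The sequence fields are printed in blocks of ten characters, so they need cleaning before any check against the GC content or protein sequence fields. The sketch below is a hedged illustration, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it starting with ATG and ending with TGA.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spacing between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First and last blocks of the gene sequence, copied from the record above.
first_blocks = "ATGAACGACG TCCTCGCGCT GGCCGTAACC"
last_blocks = "GACGAGGCGA CCCGTCAGCG CGCCTGA"

print(translate(first_blocks))  # prints: MNDVLALAVT
print(translate(last_blocks))   # prints: DEATRQRA
```

The two fragments translate to the first ten and last eight residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.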