Gene Information |
Locus tag | Franean1_1580 |
Symbol | |
ID | 5669983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1886544 |
End bp | 1888382 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240499 |
Product | hypothetical protein |
Protein accession | YP_001505925 |
Protein GI | 158313417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
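The coordinate and length fields above are consistent with each other, and the following minimal sketch shows the arithmetic. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither is stated in the record, but both agree with the reported values. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1886544, 1888382)
print(length)                  # 1839
print(protein_length(length))  # 612
```

Both results match the Gene Length (1839 bp) and Protein Length (612 aa) fields.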
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0225613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGGC CAGGGCCGAT CCCACCGCCG AGACCGGGCC CGCCGCCCGC ACCGCAGCCG TTCTTCGTAC CTCCATTCCC CGCCGGCGAG GCCGAGTCGC CTGTGCTGCG CGCGCTCCGC GGGCTCGGCC CGCTGCTGGC CCAACGCCTG CTCGGCGCAC TGGAGCTGAC GCTCGACGCG CTCGAGCTGG TCGGCGTCCC GACCGGGCTG GCGGCCAGCG TCCCGGCGCC CTCCGGTAGC CATCTGGTCA CGTCCGCCAT GGCCGCCGGC GGCGGCGAGC TGGAAGCCGC CGTGATCATG CTCGACCAGG CCCGGTGGGG CGCGAGCACG CTCACCTGCG ATCTGGTGCG AGCGCTGTGC TCCCACCCGC TGGTCGCCCC ACTGCTCGCG GCGGAGGTGG TGGGTGTCGC CGCTGCCGGA GCCGAGGTTC CCGGCGGCGG CGAAGTCGGC GTGACGCGGG AGGTGCCTGC GGGTGGGGAG GCAGCGGCGG CCAGGGAGGC AGCCGCGGTC AGGCAGGCCG TAGTGGAGGA GGCGGAGGCG GTCACGGCCG AGGAGGCGGT CGCGGCGGGC CACGGCGCGG CGCACCTCGC CCTCGCGGTG GCGGTCTCCG TCGCGGTGCT GCGCGAGCTC GGCCTGCCGG CGGTGCCGGC CCGGCCCCCT GCTGTCGTCG GTCTGGCGCT CGGCGCGGCC GCCCACGTTC TCGCCGAGGC GCCAGTGCCG GCCGCGTACG CACCCGCGGC GCTGGCGCGT CGCCGGGCGG AGTACCGCCT TCCGCGCGGT TCCGTCGGGC GGGTCAGCGT GGCCGGCCAC TGCTTCGCCC TCGCCGAGTC CGGCTCTCCG GCGCTGGGAT CGGCGGCGCC CGCGGCGTTC GCCGCGAACG GACTGGTCGA GACCATGCCC GGCGGTGCCC TGATCCGCAC CGGGACGGCG GCCGGCTCGG CGCGTGTCCG CCTGGCGATT CTCCAGGGCC CCCCGGCCGA GGTCGAGCTC GACGGATGGG ACGAGGTGGT GGAGGTCAGC TGGACAGCAG CGGCCGGATC CGCCTCCGTC GTCGGAGCAG GCCCGGCGAT GCCCGCGGTG GAAGGCGAGA CCCCGCCGTG GCCGGGCACG TACCGGCTGC GTGTGCACGC CCGCGACCGG GACGACGGCG AAGAGTACGA AGGCGACCTC CGCGGCGGCG AGGGCTACCT GCTCGTCGTG TGGGAGGCGC CGGCCGCGCC GGAAGTCGTG TACAGGCGGT CCGATCAGTT GGGGCACCGG CTACGCGGCG AGCCGCCGCC CGCGGCCCGG CCGGAGGCCG CCTACCGCTG GGTGCGCCAC AGCCCGATCG GGGAGGCAGC CACCGTCACC GTCGTCCCCG GGGCCGACGT CGCGCAGGTG CTCCGCGCGT TCGGCGCCGA CGCCGCCGAG CCGGTGTCCA TGGCCGAGAT GCGCGAGACA TGGCAGGCCG GCCGGTTCTG GGTGGCCGTC CTGGCCGTGG AAGGTGCCGT GCTGGCGGTC GAGGACAACG GCTTCCAGGG CACCCAGGCG CAGGTGCTGC GGGCGCTGTC CCGGCGGCGC CGGGCAGCCA GCATGTTCTG GAACGTCAAT GCGGTCACCC GGCTGTCGCT CGCGGAGCAT GGCAGGATCG TCGCCTCCTT CGAGCCCGGG CTCGACCGGG CGGCCGAGAT CGCCCCGGGC GCGCTGCCGT ATCTGGAAGG CATGGACCTC ACCGACCATC GGCACAAGGT CGAGAAGTGT CTGGTCGCCG TCGAGCGGTT CACCCACTGC CCGGTGCGGC CGGTGGACAT CGAGCGGGTC 
GAGGCGGCCG GCGTGGCCTA CCTGATGCCC GGCAGGTGA
|
Protein sequence | MSGPGPIPPP RPGPPPAPQP FFVPPFPAGE AESPVLRALR GLGPLLAQRL LGALELTLDA LELVGVPTGL AASVPAPSGS HLVTSAMAAG GGELEAAVIM LDQARWGAST LTCDLVRALC SHPLVAPLLA AEVVGVAAAG AEVPGGGEVG VTREVPAGGE AAAAREAAAV RQAVVEEAEA VTAEEAVAAG HGAAHLALAV AVSVAVLREL GLPAVPARPP AVVGLALGAA AHVLAEAPVP AAYAPAALAR RRAEYRLPRG SVGRVSVAGH CFALAESGSP ALGSAAPAAF AANGLVETMP GGALIRTGTA AGSARVRLAI LQGPPAEVEL DGWDEVVEVS WTAAAGSASV VGAGPAMPAV EGETPPWPGT YRLRVHARDR DDGEEYEGDL RGGEGYLLVV WEAPAAPEVV YRRSDQLGHR LRGEPPPAAR PEAAYRWVRH SPIGEAATVT VVPGADVAQV LRAFGADAAE PVSMAEMRET WQAGRFWVAV LAVEGAVLAV EDNGFQGTQA QVLRALSRRR RAASMFWNVN AVTRLSLAEH GRIVASFEPG LDRAAEIAPG ALPYLEGMDL TDHRHKVEKC LVAVERFTHC PVRPVDIERV EAAGVAYLMP GR
|
| |
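The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The sketch below strips the whitespace and computes the G+C fraction, shown here on the first two blocks of the gene sequence only. The helper names are hypothetical, and this is not necessarily how the database derives its GC content field.

```python
def normalize(seq: str) -> str:
    """Remove block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First two 10-base blocks of the gene sequence in the record above.
sample = "ATGTCCGGGC CAGGGCCGAT"
print(len(normalize(sample)))  # 20
print(gc_fraction(sample))     # 0.7
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.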