Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0851 |
Symbol | |
ID | 5669267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 997386 |
End bp | 998636 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239780 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001505215 |
Protein GI | 158312707 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.426804 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGAAC AGCCGGCCGC ACGGGTCCTC ATCGTCGAGC AGGGTGAGGG GCTGTGGGGC GCCCAGCGCT TCCTGCTGAG GCTCGCCCCG CTGCTGGAGC GACGCGGGAT CGAGCAGATT CTCGCCGCCC CGGAGGACAG CGCGACGGGC GCGGCCTGGC GGGCGTCCGG GCGGCACCAC GCCGTCCTGC CAGTGCCGGC GGACCGGAGG TTGCGCCGCC CGGACGGCCG CCCGAGCCCC GCGCTGGTAC TGCGCGAATC CGGCCGGACG GCGGTCATGG CGGCCCGGAC GGCCCGGCTC GCCCGGCGGT TCGGCGTCGA CGTCCTGCAG GCGAACAGCC GCTGGTCGCA TCTGGAGGCC GTCGGGGCCT CGGCGCTGTG CCGGCGGCCC GCGTTGCTGC TGCTGCACGA GGAGAACGAG CCGGACCTGG TCGGCCGGCT GCGCGGGCTG GCCGTCCGGG GGGCCGCGCG GTCCGTGGCG GTGAGCGGCG CGGTCGCGGC GTCGCTGCCG GGGTGGGCCG CCCGACGCGC GGTGGTGATC CGCAACGGGG TCGACACCGA CGCGCTGCGC CCCGGCCCGG CGGACCCGGC CGTGCGGGCC AGCCTGTCGA CGGACCCGGC GGCGCCGCTG GTCCTCGCGA TGTCCCGCCT GGACCCCCGC AAAGGCGTCG ACAAGGTGAT CCGTGCGGTG GCCGCGCTGC CGGACCACCT GAAGTCCACG CGGCTGGCGG TCGCGGGCGC GCCCAGCCTC GACCCGGCGT CCGGGGAGTC GCTGCGCCGG CTCGGCGCCG AACTGCTCGG TGACCGGGTG CTGTTCCTGG GGCCGCGCTC GGACATCGGC GACCTGTTGC GCGCCACCGA TGTCCTGGTC CTCGCGTCGA GCCTGGAGGG GCTGCCGCTG AACGTGCTGG AGGCGCAGGC GTGCGGGCGG CCGGTGGTGG CGTTCCCGAC CGCGGGCATC CCGGAGATCG TGACCGACGG AGCGACCGGC CTGATCGCCC GCCAGGACGA CGTGGCCGAC CTCAGCGCGA AGCTCGCCCG GGTGCTCGAC GACCAGACGC TGGCCGCTCT GCTCGGCGCC CGCGCGCGGG CGAGCGTCGT CGCCCACCAC ACACTGGACG CGCAGGCGGA CGCGCTGGGC GGCCTGCTGA TCAGCCTCGC CGGGCAGGCC CGCGCGCGGA GGCACCGGTC AGCCGGCCAC GACACCGCAC ACCATGAGGT GCATCACACC ACCGCTGGAC GCAGGTCGTA G
|
Protein sequence | MEEQPAARVL IVEQGEGLWG AQRFLLRLAP LLERRGIEQI LAAPEDSATG AAWRASGRHH AVLPVPADRR LRRPDGRPSP ALVLRESGRT AVMAARTARL ARRFGVDVLQ ANSRWSHLEA VGASALCRRP ALLLLHEENE PDLVGRLRGL AVRGAARSVA VSGAVAASLP GWAARRAVVI RNGVDTDALR PGPADPAVRA SLSTDPAAPL VLAMSRLDPR KGVDKVIRAV AALPDHLKST RLAVAGAPSL DPASGESLRR LGAELLGDRV LFLGPRSDIG DLLRATDVLV LASSLEGLPL NVLEAQACGR PVVAFPTAGI PEIVTDGATG LIARQDDVAD LSAKLARVLD DQTLAALLGA RARASVVAHH TLDAQADALG GLLISLAGQA RARRHRSAGH DTAHHEVHHT TAGRRS
|
| |