Gene Information |
Locus tag | Franean1_5123 |
Symbol | |
ID | 5673457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6137155 |
End bp | 6138480 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243973 |
Product | glycosyl transferase family protein |
Protein accession | YP_001509387 |
Protein GI | 158316879 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
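The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The record states neither convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length_bp = gene_length(6137155, 6138480)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Under these assumptions the computed values agree with the listed Gene Length (1326 bp) and Protein Length (441 aa).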
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.354278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0746579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGGG ACAGACTGGC AACGGCGCTG GTGCGTTCCG GGGCGGCCGG AGCCGTCGCC GTCGCGGCGC ACACGGCGGT GAACGCGGCA CTGCTGCGCG TCCCGGCGCC GGCCGTCCCG GTGCGGGAGC GGGTCAGCGT GATCCTGCCC GTGCGGGACG AGGCGGCGCG GGTGCGCACC TGCCTGACTG CTCTGCTCGG CTCCCGCGAC GTCGCCGACC TCGAGGTGAT CGTCTACGAC GACGGCTCGA CGGACGGCAC GGGCATGATC CTGCGCCGGC TCGCCGAGCG GGACCGGCGC CTGCGGGTGC TGACCGGGCC GGAGCCGCCG GACGGCTGGC TGGGCAAGCC CCACGCCTGC GCCCGGGCGA CGCGCCAGGC CACCGGCACC GTGCTGGTCT TCGTCGACGC CGACGTCGCC CTCGCCCCGG ACGGCCTCGC CCGGGCCGTG GGCCTGCTGC GCGGATCGGG CCTGGACCTC GTCTCGCCCT ACCCGCGGCA GGTCGCCGTC GGGGCGGCCG AACGGCTCGT GCAGCCGCTG TTGCAATGGT CGTGGCTGGC GCTGCTCCCG CTGCGCGCCG CCGAGAGCTC CGCCCGCCCG TCGTTGGCCG CGGCCAACGG CCAGTTCCTC TGCGTGGACG CGGCGGCCTA CCGCCGGGCC GGCGGGCACG GCGCGGTCGG CGGCGCCGTA CTGGACGACA TCGAGCTGCT GCGCGCGGTC AAGCGCTCCG GCGGGCGCGG TGTGGTCGCC GACGGCACCG AGCTGGCGGT CACCTGGATG TACGACGGCT GGCAGCCACT GCGGGACGGC TACGCCAAGT CGCTGTGGGC CGCGGGCGGC ACCCCGGCGG CCAGCGTGGG TCAGCTGGCC GTGCTCGGAT GGCTCTTCGT CGGCCCGGCG GTCGCCGCGG CGCGCGGGTC ACGCGCGGGG CTCGTCGGCC TGCTGGCCGG CACGGTGAGC CGGCTGATCG CGGCCCGGCG CACCGGCGGC CGCGCCTGGC CGGACGCGGC GGCCCATCCG GTCTCTGTCT GCCTGCTCGG CTACCTGACG GTGCTGTCCT GGTGGCGGCA CCGGCACGGC ACCATCCGTT GGAAAGGCCG CGCGCTGAAC GGGCCACCGA CCGGGAGCGG ACGACCGCGC ATAGGCTCGG AGCCGTGGCG ACGGTCGTCG TGGTCGGGGC GGGCGTCGGC GGGCTCGCCG CCGCCGCTCG GCTCGCCGCC GCCGGGCACC GGGTCACCGT CTGCGAGGCG GCGGAGCGGA TCGGCGGCAA GCTCGGCTGG TACGAACGCG ACGGCTACGG GTTCGACACC GGCCCGTCCC TGCTGA
|
Protein sequence | MNGDRLATAL VRSGAAGAVA VAAHTAVNAA LLRVPAPAVP VRERVSVILP VRDEAARVRT CLTALLGSRD VADLEVIVYD DGSTDGTGMI LRRLAERDRR LRVLTGPEPP DGWLGKPHAC ARATRQATGT VLVFVDADVA LAPDGLARAV GLLRGSGLDL VSPYPRQVAV GAAERLVQPL LQWSWLALLP LRAAESSARP SLAAANGQFL CVDAAAYRRA GGHGAVGGAV LDDIELLRAV KRSGGRGVVA DGTELAVTWM YDGWQPLRDG YAKSLWAAGG TPAASVGQLA VLGWLFVGPA VAAARGSRAG LVGLLAGTVS RLIAARRTGG RAWPDAAAHP VSVCLLGYLT VLSWWRHRHG TIRWKGRALN GPPTGSGRPR IGSEPWRRSS WSGRASAGSP PPLGSPPPGT GSPSARRRSG SAASSAGTNA TATGSTPARP C
|
| |
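The sequences above are shown in space-separated blocks of 10 characters, so the spaces need to be stripped before any analysis. The following is a minimal Python sketch, not part of the original record, that cleans a sequence, translates it with translation table 11, and computes GC content. The helper names are hypothetical. It assumes the first codon is a valid initiation codon and is therefore read as M (as GTG is here), and it stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; the first codon is assumed to be a start and read as M."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence in the record.
prefix = "GTGAACGGGG ACAGACTGGC AACGGCGCTG"
print(translate_cds(prefix))  # MNGDRLATAL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the complete protein sequence and the listed GC content to be compared in the same way.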