Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0786 |
Symbol | |
ID | 5669202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 912113 |
End bp | 913150 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239714 |
Product | glycosyl transferase family protein |
Protein accession | YP_001505150 |
Protein GI | 158312642 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0460061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.237559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGACA TCGCACCCGA CTCGGTTGCC GACCAGGCGA CACCTGCCGG CTCCACCCTG GAGAGACCAG CTCCGGACAG CCCAGCTCCG GACGGCCCCG CCGCCGCGCC CGCCAGGGTG CCGTCCGTGC CCGCCGTGCG GCCGGCCGCG GACGGCTGGT CGTTCGAGCT CCAGCTCGTG AGCTTCCACA GCCGTGACCA GCTCGAGCAG ATGTTCGCGA CGCTGCCGAT CGACATGCCC GTCGTCGTCG TCGACAACGC CAGTGGCGTC GACCGGGTGG ACGAGCTCCT CGCCGACCGG CCGAACGGCC GCTACATCGA TTCCGGTGGC GGCAAGGGCT TCGCGAAGGC GTCCAACATG GGCATCCGCT CGTCCGCGTA CGACTACGTG GTGCTGGGCA ACCCGGACAG CCGGCCGACC GTCGAGGTCA TCAGGACGCT CGTCGCCGAC CTTGAGGGCG ACCCCGGCCT GGTCGTGAGC GCGGCGACCA TGAAGGGGCA GGACGACAAG CCCGAGCTCG GCAACGGCGG CTGGGAGCCG ACCCCCCGTC GGGTGCTCAT GCACGTCCTG GGAGCCCACA AGATCGCCCC GTCGTCGGCG CTGTTCGCCC GTCCGACGCC GAACCGGCCG ATGAGCCCCG AGTGGCTGAC CGGGGCCTGC ATGGCCGTCC GCCGGCAGTC GTTCCTCGAG CTCGGCGGCT TCGACGAGAC GTTCTACGTC TACAACGAGG ACATGGCGCT GGGCCGGGCG ATCCGCGAGG CGGGGATGCG CCAGCAGCTG CGCACCGACC TGCTCGTCCC GCACGGCGCC GGCGGCTCCG GGGCCGGCAA GACGTGGATG CTGCAGATGC GCGGCGCCTC GATGGTCCGC TACCTGCGCA AGCACAATGC CCCGGCGCGG GTGAACGTGA TGCGCTCGAT GCTCGTCGCC GGTTACGCGG GCCGCACGGT GCTCTCCCGG GTGCGGGGCC GCAGGGCCAC GGCCGACGAG CACGCCGCCT ACATCAAGGG CCTGCTGGTC GGTCCCCCGC CCCGCTGA
|
Protein sequence | MPDIAPDSVA DQATPAGSTL ERPAPDSPAP DGPAAAPARV PSVPAVRPAA DGWSFELQLV SFHSRDQLEQ MFATLPIDMP VVVVDNASGV DRVDELLADR PNGRYIDSGG GKGFAKASNM GIRSSAYDYV VLGNPDSRPT VEVIRTLVAD LEGDPGLVVS AATMKGQDDK PELGNGGWEP TPRRVLMHVL GAHKIAPSSA LFARPTPNRP MSPEWLTGAC MAVRRQSFLE LGGFDETFYV YNEDMALGRA IREAGMRQQL RTDLLVPHGA GGSGAGKTWM LQMRGASMVR YLRKHNAPAR VNVMRSMLVA GYAGRTVLSR VRGRRATADE HAAYIKGLLV GPPPR
|
| |