Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5814 |
Symbol | |
ID | 5674137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7056507 |
End bp | 7057730 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244664 |
Product | glycosyl transferase family protein |
Protein accession | YP_001510066 |
Protein GI | 158317558 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.72272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTGT TGTTGTCGAC GTATGGGTCA CGCGGGGACG TGGAGCCGCT GGTGGGACTC GCGGTACGGC TGCGGGCGCT CGGCGCGGAG GTACGAATGT GCGCACCGCC CGACAAGGAG TTCGCGCAGC GGCTGGCCGA CGTCGGCGTG CCGCTGGTGC CGGTCGGCCC GCCGATGGGC TCGATGGTGC GGCCGTCATC GGCGGGAGCG GCGTCCCGGC GGGTGTCTGA GCTGGCCGGG CTGTTCGACT CGGTCGCCGC GGCCGCCGGG GGATGTGACG CGCTTCTGGC GACCGGGTTG GCGCACTTCG CGTCGCGGTC GGTGGCCGAG AAGCTGGGCA TCCCTTACGT GTACGCGACC TTCTGCCCGT TCCTGCTGCC GTCGCCGCAT CACGCGCCGC CGCTGGTGCT GCCGGGTGAG TCGTTCCCAC CGGGGGTGAC TGACAACCGG GTGCTGTGGG AGCTGAACGC GCAGAGCTTC AACGCGCTGC ACGGCGAGGC GCTCAACGCC CACCGGGCCT CGGTCGGTCT GCCGCCGGTG GACGACGTCC GCACTTTCAT CTTCACCGAC CATCCGTGGC TGGCCACGGA TCCCGCCCTC GGCCCGTGGC AGGAGACGGC GGACCTCGAC GTCGTACGCA CCGGGGCGTG GATCCTGCCG GACGAGCGCC CGCTCCCGGC CGAGCTGGTG GCGTTCCTGG ACGCCGGCCC ACCACCGGTG TACGCCGGCT TCGGCAGCAT GCGCACCGTC TCGGCGGACA TCGCCCGGGT GGCCATCGAA GCGATCCGCG CGCAGGGCCG CCGGGCAGTC GTCGGGCGCG GCTGGGCGGG CCTGGCCCTG ATCGACGATG GGGACGACTG CCTCGTCGTC GGCGAGGTCA ACCAGCAGGC GCTGTTCAGC CGGGTGGACG CTGTCGTGCA CCACGGCGGC GCGGGTACGA CGACGACGGC CGCTCGGGCA GGCGTTCCTC AGGTGGTGGT ACCCCAGGCG GGTGACCAGC TGTACTGGGC TGGCCGGGTA GCGGCCCTGG GTATCGGCGC GGCACACGAC GGTCCGACTC CGACCACCGA GTCCCTGTCC GCCGCGCTCA GCACCGCTCT GGCCCCCGAG ACCCGCGCAC ACGCGACGAC CCTGGGCGGC ACGGTTCGGA GCGACGGGGC GACCGTGGCC GCAGCGCTGC TACTCGAAGC GGCAAGCCGA GAAAGGCCTT CCATGCCCGG GTGA
|
Protein sequence | MRVLLSTYGS RGDVEPLVGL AVRLRALGAE VRMCAPPDKE FAQRLADVGV PLVPVGPPMG SMVRPSSAGA ASRRVSELAG LFDSVAAAAG GCDALLATGL AHFASRSVAE KLGIPYVYAT FCPFLLPSPH HAPPLVLPGE SFPPGVTDNR VLWELNAQSF NALHGEALNA HRASVGLPPV DDVRTFIFTD HPWLATDPAL GPWQETADLD VVRTGAWILP DERPLPAELV AFLDAGPPPV YAGFGSMRTV SADIARVAIE AIRAQGRRAV VGRGWAGLAL IDDGDDCLVV GEVNQQALFS RVDAVVHHGG AGTTTTAARA GVPQVVVPQA GDQLYWAGRV AALGIGAAHD GPTPTTESLS AALSTALAPE TRAHATTLGG TVRSDGATVA AALLLEAASR ERPSMPG
|
| |