Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5611 |
Symbol | |
ID | 5673938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6808950 |
End bp | 6810140 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244464 |
Product | glycosyl transferase family protein |
Protein accession | YP_001509868 |
Protein GI | 158317360 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.126524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000566285 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCCGTG TTCTCTTCGC CGTGCCGCCG CTGACCGGGC ACGTGAACCC GGCGGTGGGC ATCGCCGGCG AGCTGGCGGC CCGCGGGCAG GAGGTGGCCC TGGTCGGCCA CGCGAGCGTC GTCGGGCCGC TCGTCCCGCC GTCCGTCCCG CTCATCGCGC TGCCAGGGGA GATATCGGCC GACCAGCGGG CCGAGCTGGA GGCACGGTCC CGGCCGCTGC GCGGGCCCGC GTCACTGAAG TTCCTGTGGG ACGAGTTCCT GCTGCCGCTG GGCGCCTCGA TGGCGCGGGA CGTCGGTGCC GTCGTCGAAC GGTGGCGCCC GGATGTGATC GTCGCCGACC AGCAGGCGGT CGGGGTCGCC ATGGTCGCCC GTCGGCGCGG CATCCGGTGG GCCACGCTCG CCACCACGTC GGCGGAGCTC GACGACCCCT ACGCCGTGCT CGCCGGGGTC GGGAACTGGG TGTCGGAGCG GCTGCGGGAC TTCCAGGTCG CGAACGGCGT CCCGGCGGAG GAGGCGGCGC GCGGTGACCT GCGCTTCTCT GAGGACCTCA CTGTGGTCTG CTCGGTGCCC TCGTTGCTGC GTACTGCCAG TCATCCGTCC CATCACGTGT TCGTCGGCTG CGCCGCCGGA CTGCGCCGGT CGGCCCCGGA GTTCCCCTGG GAGTGGCTCG ACCGGGACCG CCGCACCGTG CTCGTCTCGC TCGGCACGGT GACCCGGGAG GCCGGCGGGC GTTTCCTGCG CGCGGCCGCG GAGGCGCTGG TGGGGATGTC CGACCGGGTG CAGGCCGTGA TCGTCGCGCC TCCCGGCCCG CTGGACGACC TCGCCGGCCA GGTTCCCGAC GACCTGCTGG TCCGTCCGTT CGTGCCGCAG GTGGACCTGA TGGCCGGACT GGACGCGATA GTGTGCCACG CGGGCAACAA CACGGTGTGT GAGGCTTTGT CGCGGGGAGT GCCGCTGGTG GTCGCGCCGG TTCGTGACGA CCAGCCGATC ATCGGCGAGC AGGTGGTGCG GGCCGGTGCC GGTGTGCGGG TGCGCTTCGG GCGCTCGACC CCGGTGACGC TGGCCACCGC GATCGGCACC GTGCTCGACG AGCCGTCCCA CCGGGTCGCG GCGCGGCGGC TGCAGGGCGA GTTCAGCGCG GCGGGCGGTG TCGTGGCCGC CGCCGACCAC ATTGAGAAGC TGCTGCCGTA G
|
Protein sequence | MGRVLFAVPP LTGHVNPAVG IAGELAARGQ EVALVGHASV VGPLVPPSVP LIALPGEISA DQRAELEARS RPLRGPASLK FLWDEFLLPL GASMARDVGA VVERWRPDVI VADQQAVGVA MVARRRGIRW ATLATTSAEL DDPYAVLAGV GNWVSERLRD FQVANGVPAE EAARGDLRFS EDLTVVCSVP SLLRTASHPS HHVFVGCAAG LRRSAPEFPW EWLDRDRRTV LVSLGTVTRE AGGRFLRAAA EALVGMSDRV QAVIVAPPGP LDDLAGQVPD DLLVRPFVPQ VDLMAGLDAI VCHAGNNTVC EALSRGVPLV VAPVRDDQPI IGEQVVRAGA GVRVRFGRST PVTLATAIGT VLDEPSHRVA ARRLQGEFSA AGGVVAAADH IEKLLP
|
| |