Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5223 |
Symbol | |
ID | 5673557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6271683 |
End bp | 6272939 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641244077 |
Product | glycosyl transferase family protein |
Protein accession | YP_001509487 |
Protein GI | 158316979 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0184372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCT ACCTCGTCTA CTGCACGACC GCTCCCGGTC ACCTGTTTCC GCTGGTCCCA GGCCTGCGCG CCCTGCGCGA CCGGGGGCAC GAGATCCACC TGCGCATCGG CGCGCGGCTG GTGGACGGCC CGTGGGCCGA CGGCCTGAAC GTCGCGCCGA CCGACCCGCG CATCGACGAG ATCGAGGTGA AGGCGGACGC GAGCGGCACC GGCCCGGCCC AGCTCCGACG GGCGCTGGCC GGGCTGATGT CGCGCGGCCC CCTGGAGCGG GCCGACCTCG AACGGGCCGT CGCCCAGGTG GACCCGGACG TGATCCTCGT CGACACCAAC GCCTACGGCG CCGCGATCGC CGCGCGGGCG TCCGGGCGGC CGTGGGGGAT CGTGCTGCCG TCCCTTGTCG CCCTGCCGGG CAAGGGAATC CCCCCGTACG GCCTGGGCCT CGCGCCGGGA CGGGGGCCGC TCGGCCGGCT GCGCGACCGG GTGGTGTGGC CGCTGGTCAT CCGCCAGTTC GGCGCGGCGA TGCTGCCCCC GCTCAACGCC ATGCGGGCGG ACGCCGCCCT CCCGCCGCTG ACCAGCCCGA TCGACTACTT CCTGGAGGCC GACCGGCTTC TGGTGCTGAC CGGCGACCCG CTGGAGTACC CGCGCTCGGA CGCGCCGGAG CATGTCCGCT TCGTCGGCGC CCAGATCTAC GACCCGCCCG GCGAGGCACC GGCCTGGCTC GCCGAGGACG GCGACCCCTG GGTGCTCGTC ACCACCTCGA CCGACTACCA GGGCGACGAA CGCCTCGCCG TGGCCGCCGT CGAGGCCCTG CGTGACGAGC CGGTCCGCGT GGTGATCACC CTGGCGGACG CCTACGGGCA CGTCGAGCTG CCGGCCGCGG CGAACGTCCG GGTCGAGCGG TTCGTGCCGC ACGGCCCGGT GCTCGAGCGC GCCGCCGCCG TCATCTGCCA CAGCGGGATG GGCATCGTGC ACAAGGCCAT CGCCGCCGGT GTGCCCGTCG TCGCCGTGCC GTTCGGGCGG GATCAGCCCG AGGTCGCGCG CCGGGTCGTG GAGGCCGGCG CAGGCGTCAT GCTGCCGGCC AGGCGGCTGA CCGCGGAGCG GCTGCGGGAC GCGCTGCGCA CCGCGATGAG CAGGCGCGCC GGGGCGCTGA ACGCCGCACG CCGGATCAGG GCGAGCGGCG GCGGCGAGAG CTTCGCCACC GAGGCAGAGG GCCTCGCGGC CGCGCCGCGG CCCGCCGCCG CACGGCCCGC CGCCTGA
|
Protein sequence | MARYLVYCTT APGHLFPLVP GLRALRDRGH EIHLRIGARL VDGPWADGLN VAPTDPRIDE IEVKADASGT GPAQLRRALA GLMSRGPLER ADLERAVAQV DPDVILVDTN AYGAAIAARA SGRPWGIVLP SLVALPGKGI PPYGLGLAPG RGPLGRLRDR VVWPLVIRQF GAAMLPPLNA MRADAALPPL TSPIDYFLEA DRLLVLTGDP LEYPRSDAPE HVRFVGAQIY DPPGEAPAWL AEDGDPWVLV TTSTDYQGDE RLAVAAVEAL RDEPVRVVIT LADAYGHVEL PAAANVRVER FVPHGPVLER AAAVICHSGM GIVHKAIAAG VPVVAVPFGR DQPEVARRVV EAGAGVMLPA RRLTAERLRD ALRTAMSRRA GALNAARRIR ASGGGESFAT EAEGLAAAPR PAAARPAA
|
| |