Gene Information |
Locus tag | Franean1_6162 |
Symbol | |
ID | 5674483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7495980 |
End bp | 7497284 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245014 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001510412 |
Protein GI | 158317904 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
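
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (the usual GenBank convention, which this page does not state) and that the last codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(7495980, 7497284)  # Start bp and End bp from the table
print(length)                  # 1305
print(protein_length(length))  # 434
```

Under those assumptions, both values agree with the Gene Length and Protein Length rows.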
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.189514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCTGG TCGGAGGCGC GCCGCCGGAA CATCCCGAAC CAGCCCGGCC GAAGCCAGCC TCGCGGGTCG CGATGCTGTC GATGCACACG TCGCCCCTGG AGCAGCCCGG CACCGGCGAC GCGGGCGGGA TGAACGTCTA CGTCATCGAG CTCGCCCGGC AGCTGGCTGC GCTCGGCACC GAGGTGGAGG TCTTCACCCG GGCCGTGAGC AGCCGGCTCC CACCCGCCCT GGAGATCGCC CCCGGGGTCA TCGTCCGGCA TGTGCCCGCC GGCCCGTTCG AGGACATCGG CCGCGAGGAG CTGCCCGCCT GGCTGTGCGC CTTCACGGCG GACGTCCTGC GCACCGAGGC CGGCCACGCA GCGGGCTGGT TCGACGTCGT CCACTCGCAC TACTGGCTGT CCGGGCAGGT CGGGCTGTCG GCCGCCCGCC GGTGGGGCGT GCCTCTGGTG CACACGGCCC ACACCCTGGC GCGGGTCAAG AACGCCTCTC TCGCCGACGG CGACCGCCCC GAGCCGGAAC CGCGCGTCCA GGGCGAGCAG GAGATCATCA AGGCGGCGAC GCGGCTCATC GCGTCGACCG ACACCGAGCG CCGCCACCTC ACCCAGCTCT ACGGCGCGGC GCCGGGCAAG GTGGACGTGG TCGCCCCCGG CGTCGACCTT GACGTCTTCC GCCCCGGCGA CCCCCGGGCC GCCCGCAAGC GGGTCGGCCT CGACCCCGAC ACCCAGCTCC TCCTCTTCGT CGGACGCATC CAGCCGCTCA AGGCACCGGA CGTCCTGCTC GCCGCCGCGG CCGAACTCAT CCACCGCGAC CCCGACCGCC GCGGACAGCT GGCGGTCGTC GTCGTCGGCG GCCCGAGCGG CTCCGGGCTC GAACGCCCGG ACTCTCTGGT CAAGCTCGCC GCCGAACTCG GCATCACCGA CATCGTCCGT TTCCAGCCGC CGGTACCCCA GGAGCAGCTC GCCCACTGGT ACCGCGCGGC GACAGCGGTC GTCGTCCCCA GCCACAGCGA GAGCTTCGGC CTGGTCGCGG TCGAGGCCCA GGCCTGCGGC ACCCCCGTGG TGGCCGCCTC CGTCGGCGGC CTGCGCACCG CCGTCGCCCA CGGGACGTCC GGAGTCCTCG TCCACGGCTG GGAGCCCGCC GACTACGCCG ACGCCCTGGA ACGCATCCTC ACCGAGGAAC GCTGGCGCCG GCACCTGTCG ACAGGCGCCC GCCTGCGCGC CGCGAGCTTC GGCTGGACAG CGACCGCGAA AGGCGTCCTC GCGAGCTACC AGGCGGCGAT CTCACCAGCG GCCGTCGCCG TCTGA
|
Protein sequence | MHLVGGAPPE HPEPARPKPA SRVAMLSMHT SPLEQPGTGD AGGMNVYVIE LARQLAALGT EVEVFTRAVS SRLPPALEIA PGVIVRHVPA GPFEDIGREE LPAWLCAFTA DVLRTEAGHA AGWFDVVHSH YWLSGQVGLS AARRWGVPLV HTAHTLARVK NASLADGDRP EPEPRVQGEQ EIIKAATRLI ASTDTERRHL TQLYGAAPGK VDVVAPGVDL DVFRPGDPRA ARKRVGLDPD TQLLLFVGRI QPLKAPDVLL AAAAELIHRD PDRRGQLAVV VVGGPSGSGL ERPDSLVKLA AELGITDIVR FQPPVPQEQL AHWYRAATAV VVPSHSESFG LVAVEAQACG TPVVAASVGG LRTAVAHGTS GVLVHGWEPA DYADALERIL TEERWRRHLS TGARLRAASF GWTATAKGVL ASYQAAISPA AVAV
|
| |
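
The gene and protein sequences above are related through translation table 11, and the GC content can be recomputed from the gene sequence. The following is a minimal Python sketch of both, not part of the original record. It assumes the sequence is pasted with the blank-separated 10-base groups shown above. It also assumes that a first codon of ATG, GTG, or TTG is read as methionine (these are a subset of the initiators table 11 allows). The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code and differs only
# in which codons may act as initiators.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these initiators are handled (a subset of table 11's list).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCATCTGG TCGGAGGCGC GCCGCCGGAA"))  # MHLVGGAPPE
```

Passing the full Gene sequence field to `translate_cds` and `gc_fraction` gives values to compare against the Protein sequence and GC content rows.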