Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2149 |
Symbol | |
ID | 5670549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2578176 |
End bp | 2579357 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641241070 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001506491 |
Protein GI | 158313983 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCACT TGGGAAAGCT CTCCGCCGGT ATATCTGATC TCCGAGTCGG AGTGGTTGCC GAGTGCTTTC CCGTCTATCG CGCCGCAGTC CTCGCCGAGC TCCTGCGAAT TCCAGACATC AAGTATTATT TCCTGGGTGG CACCGAACCA ATCCTGCCTG GGTACCGGAC CCACATCCCG GGCCGTCCGG AGGACTTTAT TCGGCTCCGT ACGAGGAGGA TCGGCCCCAT CCGATGGCAG CAGGGGTTGC TCCGAGAGAT CGTGGCGAGA CGGTTCGACG TACTCATCAT TACTGGCGAC TGGGCGTTCA TCTCGACCTG GCTCGGTGCG ATAGTTGCTC GTCTGCTGGG ACTGCCGGTG CTGTTCTGGA CCCATGGTTG GGCTCGTCCT GAGCGAGGGC TCCGGCTTCT CGTCCGGCGC TGCTTCTATC GGTGCGCAAC CGGCCTTCTG CTGTACAGCG AGTACGGCCG TCGCTTGGCG GAGTCCTACG GGCTTCCCGC CAATCGGCTA TTCGTCGTGC ATAACAGCCT GGACCTACCA GCCCAAGATG CTGCCGCACA GTCTATCGAG CCCTCGTCGG TCAAAGCAGT GCTCGAAAGG TTTCCGGACC CAAGTCTTCC ACTTGTTGTT TCGAGCTTTC GGCTTGTCTC TGATCGAGCA GTGGACGAAT GTATCTCTGC TGTTGCGTGG CTGGGCCGCA CCGGGTTCCC GGTCAACTAC CTTATCGTCG GAGATGGTCC AGATCTTCCC CGGCTGCAGG CTGTGGCGGT CGAGTCAGGG GCTGCAGTTT CCTTCTTCGG ACCCTGCTAT GACGAAGCTA CGCTGGCCAA GGTTTATGCC GCTGCGGACG TGTCGGTAGC GCCCCGAATG GTTGGGCTGT CCGCTCTCCA GAGTCTGGCG TATGGGACCA TGATGGTGAC CTGTGATGAC ATCACCCTAC AGACTCCTGA ATGGGAGGTC CTGGAGGACG GTGTCACTGC GGTGCTCTAT ACCGCCGGCG ATGTATCGGC ACTGGCGAGG GCGATGCGAA AAGTGATCGC TCTGTCGCGG TCCGGCGAGA TTGACGAGAA TCGGCTACGG AAACGCCTGG CTGAGTCATA CAACCCGGCC GAACATGCAC GTCGAATAAA CGCGGCTGTA CTGGCTGCGG CCCAGGGTCG GGCTGAGGCC GGTGGTAGCT GA
|
Protein sequence | MPHLGKLSAG ISDLRVGVVA ECFPVYRAAV LAELLRIPDI KYYFLGGTEP ILPGYRTHIP GRPEDFIRLR TRRIGPIRWQ QGLLREIVAR RFDVLIITGD WAFISTWLGA IVARLLGLPV LFWTHGWARP ERGLRLLVRR CFYRCATGLL LYSEYGRRLA ESYGLPANRL FVVHNSLDLP AQDAAAQSIE PSSVKAVLER FPDPSLPLVV SSFRLVSDRA VDECISAVAW LGRTGFPVNY LIVGDGPDLP RLQAVAVESG AAVSFFGPCY DEATLAKVYA AADVSVAPRM VGLSALQSLA YGTMMVTCDD ITLQTPEWEV LEDGVTAVLY TAGDVSALAR AMRKVIALSR SGEIDENRLR KRLAESYNPA EHARRINAAV LAAAQGRAEA GGS
|
| |