Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0032 |
Symbol | |
ID | 5668458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 40475 |
End bp | 41677 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641238961 |
Product | protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_001504406 |
Protein GI | 158311898 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2518] Protein-L-isoaspartate carboxylmethyltransferase |
TIGRFAM ID | [TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTCCT GCACGGCCGA CGTCTCCCTC GCGCCGCTGC GGAACGCGAT GGTCGAACGG ATCCTGGCCG CCAAGCCGGT GTCGGCCCCG GTCGAGGCTG CGATGCGCAC GGTTCCACGC GAGCTGTTCC TTCCAAACCT GCCACCCGAG GTGGCGTACC AGGACCGGGC CGTGGTCCTC AAACGCGACG TCTACGGGAA CCCTGTGGGC TCGGTGTCCC AGCCCTCGGT GATTGCGGCG ATGCTGGAAG CGCTGCGGGT CGAACCTGGC CAGCGGATTC TCGAGCTGGG TTCCGGGGGC TACGGGGCCG CGTTGTTGGC GAGGCTGGCC GGTCGGACAT GTTCGGTGGT GAGCATCGAC CTCGATGAGA CCGTGATCCA CCGCACCCAC GAATACCTGC GCGCGGCCGG CTACACCGGG ATCACGGCGC TGGTCGGCGA TGGGCGTTAC GGGTTCCGGC TCCGCGCCCC CTACGACCGG ATCATCGTCA CGTTCGACAC GCTCGACGTG CCACAGGACT GGTTCGACCA GCTTGTCGAG GGCGGCAGGG TCATCATCCC GCTACACCTG CGCGGTCTCA CCCGCACCAT CGCGTTCACC AAGACCGGCG GGATCCTGAC CTCCGACTCG ATCCGACCGT TCGCGTTCGG ACCGATGCAA GGATTCGGTG TCCCCCGCCG AACCACCGTG TCCCTCGCAG GCGGGGCACG GCTCGATGTC GGCCCCGACC AGGACGTCGA CGTCGACGCC CTGTGCGCCG CCCTCGCCGA GCCACGGCAC ACGGTGGGAA CCGGTGTCAT CCTCCCCCCC GGAAACACCC TGCTACCGAG CCTGGACCTG TGGCTGGCCA CAGCCCAACA GACCTACGGC CGGCTCCACA CCGACACCCG CACCGATCAG CAGGGCCCGG ACGCCTCGGC CATCCCGCCC GGTGTGCCAG CAGTCTGGTC CGACGACACG ATCGCCTTCC TCGCCCTCGC ACAGACCACC GGCACGGGAC TCGAACTCGG TGTGATCGCC TACGGGCCGC AGGCAGGCCG GCTCGGAGAC CAGCTCGCTC ACGACGTCCG TGCCTGGAAC ACAGAGCACT ACGGCAAGCT CCCCACGATC AAGGTCTACC CCCGCGGAAT CGTCGAGAAG TCCCCGCCAC CGGGCCGGCT TCTCGACCTG CCTTCCGCGC AGGTACTGAT CAGCTGGGGC TGA
|
Protein sequence | MTSCTADVSL APLRNAMVER ILAAKPVSAP VEAAMRTVPR ELFLPNLPPE VAYQDRAVVL KRDVYGNPVG SVSQPSVIAA MLEALRVEPG QRILELGSGG YGAALLARLA GRTCSVVSID LDETVIHRTH EYLRAAGYTG ITALVGDGRY GFRLRAPYDR IIVTFDTLDV PQDWFDQLVE GGRVIIPLHL RGLTRTIAFT KTGGILTSDS IRPFAFGPMQ GFGVPRRTTV SLAGGARLDV GPDQDVDVDA LCAALAEPRH TVGTGVILPP GNTLLPSLDL WLATAQQTYG RLHTDTRTDQ QGPDASAIPP GVPAVWSDDT IAFLALAQTT GTGLELGVIA YGPQAGRLGD QLAHDVRAWN TEHYGKLPTI KVYPRGIVEK SPPPGRLLDL PSAQVLISWG
|
| |