Gene Information |
Locus tag | Franean1_3518 |
Symbol | |
ID | 5671888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4179077 |
End bp | 4180324 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242405 |
Product | protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_001507825 |
Protein GI | 158315317 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2518] Protein-L-isoaspartate carboxylmethyltransferase |
TIGRFAM ID | |
|
|
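The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4179077, 4180324)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values listed in the record, this gives 1248 bp and 415 aa, matching the Gene Length and Protein Length fields.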
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.831229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG GCACCACGAG CAGCGAGAGC GAGAGTCTCC GGCACGCGAT GGTCGACCGG CTGGCCGCCG ATCATGCGGC GAAAGGCCTT GCGCTGCGGC CCGAGGTGGA GGCGGCGATG CGGACCGTCC CCCGCGAGCT GTACACGCCG GGCCTTCCAC CGGAGCAGGC GTACGAGAAC GAAGCGGTTC TCAAGAAGCG GCGCGGTACG GAAGTGATCA GTTCGGTGTC GGCGCCGTTC CTGATCGCGG AGATGCTCGG TCAGGCCGCG GACGCGCTCG GTGGGCTGGC GGGCTGTCAC GCGCTGGAGA TCGGTAGCGG TGGCTACAAC GCGTCGCTGT TGCGGGAGCT CGTCGGCCCG TCAGGGTCGG TCACAACCGT CGATATCGAC CCCGAGGTGA CCGGCCGGGC GGTCGCCTGC CTGGCGGCGG CGGGCTACAC CGACCTCACC GTGGTGTGCG CGGACGCCGA GCATCCGATC ACCTCGGGCC ACCGCTACGA CCTGATCATT GTCACCGTCG GGGCGTGGGA CATCCCGCCG GCGTGGTGTG AGCAGCTCCG TGACGGCGGT GTGCTGGTGG TGCCGCTGCG CACGTTCGGG ATCACCAAGT CGTGGGCACT GCGCCGCCGC GGGGACCGCC TGGTCAGTGA GAGCGACCGC CAGTGCGGGT TCGTCTTCAT GCAGGGCGAC GGGGCCCACA AGGTGCGGTA CGTCGACATC GCTGATGGTG TCCGCCTGCG GATGGACGAG GGCCAGCAGG TCGATCCCGC CGTTCTCGAG GGACTTCTGG CGCAGCCTCG GGAGGAGGCC TGGGCCGGGG TGAGCCTGCC GCCCCGTACC GACCTGATCG ACCTGAACCT GTGGCTGGCG ACCCGCCTCA CCCACGAGGC GGGCCAGTTC GTGGTGCTGC TGGCGGAGGA GACGGCGATC GAGGCCGGCA CGGTCGCGCC CTCCTGGCAG CACGGCACCC CGGCGACCCT GCGCGACGGC ACGCTCGCCT ACCGGTCGGC TCTGCGCTGG ACCGGCCGGC GGTTCGATCT CGGTGCCTAC GCCCACGGGT CCAAGGCCGC CGCGGCGGCC GGGCGGATGG TCGAACACAT GCGCGCGTGG GTGGACGCGG GCAGCCCGGC TCCGGTGCTG CACGTCCTTC CCGCGCACAC CCCCGACGGC GACCTGCCGG CCGGAGCGGT CCTGAACAAG CGGCACAACC GCCTCGTCCT GACCTTCACC CCCACCGCGT CCGTATAG
|
Protein sequence | MSVGTTSSES ESLRHAMVDR LAADHAAKGL ALRPEVEAAM RTVPRELYTP GLPPEQAYEN EAVLKKRRGT EVISSVSAPF LIAEMLGQAA DALGGLAGCH ALEIGSGGYN ASLLRELVGP SGSVTTVDID PEVTGRAVAC LAAAGYTDLT VVCADAEHPI TSGHRYDLII VTVGAWDIPP AWCEQLRDGG VLVVPLRTFG ITKSWALRRR GDRLVSESDR QCGFVFMQGD GAHKVRYVDI ADGVRLRMDE GQQVDPAVLE GLLAQPREEA WAGVSLPPRT DLIDLNLWLA TRLTHEAGQF VVLLAEETAI EAGTVAPSWQ HGTPATLRDG TLAYRSALRW TGRRFDLGAY AHGSKAAAAA GRMVEHMRAW VDAGSPAPVL HVLPAHTPDG DLPAGAVLNK RHNRLVLTFT PTASV
|
| |
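The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1. The function names are hypothetical.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G for each position).
# Translation table 11 uses the same amino acid assignments as the
# standard code; it differs only in the permitted start codons.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First two blocks and last nine bases of the gene sequence above.
print(translate("ATGAGCGTCG GCACCACG"))  # start of the protein
print(translate("TCCGTATAG"))            # end of the protein plus stop
```

Pasting the full gene sequence into `gc_fraction` and `translate` allows a comparison with the GC content and Protein sequence fields of the record. The trailing `*` from the stop codon should be dropped before comparing with the listed protein.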