Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2034 |
Symbol | |
ID | 3906751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2394076 |
End bp | 2395302 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637879371 |
Product | protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_481137 |
Protein GI | 86740737 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2518] Protein-L-isoaspartate carboxylmethyltransferase |
TIGRFAM ID | [TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.908018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.539174 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAG TCCAAGATTC GAGCGCCACC GACAGTGCGG CCACGTTGCG TGCCGCGATG GTGGACGAGC TTCGCACCAC GGGGGACGCG ATCAAGACTG GGCAGGTGGC TGCGGCTGTG GGCAGGGTGC CTCGGCACCT TTTCGCACCC GACGAACCGC TGGAGGCGGT CTACGCCGCA AACAAAGCAC TTGTGATTAA ACGCGATGGG AATGGTGCGG CGCTCAGCTC GCTGTCCGCC GCGCACATTC AGGCGGTCAT GTTGGAGCAG GCCGAACTTG AACCGGGAAT GCGTGTCCTG GAGGTTGGCT CGGGCGGCTA CAACGCCGCC CTCATCCAGG AGATGGTCGG CGACGGAGGC TCGGTGACGT CCGTCGACAT CGACCAGGAG ATCGTGAGCC GGGCCCGCGC CTGTCTGGAC GCGGCCGGCT ATCGAAATGT TGAAGTCGTG GCGGCCGACG CCGAGGCCGG CGTACCCGAA AAAGCACCCT ACGACCGGAT TATCGTGACC GCAGGGGCCT GGGACATCCC ACCAGCCTGG CAGGAGCAGC TCACGAACGG CGGCAGGCTC GTTGTGCCGC TCCGGCTACG AGGGCTGACC CGGTCGATCG CGTTCGACCG GGTCGACGAG GACGGTGACG TCGGTCTGGT CAGCCGCAGC TATCGCCTGT GCGGATTCGT CCCGATGCAG GGCATCGGGA CGTTCCGCGA AAGGCTCGTC CCTATTACCG ACGAGGTCGT GCTTCGGGTC GACGACCCGT CCCAGGAGTT CGACGTCGAG GGGCTGCGCG ACGCGCTGGC CACGCCGAGG CTGGAACGCT GGTCCGGAGC GGCCTTCGAT CTTCCTGACG AGTTGGAGTT CTTCCTCGTC ACGAATCTGG CCCAGGTCGC GCACCTGCAT GTGGACGAGA CGCAGGTCCA GAACGGCCGC TTCGCGCCCT CGGCCGCCAG AGGTGTGCCC ACGCTGGTCA GCGGTGGCAG TTTCGCCTAC CGCACCAAGC GCCCGAACGA GAAGACCGGC GGATTCGAGA GCGGCGTCGT CGCGCACGGT CCCGACGCCG ATCGCGTCGC CGAGCACTAC GTGGAACTGC TCCGCCGGTG GGCCAGCGAT CACCGCCGTT CCGGTGCCGC CCACATCCGA TACGTCCCGA AGGCCGCGGG GACGCCGGCG CCGTCCGTGG GGCTGGTGCC GAAACGGCAC GGCGCTGTCG CTGTCCGCTG GCCCTGA
|
Protein sequence | MTSVQDSSAT DSAATLRAAM VDELRTTGDA IKTGQVAAAV GRVPRHLFAP DEPLEAVYAA NKALVIKRDG NGAALSSLSA AHIQAVMLEQ AELEPGMRVL EVGSGGYNAA LIQEMVGDGG SVTSVDIDQE IVSRARACLD AAGYRNVEVV AADAEAGVPE KAPYDRIIVT AGAWDIPPAW QEQLTNGGRL VVPLRLRGLT RSIAFDRVDE DGDVGLVSRS YRLCGFVPMQ GIGTFRERLV PITDEVVLRV DDPSQEFDVE GLRDALATPR LERWSGAAFD LPDELEFFLV TNLAQVAHLH VDETQVQNGR FAPSAARGVP TLVSGGSFAY RTKRPNEKTG GFESGVVAHG PDADRVAEHY VELLRRWASD HRRSGAAHIR YVPKAAGTPA PSVGLVPKRH GAVAVRWP
|
| |