Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3201 |
Symbol | pyrC |
ID | 3906167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3794449 |
End bp | 3795825 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880525 |
Product | dihydroorotase |
Protein accession | YP_482287 |
Protein GI | 86741887 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0334723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC CCGCGGCGGA AACGTCGGAG GAAACGCCGG TGGGGAGGTC GGCGGGGACC TCCTGGGTCC TGCGGCGGGT CCGCCCGCTC GGCGGGGACC CCGTCGACGT CGTCCTCGCG GACGGGGTGG TGGCCGCCTG GCGACCGGCC GGCTCCACGC ACGCCGGGGC CGGGGGACTG CCCGCCGGCA CCACCGTGCT GGACACGGAC GGGCTGATCC TGCTCCCCGG CCTGGTCGAC CTGCACACCC ACCTGCGGGA ACCCGGCCGG GAGGACGCCG AGACGGTCGC CTCCGGCACC CGCGCCGCGG CCCTCGGTGG CTACACCACC GTGTTCGCGA TGGCGAACAC CAATCCGGTC GCCGACACCG CGGGGGTCGT CGAGCAGGTG TGGCGGCTCG GCCTGGACGC GGGTCACTGC GATGTCCGGC CGGTCGGCGC GGTCACCGTC GGACTTGCCG GCGAGCGGCT CGCCGAACTC GGCGCCATGG CCTCCTCCGC GGCGGGCGTG CGGGTCTTCT CCGACGACGG GCACTGCGTG TCGGACGCGC TGCTCATGCG CCGGGCGCTG GAGTACGTCA AGGCGTTCGA TGGGGTGATC GCCCAGCATG CGCAGGAACC GCGGCTGACC GAGAACGCCC AGATGAACGA GGGCACGGTG GCTGCCAGGC TGGGGCTGCC GGGGTGGCCC GCGGTCGCCG AGGAGGCGAT CATCGCCCGG GACGCGCTGC TGGCCGGGCA CGTCGGCTCC CGACTGCACG TCTGTCACGT CTCCACCGCC GGATCGGTAG AGATCATCCG GTGGGCCAAG GCGAAGGGCT GGAACGTCAC CGCCGAGGTG ACCCCGCACC ACCTGCTGCT CACCGACGAC CTGGTCTGCT CGTTCGACCC GGTCTACAAG GTCAACCCGC CGCTGCGCAC CGCCGAGGAC GTCGCCGCGC TGCGCGCCGG GCTCGCCGAC GGCACGATCG ACTGCGTCGC CACCGACCAC GCTCCGCACG CGCTGGAGGA CAAGGAGACG GAGTGGGCCG CCGCGCGTCC CGGCATGCTC GGTCTCGAGA CGGCGCTGTC GGTGGTCATC GAGACGATGG TCATCCCGGG CCGGCTGGAC TGGGCCGGGG TCGCCGAGCG GATGGCACTG GCCCCGGCAA GGATCGGTGG CCTGCCGCGA ACCGCGGCCG AGGTGTGGAG TTCCATAGCA GTGGGCGCGC CCGCGACCGT CACCCTGCTT GACCCGGCGC CGTGGCGGAT GGTCGAGCCG GACGCGCTCG CCAGCCGCAG CCGCAACACG CCCTATGCGG GCCGGTCGCT GCCGGGAACG ATCCGCGCCA CGTTCCTGCG GGGACGGCCC ACCGTGCTCG ACGGGAAGAT CGTATGA
|
Protein sequence | MTTPAAETSE ETPVGRSAGT SWVLRRVRPL GGDPVDVVLA DGVVAAWRPA GSTHAGAGGL PAGTTVLDTD GLILLPGLVD LHTHLREPGR EDAETVASGT RAAALGGYTT VFAMANTNPV ADTAGVVEQV WRLGLDAGHC DVRPVGAVTV GLAGERLAEL GAMASSAAGV RVFSDDGHCV SDALLMRRAL EYVKAFDGVI AQHAQEPRLT ENAQMNEGTV AARLGLPGWP AVAEEAIIAR DALLAGHVGS RLHVCHVSTA GSVEIIRWAK AKGWNVTAEV TPHHLLLTDD LVCSFDPVYK VNPPLRTAED VAALRAGLAD GTIDCVATDH APHALEDKET EWAAARPGML GLETALSVVI ETMVIPGRLD WAGVAERMAL APARIGGLPR TAAEVWSSIA VGAPATVTLL DPAPWRMVEP DALASRSRNT PYAGRSLPGT IRATFLRGRP TVLDGKIV
|
| |