Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1336 |
Symbol | pyrC |
ID | 4569707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1531191 |
End bp | 1532531 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765925 |
Product | dihydroorotase |
Protein accession | YP_911791 |
Protein GI | 119357147 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA TTTTTCAAAA CGCCCATATC ATCAACCCAC AAAGCAACCT TGATTATACC GGATCAATAA GGGTGTCCGT TGATGGTTTC ATTGAAGAGA TTATTCAAGG CGAATGCGAT AAAAGCCCCG ACGACAGAAT AATCGATCTT CAGGGAAAAC TGCTGGTGCC GGGTCTTTTC GACATGCACT GCCATTTCCG TGAACCGGGG CAGGAGTATA AGGAAACCCT TGAAAGCGGC TCGAAAGCTG CCGTAGCCGG CGGATTTACC GGGGTAGCCC TCATGCCTAA CACAAAACCG GTGATCGACA GCCCGCTCGG AGTAGCATAC ATACGTCACA ACGCACAGCA GTTGCCGGTT GATCTTGAGG TTATCGGCGC AATGAGCGAA GGAAGCAAGG GAGAACAGCT TGCACCATAC GGAAAATTCC GTTCGTATGG GGTAAAAGCT GTTTCCGATG ACGGAACGGC GATTCAGAAC AGCCAGAATA TGCGACTGGT GTTCCAGTAC GCATCAAATT TCGATCTTCT TGTCATTCAG CATTGCGAAG ACAAATCCAT GACCGCCGAA GCCGTCATGA ACGAAGGGGT ATTTTCAACA AAACTTGGCC TTAAAGGAAT ACCTGATGTA TCCGAAGCGG TCATGCTGTG CCGTGATCTT CACCTGATCC GCTATATCGT GGAGCACGAA TTGCACGATC CGGCCAACAA ACCGAGATAC CATGTGGCAC ACATCAGCAC CAAAGCCTCC CTCGACCTTG TCCGGCAGGC TAAAGCCGAA GGCTTGCAGG TAACGTGCGA GGTTACGCCT CACCATTTCA CCCTTACCGA CGAAGATCTT TTCAATGCGC CCTCAAAAGG CAATTTCATC ATGAAGCCCC CGCTCCCCTC AAAGAAGAAC AGGGCAGCTA TCCTTGAAGC AATTGCAGAC GGAACGATTG ATGCCATTGC TACCGATCAC GCCCCTCATG CTCCACATGA AAAAGAGTGC CCTCCCGATC AGGCGTCATT CGGCATTATC GGTCTGGAAA CCGCTGTAGG ACTCACAATA ACCGAACTGG TCGAACCGGG AATCATAACG CTTTCAAGAG CGATTGAGCT GATGTCAGTC AATCCCCGTA AAATTCTTCA GCTTGATCCC CTGCTGTTCG CGCAGGGAGA AAGGGCTAAT TTTACCATTA TTGATCCAGA GGAGGAGTGG GCGCTTACCG CGAATGCCGT TAAATCAAAG TCAACGAATA CGCCCTTTCT TGGCCGTAAA CTCAAGGGCA GGGCGATTGC TGTTTACCAC AAAGGGATGT TTCATGAAAG CGTAACTTCA CAAGAGCATT TTAGCGTGTA A
|
Protein sequence | MSTIFQNAHI INPQSNLDYT GSIRVSVDGF IEEIIQGECD KSPDDRIIDL QGKLLVPGLF DMHCHFREPG QEYKETLESG SKAAVAGGFT GVALMPNTKP VIDSPLGVAY IRHNAQQLPV DLEVIGAMSE GSKGEQLAPY GKFRSYGVKA VSDDGTAIQN SQNMRLVFQY ASNFDLLVIQ HCEDKSMTAE AVMNEGVFST KLGLKGIPDV SEAVMLCRDL HLIRYIVEHE LHDPANKPRY HVAHISTKAS LDLVRQAKAE GLQVTCEVTP HHFTLTDEDL FNAPSKGNFI MKPPLPSKKN RAAILEAIAD GTIDAIATDH APHAPHEKEC PPDQASFGII GLETAVGLTI TELVEPGIIT LSRAIELMSV NPRKILQLDP LLFAQGERAN FTIIDPEEEW ALTANAVKSK STNTPFLGRK LKGRAIAVYH KGMFHESVTS QEHFSV
|
| |