Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1045 |
Symbol | pyrC |
ID | 7979186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1095178 |
End bp | 1096461 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644797998 |
Product | dihydroorotase |
Protein accession | YP_002949171 |
Protein GI | 239826547 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTA TTTTGAAAAA TGGCAAGTCG TTCAATAAAG ATGGTGTGAT CGAACGGACG GAACTAAAAA TCGAAAATGG ATTTATTACC GCCATCGGCT CCAAGCTTCA CAGTGAAGAA GCAGACGAAG TTATCGATGT ACAAGGGAAG TTGATATCAG CCGGATTTAT CGATTTGCAT GTTCACCTGC GCGAACCGGG CGGCGAAGCG AAAGAAACGA TTGCCACCGG AACGCTGGCA GCAGCAAAAG GTGGTTTTAC CACAGTGGCG GCAATGCCGA ATACGCGACC AGTGCCGGAT ACGAAAGAAC AAATGGAATG GCTTTGCAAG CGGATCCGCG AAACGGCTTA TGTCCATGTG CTTCCATATG CGGCCATTAC GGTCGGCCAG CAAGGAACAG AGCTGACCGA CTTCGCCGCA TTAAAAGAAG CGGGTGCGTT CGCGTTTACC GATGACGGGG TAGGCGTGCA GTCTGCCGGC ATGATGTATG AAGCGATGAA GCGGGCTGCT GCACTAGATA TGGCGATTGT CGCCCATTGC GAAGATAACA CTCTGGCGAA TCGCGGTGTG GTGCATGATG GCGAATTTGC GCACCGCTAC GGGCTATATG GAATTCCATC CGTATGCGAA TCGGTACATA TCGCGCGTGA TGTGCTATTA GCGGAAGCAA CGGGATGTCA CTACCATGTG TGCCATATTA GCACGAAAGA ATCGGTCCGC GTTGTCCGCG ATGCAAAACG GGCAGGAATT CGCGTCACCG CGGAAGTGAC GCCGCATCAT CTTCTTTTAT GCGATGAAGA TATTCCAGGC CCTGACGCGA ATTATAAGAT GAATCCGCCG CTTCGCAGCA AAGAAGACCG CGAGGCGTTA ATCGAGGGGC TGCTTGATGG CACGATCGAC TTTATCGCAA CCGACCATGC CCCGCATACG GAAGCGGAAA AACAAAAAGG AATCAATGCC GCCCCGTTTG GCATTGTCGG TTTGGAAACG GCGTTTCCGC TCCTTTATAC CCACTTGGTC GAAACAAACA TATTGACACT GAAGCAGCTG ATTGATTTGC TGACGGTGAA GCCGGCTGAA TGCTTCGGCT TGCCGCTTGG AAAGCTTGCT GTCGGCGAGC GGGCGGATAT TACGATTATA GATTTAGAGA CCGAAGAAGC AATTGATCCA CAGACGTTTG TATCCAGAGG GAAAAATACT CCATTTGCCG GTTGGAAATG TAAGGGTTGG CCGGTGATGA CGTTTGTCGG CGGAAAACTA GTTTGGCAGA AAGGAAGAGA ATAA
|
Protein sequence | MAIILKNGKS FNKDGVIERT ELKIENGFIT AIGSKLHSEE ADEVIDVQGK LISAGFIDLH VHLREPGGEA KETIATGTLA AAKGGFTTVA AMPNTRPVPD TKEQMEWLCK RIRETAYVHV LPYAAITVGQ QGTELTDFAA LKEAGAFAFT DDGVGVQSAG MMYEAMKRAA ALDMAIVAHC EDNTLANRGV VHDGEFAHRY GLYGIPSVCE SVHIARDVLL AEATGCHYHV CHISTKESVR VVRDAKRAGI RVTAEVTPHH LLLCDEDIPG PDANYKMNPP LRSKEDREAL IEGLLDGTID FIATDHAPHT EAEKQKGINA APFGIVGLET AFPLLYTHLV ETNILTLKQL IDLLTVKPAE CFGLPLGKLA VGERADITII DLETEEAIDP QTFVSRGKNT PFAGWKCKGW PVMTFVGGKL VWQKGRE
|
| |