Gene Information |
Locus tag | DehaBAV1_1010 |
Symbol | pyrC |
ID | 5131133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides sp. BAV1 |
Kingdom | Bacteria |
Replicon accession | NC_009455 |
Strand | + |
Start bp | 998586 |
End bp | 999872 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640529935 |
Product | dihydroorotase |
Protein accession | YP_001214468 |
Protein GI | 147669650 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
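The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python. The function names are illustrative and not part of any database API, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(998586, 999872)
print(length, protein_length(length))  # prints: 1287 428
```

Both results match the Gene Length (1287 bp) and Protein Length (428 aa) fields of this record.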
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC TGATTAAAAA CGGACGCATC ATAGACCCCG CCAGCGGCAC GGATAACGTG GCTGATTTGC TGATTGAGAA TGGACTGGTT ACCGGAATAA ACAAGGATAT TTCTCGTGAA AAAACCGAAA AAGTAATAGA CGCCACCGGC AAAGTGGTTT GTCCCGGCTT TATAGACCTG CACGTACACC TGCGTGAACC GGGGTTTGAG GCCAAAGAGA CTATTGAAAG CGGGTGCAAA GCAGCCGCAG CCGGCGGTTT TACCTCCATC TGCCCCATGC CCAACACCAA TCCCGCAGCA GACTGTTTGC CGGTCATAGA TTTTATTAAA AACACTGCCA CAAAGGTCTC ACTCATACGG GTACTGCCCA TTGCCGCCAT CACCAAAGGA CGCAAAGGAC AGGAACTTTC CCCCATGGGA GAACTGGCCG AAGCGGGAGT GGTGGGATTC TCGGATGATG GTGACTATGT ATCCAGCAGT TCCCTGCTCT TAAACGCCCT GCTTTACAGC CGAACCTTTG ACCTGCCCAT TATGGAACAC TGTGAAGATG CCTGTCTGGC AGAAGGCGGA CTGATGAACG AAGGGCTTCT GGCCTGCCGT CTGGGCTTAA AGGGTATAAC CAACGCCACT GAAGAAATTG CCGTAAACAG GGATATTGCC CTTGCCAAAG AAAGCGGCGG GCGGCTGCAC CTTTGCCATA TCAGCACCGC CGGTTCGGTG GAACTGGTAC GCCAAGCCAA AGCCGCAGGC ATACGGGTAA GTGCGGAGGT TACTCCCCAC CACCTGACCC TGACCGAAGC CGAAGTAAAC GGCTACAACA CCAGTGCCAA GGTCAACCCG CCGCTTCGCA CCCAAACGGA TATTGAAGCA CTTATAGCTG GGCTTAAGGA CGGTACTATA GATGCCATAG CCACAGACCA TGCCCCACAC ACCAGAAATG ACAAACTGTG TGAGTTTGGT CTAGCCGCCA ACGGCATTTC AGGGCTGGAA ACCGCACTTG CCAGCCCGAT GGGTCTGGTG CACTCAGGCA AGCTTAGTTT AAGCCTGCTT ATAGAAAAGC TGACCCTTGG ACCTCAGCAG GTACTGGGTG AAAAATACCA AAATATCGGC AACCTGAAAG CGGGCTCATG TGCAGACGTG GTTATATTTG ACCCTGATGA AGAATGGACG ATAGATACCG CCAAATTCTT CTCAAAGGGT AAAAATACAC CCCTTGAGGG ACGCAAGCTT AAAGGCAGGG TAAAGACCAC CATTGCCTGC GGCCAGATAG TTTACCAACA AGCATAG
|
Protein sequence | MKILIKNGRI IDPASGTDNV ADLLIENGLV TGINKDISRE KTEKVIDATG KVVCPGFIDL HVHLREPGFE AKETIESGCK AAAAGGFTSI CPMPNTNPAA DCLPVIDFIK NTATKVSLIR VLPIAAITKG RKGQELSPMG ELAEAGVVGF SDDGDYVSSS SLLLNALLYS RTFDLPIMEH CEDACLAEGG LMNEGLLACR LGLKGITNAT EEIAVNRDIA LAKESGGRLH LCHISTAGSV ELVRQAKAAG IRVSAEVTPH HLTLTEAEVN GYNTSAKVNP PLRTQTDIEA LIAGLKDGTI DAIATDHAPH TRNDKLCEFG LAANGISGLE TALASPMGLV HSGKLSLSLL IEKLTLGPQQ VLGEKYQNIG NLKAGSCADV VIFDPDEEWT IDTAKFFSKG KNTPLEGRKL KGRVKTTIAC GQIVYQQA
|
| |
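The sequence fields can be cross-checked against the Translation table and GC content fields. The sketch below, in Python using only the standard library, strips the 10-base grouping spaces, computes the GC fraction, and translates a CDS. The helper names are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in permitted start codons), and it assumes the sequence contains only unambiguous A, C, G, T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAATAC TGATTAAAAA CGGACGCATC"))  # prints: MKILIKNGRI
```

If the record is internally consistent, applying `translate` and `gc_fraction` to the full Gene sequence field should reproduce the Protein sequence field and a value close to the reported GC content.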