Gene Information |
Locus tag | GSU1272 |
Symbol | pyrC |
ID | 2686576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1386623 |
End bp | 1387900 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637125946 |
Product | dihydroorotase, multifunctional complex type |
Protein accession | NP_952325 |
Protein GI | 39996374 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
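The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (a common convention for such records, not stated in the record itself) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed for GSU1272.
    length = gene_length_bp(1386623, 1387900)
    print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1278 bp and 425 aa, matching the Gene Length and Protein Length fields.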
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0255922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTGC TGATAAAAGG TGGGAGGGTG ATTGACCCGT CCCAGGGAAT TGACGAAGTT CTGGATATCC TCGTGGAGAA TGGCGCAATC AAGGAACTCG GCAAGGGACT CGCGGCTCCG GCCGGGGCCG GGGTCGTGGA CGCCGCCGGC CTGATCGTCA CGCCGGGCCT CATTGATATG CATGTGCACC TGCGGGACCC GGGGCTCGAG TACAAGGAAG ATATCGTAAC AGGCACCAGG GCGGCTGCGG CCGGCGGCTT CACGTCGGTG GCCTGCATGC CCAACACCAA GCCGGTGAAC GACAACAAGG CCGTGACCAG CTACATCGTC GCCAAGGCCA AGGCCGAGGG GCTCGTCAAC GTCTTCCCCG TGGGGTCCAT TACTCAGGGG AGCAAGGGGG ATGCCCTGGC CGAGATGGGG GACCTGAAGG AAGCAGGCTG CGTGGCGGTT TCCGACGACG GCCGGCCCGT GACCAGTTCC GAGCTCATGC GCCGGGCCCT GGAGTACGCC AAGGGAATGG GAATCATGGT CATCTCCCAT GCCGAGGATC TCTCCCTGGT GGGCGAGGGG GTCATGAACG AGGGCTTCGT CTCCACGGAG CTGGGGCTCA AGGGAATACC CTGGGCCGCC GAGGACGCTG CCACCGCCCG TGACGTGTAC CTGGCCGAGT TCACCAACTC GCCGCTCCAC ATCGCCCACG TCTCCACAAT GGGGTCATTG CGGATCATCC GTAACGCCAA GGCCCGCGGC GTGAAGGTTA CCTGCGAGAC GGCGCCCCAC TACTTCAGCC TCACCGACGA TGCAGTGCGC GGCTACAACA CCAATGCCAA GATGAATCCG CCGCTCCGTA CGGCCGATGA TCTGGCCGCG GTCAAAGAGG CCCTGAAGGA CGGCACCATC GACGCCATCG CCACCGACCA CGCCCCCCAC CATCTGGATG AGAAGGACGT GGAGTTCAAC GTGGCTTTGA ACGGCATCAT CGGCCTGGAA ACCTCCCTGC CGCTGTCGCT GAAGCTGGTG GAGGAGGGAG TGTTGACCCT GCCGGCACTG GTTGAGAAGA TGGCGTGCAA CCCGGCCGCG ATTCTCGGCA TTGACCGGGG CACGCTCCGG CAAGGCGCGG TTGCCGACAT CACGGTTATT GATCCGGCGG CCGTCTGGAC GGTGGAGGCC GGTGCGCTCG CCAGCAAGTC CAAGAACTCA CCCTTCCTCG GCTGGGAGAT GAAAGGTGCC GCGGCATACA CCATCGTCGG CGGCACGGTG GTCCACAGCA GAGGATGA
|
Protein sequence | MNLLIKGGRV IDPSQGIDEV LDILVENGAI KELGKGLAAP AGAGVVDAAG LIVTPGLIDM HVHLRDPGLE YKEDIVTGTR AAAAGGFTSV ACMPNTKPVN DNKAVTSYIV AKAKAEGLVN VFPVGSITQG SKGDALAEMG DLKEAGCVAV SDDGRPVTSS ELMRRALEYA KGMGIMVISH AEDLSLVGEG VMNEGFVSTE LGLKGIPWAA EDAATARDVY LAEFTNSPLH IAHVSTMGSL RIIRNAKARG VKVTCETAPH YFSLTDDAVR GYNTNAKMNP PLRTADDLAA VKEALKDGTI DAIATDHAPH HLDEKDVEFN VALNGIIGLE TSLPLSLKLV EEGVLTLPAL VEKMACNPAA ILGIDRGTLR QGAVADITVI DPAAVWTVEA GALASKSKNS PFLGWEMKGA AAYTIVGGTV VHSRG
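The grouped sequences above can be cross-checked against the GC content and protein fields. The following is a minimal sketch, assuming the 10-character blocks are display formatting only. It uses the amino acid assignments of NCBI translation table 11, which are the same as the standard code for all 64 codons. It does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The helper names are hypothetical, and only the first 30 bases of the gene are used as example input.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    prefix = clean_sequence("ATGAACCTGC TGATAAAAGG TGGGAGGGTG")
    print(translate(prefix))  # MNLLIKGGRV, the first block of the protein sequence
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` would allow the same comparison against the GC content and Protein sequence fields of the record.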