Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1755 |
Symbol | pyrD |
ID | 2687066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 1918043 |
End bp | 1918960 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637126435 |
Product | dihydroorotate dehydrogenase 1B |
Protein accession | NP_952805 |
Protein GI | 39996854 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0665746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAAC CTGACCTTTC CGTTGAGATT GCTGGCATCA CGCTGAGAAA CCCGGTCATG ACCGCCTCGG GCACCTTCGG CTACGGCGAG GAGTTCTCGG AGTACGTGAA CCTGGAGGCG ATCGGCGCCA TCATCACCAA AGGGCTGTCC TTGAAGCCCA AGGCCGGCAA TCCGACCCCG CGTATCGTCG AAACCACCGG TGGAATGCTG AACGCCATCG GTCTGCAAAA TGTGGGCATT GATGCCTTTG TCGAGAAAAA GGTACCGTTT CTCCGCACGG TTGCCACGCC GGTGATCGTT AATTTTTTCG GAAACACCCT CGAAGAATAT GCCGAACTCG CCGAGCGCCT GGACCTGATA CCCGAAGTGG CTGCCGTGGA GATCAATATT TCCTGCCCGA ACGTCAAACA TGGCGGGATC GTTTTCGGTA CAGACCCCAA GGCCGCCTAT TCGGTGGTAA AGGCAGTGCG GGAGGCCACC ATCAAGCCGG TGATCGTCAA GCTGTCGCCC AATGTGACCG ATATTGTGGA GATGGCCTGG GCCTGTGCCG ATGCGGAAGC CGACGCCCTT TCCCTCATTA ATACCCTCAC CGGCATGGCC ATTGATCTGG ACAAGCGCCG TCCAATCCTG GCCAACGTGA CCGGCGGCCT CTCCGGACCG GCGGTGAAGC CGATCGCGCT ACGCATGGTC TGGCAGGTTG CCAGGGCGGT GAAGATTCCG GTTATCGGCA TAGGGGGCAT CATGACCGGC ATCGATGCTC TGGAGTTCAT GCTTGCCGGC GCAACGGCCG TGCAGGTTGG CACCGCCAAC TTCCTGGATC CGGGCGCCGC GGGCCGGATT GCTGCGGAAA TGGAGCGATA TCTGGCGGAT AACGGTATCG CCGATGTTAA GGAGATGATC GGCGCCCTGG AGGTCTAG
|
Protein sequence | MQKPDLSVEI AGITLRNPVM TASGTFGYGE EFSEYVNLEA IGAIITKGLS LKPKAGNPTP RIVETTGGML NAIGLQNVGI DAFVEKKVPF LRTVATPVIV NFFGNTLEEY AELAERLDLI PEVAAVEINI SCPNVKHGGI VFGTDPKAAY SVVKAVREAT IKPVIVKLSP NVTDIVEMAW ACADAEADAL SLINTLTGMA IDLDKRRPIL ANVTGGLSGP AVKPIALRMV WQVARAVKIP VIGIGGIMTG IDALEFMLAG ATAVQVGTAN FLDPGAAGRI AAEMERYLAD NGIADVKEMI GALEV
|
| |