Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1550 |
Symbol | |
ID | 4075848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1656807 |
End bp | 1657865 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006863 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_613545 |
Protein GI | 99081391 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.11793 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.477736 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGA GCGAGAAACT TGCCCTCGGG CTGATGCATC GCCTTGATCC CGAAACCGCG CATGGTCTGT CGATCAAGGC GCTCAGAGCG GGGCTGACGC CACGCCCTGG TCCGGTGACA TCACCGCGCC TGCGCACGGA TGTGGCGGGT CTTTCGCTGC CGAACCCGGT GGGGCTTGCA GCCGGGTTTG ACAAGAACGC CGAAGCGCTT GCTCCGCTTT CAGAGGCTGG CTTTGGATTT ATCGAAGTGG GGGCCGCCAC GCCGCGCCCG CAACCGGGCA ACCCCAAGCC GCGCCTTTTT CGTCTGAGCG AAGATCGCGC CGCCATCAAC CGCTTTGGCT TTAACAATGA GGGCATGGAC ACGATCGGGA AGCGCCTCGC GCAGCGTCCA AAGTCGGGCG TGATCGGCCT CAACCTCGGG GCCAACAAGG ACAGCGAGGA CCGCGCGCAG GATTTCGCCC GCGTGCTCAG CCATTGCGGC GCGCATCTGG ATTTTGCCAC CGTGAACGTG TCGTCGCCCA ACACAGAGAA ACTGCGCGAT CTGCAGGGCA AGGATGCTCT TGCTTCACTG CTGGCAGGGG TCATTGACGC CCGAGAGGCC CTGCAGCGCC CCATCCCGGT CTTTCTCAAG ATTGCGCCGG ATCTCGACAT ATCCGGGCTT GATGACATTG CCGAGGTCGC GCGTGACAGC GGCATTGATG CGGTGATCGC CACCAACACG ACGCTTTCGC GCGACGGCCT GAAAAGCACG CACCGGGACG AGATGGGCGG CCTCTCGGGC GCGCCCCTAT TTGAGCGCTC GACACGGGTG CTGGCGCAGC TTTCGCAACG TCTGGATGGG GCGGTGCCGA TCATCGGCGT CGGGGGCATC TCCACGGCTG AAGGCGCCTA TGCCAAGATC CGCGCTGGAG CCTCGGCGGT GCAGCTTTAT ACGGCGCTGG TCTACGGCGG GCTGTCGCTG GCCTCTGAGG TCGCTTCGGG TCTTGATGCA TTGCTCGCGC GAGACGGGTT TTCAAATGTC GCGGAAGCGG TTGGCACAGG GCGTGCGGAC TGGCTCTGA
|
Protein sequence | MKLSEKLALG LMHRLDPETA HGLSIKALRA GLTPRPGPVT SPRLRTDVAG LSLPNPVGLA AGFDKNAEAL APLSEAGFGF IEVGAATPRP QPGNPKPRLF RLSEDRAAIN RFGFNNEGMD TIGKRLAQRP KSGVIGLNLG ANKDSEDRAQ DFARVLSHCG AHLDFATVNV SSPNTEKLRD LQGKDALASL LAGVIDAREA LQRPIPVFLK IAPDLDISGL DDIAEVARDS GIDAVIATNT TLSRDGLKST HRDEMGGLSG APLFERSTRV LAQLSQRLDG AVPIIGVGGI STAEGAYAKI RAGASAVQLY TALVYGGLSL ASEVASGLDA LLARDGFSNV AEAVGTGRAD WL
|
| |