Gene Information |
Locus tag | ECH_0940 |
Symbol | pyrD |
ID | 3927413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 960112 |
End bp | 961164 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902056 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_507729 |
Protein GI | 88657936 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
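The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers agree under that reading) and that the final codon is a stop codon that contributes no residue. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record above.
length = gene_length(960112, 961164)
print(length)                  # 1053
print(protein_length(length))  # 350
```

Both results match the Gene Length (1053 bp) and Protein Length (350 aa) fields.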
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.365408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCACTAC AACATATATT TACCAATCCG CTATTTTGTA TACCACCAGA AATTGCACAT AAACTAGTAA TTTTTGCACT AAAGAACAAT TTCATCCCCA CAAAAAAAAT TGAAATTCCA AGATCTCTTA ATATACAAGT CTTCAATAAG CTTCTCAGAA CCCCTATAGG GGTTGCTGCT GGTTTTGATA AAAATGCAGA AGTAATTCAA CCCTTATTGT CTATTGGGTT TGGATTTGTT GAAGTAGGAA CGGTAACAAA ACACCCTCAA AATGGTAATA AAAAACCACG TGTACATAGA TTAATTAGTA AAGAAGCAAT AATTAACAGT CTAGGCTTTA ATAATAAGGG GATAAATTTC TTAATAAAGA AAGTAAATAA CATTCAACTT AACCATTGTA TATTTGGAAT TAACATAGGA TTTAACAAAA AATCCAGTGA TCCTATCCAA GACTATTTCG ATTCAGTAAA GAAAGTATAT GGTCTAAGTA ATTATATTAC AATTAATATA TCATCTCCCA ATACCCCAGG ATTAGATGAG TTTCAAAAAA AAGATCTTCT TTCAGAATTA TTAACAGCTA TATCTCAAGT TCGAAAGCTA GCAGATTATG CAGAATCTGT TCCTATAATG CTCAAAATTT CACCAGACAT TAATGATAAC AAAAAACAAG ATATTGTAGA TTTAGCAATA AAATATAAAA TTAGCGGTTT AATAATCAGT AACACTTCAT CACAACATAA CAAACTATTA AATATGAATA CAAACATACA TGGAGGATTA AGCGGAAAAC CACTATTTGA TTTATCAACA CAAGTATTGT CTGAAATATA TCAAGCTTCC CAAGGAAAAC TTTTACTAAT AGGATGTGGT GGAGTAAGTA CTGGATATCA TGCATATGAA AAAATAAAAG CTGGTGCATC ATTAGTACAG TTATACACTG CTATAGTTTA CAATGGATTC AATATAGCTA ATAAGATAAG CTTAGAGTTA GCTGATCTTT TAGCCGCAGA TGGGTTTCCT ACAGTACGCC ATGCAATAGG CCATAATCAC TAG
|
Protein sequence | MSLQHIFTNP LFCIPPEIAH KLVIFALKNN FIPTKKIEIP RSLNIQVFNK LLRTPIGVAA GFDKNAEVIQ PLLSIGFGFV EVGTVTKHPQ NGNKKPRVHR LISKEAIINS LGFNNKGINF LIKKVNNIQL NHCIFGINIG FNKKSSDPIQ DYFDSVKKVY GLSNYITINI SSPNTPGLDE FQKKDLLSEL LTAISQVRKL ADYAESVPIM LKISPDINDN KKQDIVDLAI KYKISGLIIS NTSSQHNKLL NMNTNIHGGL SGKPLFDLST QVLSEIYQAS QGKLLLIGCG GVSTGYHAYE KIKAGASLVQ LYTAIVYNGF NIANKISLEL ADLLAADGFP TVRHAIGHNH
|
| |
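The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is read as methionine in that position. The following is a minimal sketch of that behaviour, not the tool used to produce this record: the `translate` helper is hypothetical, and the set of start codons is deliberately reduced to three common ones (table 11 permits more). The codon table is built from the conventional TCAG ordering.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (first base varies slowest).
# Elongation codons in table 11 are the same as in the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(bases): aa for bases, aa in zip(product("TCAG", repeat=3), AMINO)
}

# Simplifying assumption: only these three initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon terminates translation
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGTCACTAC AACATATATT TACCAATCCG"))  # MSLQHIFTNP
```

The output matches the first ten residues of the protein sequence in the record.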