Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0373 |
Symbol | pyrC |
ID | 3927284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 364710 |
End bp | 366053 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901497 |
Product | dihydroorotase, multifunctional complex type |
Protein accession | YP_507193 |
Protein GI | 88658539 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0824446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAAC AATCTTGGGA ATTGTTAGGC AATGGGCAAC ATGCAGATTT AACCGTTGCA TATATTAATG CAAGAATCAT TGACCCAGAA TCTAAGTTAG ATATCAGAGG TTCGCTACTT ACCAAAGGAG ATAAGATTAT TGATTTTGGA CCTGATCTAT TTGCTAATGG TATACCAAGT ACTATAGATG AGGTCATAGA TTGCAATAAT AACATACTGT TACCAGGTTT AATTGATATT CATGTGCATT TCCGAGAACC AGGCCAAGAG CATAAAGAAA CCATCAATAC AGGAAGCAAA TCTGCTGCAG CTGGAGGTAT AACAACTGTA GTTTGCCAAC CCAATACTAT CCCTACTATC AGTAGTGTAA TCACTGCAAA ATACATTAAA ATGAGAGCTC TAGAAAGTGC TTATGTCAAC ATAGAGTTTT ATGCTTCTAT AACAAAATCA GATAATTCAT TATCCGATAT GGCACTGTTA AAAGAAGTAG GAGCAGTTGG TTTTACTGAT GATGGCATGC CAGTAATGAA CGCTTTAACA ATGAGACAAG CACTAAGTTA CTCAAGTATG TTGGATACCG TTATAGCACA ACACGCAGAA GACCTAAACA TATCTAATAA CGGATGTATC AATGAAGGAA TAATATCATA TGAATTAGGA TTAAAAGGTA TACCTGATAT TTCAGAATCT ATAATAGTAA ATCGTGATAT TGCTTTAATG AAAAATATCA AAAATGTACA TTATCACATT TTACATGTCT CATCCCAAGA GTCTTTACAC ATAATAAAAC AAGCAAAAAG TCAAGGACTA AAAGTTACTT GTGAAGTAAC TCCTCATCAT TTCACTTTAA CTGAAAGAGA CATAATGACA CATGGTAGTC TTGCAAAAAT GAATCCTCCT TTGCGAACTG AAAATGATCG CCTTAGTATG ATAGAAGGAT TAAAAAGTGG CATAATAGAT TGTATAGCAA CAGATCATGC TCCTCATGAT ATCAATGCTA AGGAATTACC ATTAGATACA GCTGCTTTTG GAATAGTTGG ATTAGAAACA ATGCTTCCAA TTTCTTTAGA ATTATATCAT AATGGTACCA TGCCTTTAAT AGATCTACTA GCAACATTAA CATACAAGCC AGCTGATATT ATAAAGGTAC CAAGAGGGCG TATAAAAAAA GACTATGTTG CAGATCTCAT CATATTGGAT TTAGATCATG AATGGGTAGT TGACATATCA AAATTTGCAA GTAAATCAAA AAATTCCCCC TTTCACAACC GTAAAGTAAA AGGTAAAGTT TTAAGGACTA TAGTATCTGG CAAAACTACT TACAAAGCAG AAATCATAAT CTAA
|
Protein sequence | MYKQSWELLG NGQHADLTVA YINARIIDPE SKLDIRGSLL TKGDKIIDFG PDLFANGIPS TIDEVIDCNN NILLPGLIDI HVHFREPGQE HKETINTGSK SAAAGGITTV VCQPNTIPTI SSVITAKYIK MRALESAYVN IEFYASITKS DNSLSDMALL KEVGAVGFTD DGMPVMNALT MRQALSYSSM LDTVIAQHAE DLNISNNGCI NEGIISYELG LKGIPDISES IIVNRDIALM KNIKNVHYHI LHVSSQESLH IIKQAKSQGL KVTCEVTPHH FTLTERDIMT HGSLAKMNPP LRTENDRLSM IEGLKSGIID CIATDHAPHD INAKELPLDT AAFGIVGLET MLPISLELYH NGTMPLIDLL ATLTYKPADI IKVPRGRIKK DYVADLIILD LDHEWVVDIS KFASKSKNSP FHNRKVKGKV LRTIVSGKTT YKAEIII
|
| |