Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4359 |
Symbol | |
ID | 5736219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5567403 |
End bp | 5568473 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281520 |
Product | dihydroorotate dehydrogenase |
Protein accession | YP_001547119 |
Protein GI | 159900872 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.42236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGCT ATCAGCTTGC AAAAGCACTT TTGTTTCGTT TACCACCAGA AAAAGCTCAT CGGTTAACCA CTTTAGGCCT AGATTTGGCC ACTCGTTTAC CATTTACCTC AGCTTTATTC CGTTCCTTCC ATCACAATGA CCCATTGCTT AAAACCAATT TATGTGGGTT AACGTTTAAT AATCCGGTTG GTTTAGCAGC AGGTTTCGAT AAAGATGGCA CGCATATCCG TGGAATGAGT CAATTGGGTT TTGGTTTTTT GGAATTGGGC ACAGTTACGC CCAAGCCTCA AGCTGGCAAT GAACAGCCGC GTTTATTTCG TTTAATCGAG GATCATGCAT TAATCAATCG AATGGGATTT AATAATGCAG GAATTGCAGC GCTTGCTCAA CGTTTAGCCA AACAGCCACG CATCATTCCA CTTGGGATTA ATTTGGGCAA GAATAAAATT ACGCCAAACG AACAAGCTGC TGATGATTAT CGCCAAGGTA TTAATTTGCT TGGTGAATAT GCCGATTACA TTGTGATCAA TATTTCTTCG CCGAATACGC CTGGTTTGCG CGAACTCAGC CGCCGCGAGC CATTGACTGA ACTATTGCAG GTTGTCAAAA CTGCCCGCCA ACAATTACGC CATCAAGCCC CGTTGTTCGT TAAACTCTCG CCCGATGAAG ATCGCGAAGG CTTGGAGGCA GCGCTTGGCG CAGCCCTCGA CGCTGGAGTT GATGGGATTA TCGCCACCAA TACAACCGTC AGTCGCGAAA ATTTACGTTC TGCTCAGCAA ACCGAAACTG GCGGCTTAAG TGGCGCTCCG CTCAAAACCA AGGCCTTGGC AACCCTCAAA TATATCTATC AAACAACCAA CGGCAAATTG CCCTTGATTG GCGTTGGCGG AATTGCCAAC GGCCAAGATG CTTACGAACG GATTTTGGCT GGCGCGAGTG CCGTGCAACT CTATACCAGC CTGATCTATG CCGGGCCACA ATTGGTTGGC ACAATCAACC GCGAGCTAGC AGCATTACTA CGGCGCGATG GCTTTGATTC AATTCAAACA GCCGTTGGGT CAGCAGTTTA G
|
Protein sequence | MRSYQLAKAL LFRLPPEKAH RLTTLGLDLA TRLPFTSALF RSFHHNDPLL KTNLCGLTFN NPVGLAAGFD KDGTHIRGMS QLGFGFLELG TVTPKPQAGN EQPRLFRLIE DHALINRMGF NNAGIAALAQ RLAKQPRIIP LGINLGKNKI TPNEQAADDY RQGINLLGEY ADYIVINISS PNTPGLRELS RREPLTELLQ VVKTARQQLR HQAPLFVKLS PDEDREGLEA ALGAALDAGV DGIIATNTTV SRENLRSAQQ TETGGLSGAP LKTKALATLK YIYQTTNGKL PLIGVGGIAN GQDAYERILA GASAVQLYTS LIYAGPQLVG TINRELAALL RRDGFDSIQT AVGSAV
|
| |