Gene Haur_4359 details


Gene Information

Locus tag: Haur_4359
Symbol:
ID: 5736219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5567403
End bp: 5568473
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 47%
IMG OID: 641281520
Product: dihydroorotate dehydrogenase
Protein accession: YP_001547119
Protein GI: 159900872
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
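
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 5567403
end_bp = 5568473
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon that contributes no residue to the protein.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 1071 0 356
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.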


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.42236
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAGCT ATCAGCTTGC AAAAGCACTT TTGTTTCGTT TACCACCAGA AAAAGCTCAT 
CGGTTAACCA CTTTAGGCCT AGATTTGGCC ACTCGTTTAC CATTTACCTC AGCTTTATTC
CGTTCCTTCC ATCACAATGA CCCATTGCTT AAAACCAATT TATGTGGGTT AACGTTTAAT
AATCCGGTTG GTTTAGCAGC AGGTTTCGAT AAAGATGGCA CGCATATCCG TGGAATGAGT
CAATTGGGTT TTGGTTTTTT GGAATTGGGC ACAGTTACGC CCAAGCCTCA AGCTGGCAAT
GAACAGCCGC GTTTATTTCG TTTAATCGAG GATCATGCAT TAATCAATCG AATGGGATTT
AATAATGCAG GAATTGCAGC GCTTGCTCAA CGTTTAGCCA AACAGCCACG CATCATTCCA
CTTGGGATTA ATTTGGGCAA GAATAAAATT ACGCCAAACG AACAAGCTGC TGATGATTAT
CGCCAAGGTA TTAATTTGCT TGGTGAATAT GCCGATTACA TTGTGATCAA TATTTCTTCG
CCGAATACGC CTGGTTTGCG CGAACTCAGC CGCCGCGAGC CATTGACTGA ACTATTGCAG
GTTGTCAAAA CTGCCCGCCA ACAATTACGC CATCAAGCCC CGTTGTTCGT TAAACTCTCG
CCCGATGAAG ATCGCGAAGG CTTGGAGGCA GCGCTTGGCG CAGCCCTCGA CGCTGGAGTT
GATGGGATTA TCGCCACCAA TACAACCGTC AGTCGCGAAA ATTTACGTTC TGCTCAGCAA
ACCGAAACTG GCGGCTTAAG TGGCGCTCCG CTCAAAACCA AGGCCTTGGC AACCCTCAAA
TATATCTATC AAACAACCAA CGGCAAATTG CCCTTGATTG GCGTTGGCGG AATTGCCAAC
GGCCAAGATG CTTACGAACG GATTTTGGCT GGCGCGAGTG CCGTGCAACT CTATACCAGC
CTGATCTATG CCGGGCCACA ATTGGTTGGC ACAATCAACC GCGAGCTAGC AGCATTACTA
CGGCGCGATG GCTTTGATTC AATTCAAACA GCCGTTGGGT CAGCAGTTTA G
 
Protein sequence
MRSYQLAKAL LFRLPPEKAH RLTTLGLDLA TRLPFTSALF RSFHHNDPLL KTNLCGLTFN 
NPVGLAAGFD KDGTHIRGMS QLGFGFLELG TVTPKPQAGN EQPRLFRLIE DHALINRMGF
NNAGIAALAQ RLAKQPRIIP LGINLGKNKI TPNEQAADDY RQGINLLGEY ADYIVINISS
PNTPGLRELS RREPLTELLQ VVKTARQQLR HQAPLFVKLS PDEDREGLEA ALGAALDAGV
DGIIATNTTV SRENLRSAQQ TETGGLSGAP LKTKALATLK YIYQTTNGKL PLIGVGGIAN
GQDAYERILA GASAVQLYTS LIYAGPQLVG TINRELAALL RRDGFDSIQT AVGSAV
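
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of one way to do that, using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the source database. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, for illustration only.
first_line = "ATGCGCAGCT ATCAGCTTGC AAAAGCACTT TTGTTTCGTT TACCACCAGA AAAAGCTCAT"
last_line = "CGGCGCGATG GCTTTGATTC AATTCAAACA GCCGTTGGGT CAGCAGTTTA G"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3])   # prints: 60 ATG
print(len(tail), tail[-3:])  # prints: 51 TAG
```

Passing the whole gene sequence through `clean_sequence` and then `gc_fraction` should give values comparable to the gene length and GC content listed in the Gene Information fields, although the rounding used by the source page is not stated.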