Gene Hhal_0211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0211 
Symbol 
ID4710987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp243786 
End bp244814 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID639854670 
Productdihydroorotate oxidase 
Protein accessionYP_001001807 
Protein GI121997020 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.203581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGCCC TGATCCGCCA ACTGCTGTTC CGCCTCGAGC CCGAACAAGC CCATCGCGTG 
AGTATGCAGC TCGCCCGCTT GGGTCTGCGT ATCGCCGCCG TCCCCGGGGT GCGCAGCCTG
CCGGCCGTGC CGCGGCGGGT GATGGGTATC GATTTCCCCA ATCCGGTGGG CCTGGCCGCC
GGCTTTGATA AGGATGGCGA GTACATGGAC GTACTCGAGC AGCTCGGCTT TGGCTTCCTG
GAGTTGGGCA CGGTAACGCC CCGCGCGCAA CCGGGTAATC CGCAGCCGCG GGTCTTCCGC
ATCCCCGAGC ACGAGGCCCT GATCAACCGC ATGGGTTTCA ACAACCAGGG GGCCGAGCCG
CTGGTCCGCC GGCTGGAGGT CTCGCGCCAC CGCGGTGTGG TGGGTATCAA CATCGGCAAG
AATCGGGATA CACCCCCCGA GCGGGCCGTC GAAGACTACG CCCAGGCGCT GGGGATGGTT
TACGGGGTGG CCGACTATGT GGCGGTCAAC CTCAGCTCGC CGAACACCCC GGGGCTGCGC
GACCTGCAGC ACGAGGGCGC GCTGCGCAAC CTGATCGACC GCCTGCAGAC CGAGCGCAAG
CGGTTGGCCG AGCTGCACGA CAAACGGGTG CCGCTGGTGG TCAAGATCGC CCCGGACTGG
GAGGCCGGGG AGCTGGACGC CACCCTGGAT ATCCTGCTCG AACGCCGGGT GGACGGCATC
GTCGCCACCA ACACCACCCT CGGGCGCACC GGGGTGGAGC AGACCCCCCA GGCCCGCGAG
AGTGGGGGGC TCAGCGGTGC GCCGTTGCGG GAGCAGGCCG AGTGGGTCCT GGAGCAGGTG
GCGGCCCGCC GTGATCGGCG GACGGCCCTG ATCGCTGCCG GGGGGATCAT GAGCGGTGAG
GACGTGACCC GGCGCCTCGA TCTCGGTGCG GATCTGGTCC AGCTCTATAC CGGCATGATC
TACCGCGGTC CCGGCCTGGT CCAGGAGGCC GTGCGAGCCG CCGCCCGCCA CGCCGGGCAG
CCCGCCTAG
 
Protein sequence
MYALIRQLLF RLEPEQAHRV SMQLARLGLR IAAVPGVRSL PAVPRRVMGI DFPNPVGLAA 
GFDKDGEYMD VLEQLGFGFL ELGTVTPRAQ PGNPQPRVFR IPEHEALINR MGFNNQGAEP
LVRRLEVSRH RGVVGINIGK NRDTPPERAV EDYAQALGMV YGVADYVAVN LSSPNTPGLR
DLQHEGALRN LIDRLQTERK RLAELHDKRV PLVVKIAPDW EAGELDATLD ILLERRVDGI
VATNTTLGRT GVEQTPQARE SGGLSGAPLR EQAEWVLEQV AARRDRRTAL IAAGGIMSGE
DVTRRLDLGA DLVQLYTGMI YRGPGLVQEA VRAAARHAGQ PA