Gene Htur_5166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5166 
Symbol 
ID8745714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp58822 
End bp59892 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content67% 
IMG OID646515523 
Productdihydroorotate dehydrogenase 
Protein accessionYP_003406470 
Protein GI284176193 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.313486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTGT ACTCGCGGGT TCGCCCCCTC GCGTTCAAGC TGCCGGCCGA GACGGCCCAC 
GACCTCGGCA AGCGAACGCT CCGGGCGGCC CAGTCGACGT GGCCGACGCG GCGAGCCCTC
GCCGCGGCCT ATCGGTATGA CCATCCCGCG CTCGAGGTCG ACCTGTTCGA CTCGACGTTT
CCGAACCCGG TGGGGATCGC GGCCGGCTTC GACAAGAACG CCGAGGTGAC CCACGCCCTC
GAGGCGCTCG GCTTCGGGTT CGTCGAAATC GGCACCGTCA CGCCCTATCC GCAAGAAGGC
AACGACCGTC CCCGGCTATT CCGGCTGCGG GAGGACGAGG GGATGATCAA TCGGATGGGC
TTCAACGGGC AGGGAATGGA GACCGTCAAG GAACGACTCG AGGAAGACGG CACGCCGGGA
TTCCCGCTTG GCGTCAACAT CGGGAAGATG AACTCCTCGA CCGAACGGGA GGCGATCGAG
GACTACCGAC GGGTCTTCGA TCGGGTCTCG CCGTTCGCCG ACTACGTCGT CGTCAACGTC
TCCTGTCCGA ACACGCCCGA CGAGTTCAAC GAGGCCTCGC CCGAGCATCT GCGGGCGATC
TTCGAAACCC TCGAGGCCGA GAACGACGGG AACGTGCCGA TGCTGGTGAA GATCGGTCCC
GACGAGCCCG AGGACGCGAT TTTGGATCTC GTCGATATCG TTCAGGAGTT CGGTCTGGAC
GGGATCGTCG CGACGAACAC CTCGACGGCT CGCGAGGGGC TCGAGTCGCC CGCCCGTGAG
GAGTGGGGCG GACTCAGCGG CGCCCCCATC GAAGACAGAT CCACCGACGT GATCCGAACG
ATCGCCGGGC ACACGGACGG CGAACTCCCG ATCGTCGGCG TCGGCGGCGT CGATTCGGCC
GCGAGCGCCT ACGAGAAGAT TCGCGCGGGC GCGTCGCTCG TGCAACTCTA TACGGGGTTC
GTCTACCGGG GGCCGTCGAC GGCCGGGCGG ATCAACCAGG GACTGGTCGA CCTGCTCGAG
CGCGACGGAT TCTCGTCGGT CGAGGACGCG GTCGGCGCCG ATCTCGAGTA G
 
Protein sequence
MTLYSRVRPL AFKLPAETAH DLGKRTLRAA QSTWPTRRAL AAAYRYDHPA LEVDLFDSTF 
PNPVGIAAGF DKNAEVTHAL EALGFGFVEI GTVTPYPQEG NDRPRLFRLR EDEGMINRMG
FNGQGMETVK ERLEEDGTPG FPLGVNIGKM NSSTEREAIE DYRRVFDRVS PFADYVVVNV
SCPNTPDEFN EASPEHLRAI FETLEAENDG NVPMLVKIGP DEPEDAILDL VDIVQEFGLD
GIVATNTSTA REGLESPARE EWGGLSGAPI EDRSTDVIRT IAGHTDGELP IVGVGGVDSA
ASAYEKIRAG ASLVQLYTGF VYRGPSTAGR INQGLVDLLE RDGFSSVEDA VGADLE