Gene Hhal_0179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0179 
Symbol 
ID4710855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp207844 
End bp209541 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content68% 
IMG OID639854637 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001001775 
Protein GI121996988 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGATA ACCGTCGCAG CCGAGTGATC ACCCAGGGCG TGGCCCGAAC CCCCAACCGC 
GCGATGCTGC GCGCCGTCGG TTTCCGGGAT GAGGATTTCG AGAAGCCGAT CATCGGGGTC
GGCAACGCGC ACAGTACGAT CACCCCCTGC AACGTCGGGA TCGGCGCCAT GGCGCGGCGC
GCCGAGGAGG CGCTGCGCGA GGTGGGGGCG ATGCCGATGA AGTTCGGCAC CATCACCGTC
TCCGACGGGA TCACCATGGG CACCGAGGGC ATGAAGTACT CCCTGGTCTC CCGCGAGGTG
ATCGCCGATT CGATCGAGAC GGTGGGCGGT GGCCAGCGGA TGGACGGCAT GCTCGCCACT
GGTGGCTGCG ACAAGAACAT GCCCGGGGCG ATGATGGCCC TGGCCCGTCT GGATATCCCG
GGTATCTTCG TCTATGGCGG CACCATCAAG CCGGGTCATT ACAAGGGCGA GGATCTCACC
GTGGTCAGCG CCTTCGAGGC GGTGGGGCAG TACAACGCCG GCAACCTCTC CGAGGCCGAT
CTCAAGGGGG TCGAGGAGAA CGCCTGCCCC GGGGCGGGCG CCTGTGGCGG CATGTTCACG
GCCAACACCA TGTCCAGCGC CTTCGAGGCC ATGGGCATGA GCCTGATGGG CTCGTCCACC
GTCTCTGCCG AGGACGACGA GGCCCGGGAT GTGGCCGCCG AGGCGTCCCG GGTCCTGATG
GACGCTGTCC ATCACGACCG GCGCCCGTCC TCGATCCTCA CCCGCGAGGC CTTCGAGAAC
GCCTTTGCGG TGGTGATGGC CCTGGGCGGC TCCACTAACG CCGTGCTCCA CCTGCTGGCC
ATCGCCAATA CCGCCGAGGT GCCGTTCGAC CTCGACGACG TCGAGCGCAT CCGCCGCAAG
GTGCCGGTAC TCTGCGACCT CAAGCCGTCG GGCCGGTTCG TGACCAGTCA GTTCCACGAG
GTCGGCGGCA CGCCGCAGGT GATGCGCATC CTCCTGGAGC AGGGGCTGTT GCACGGCCAC
TGCCAGACCA TCACCGGGCA GACCATCGAG GCCCTCATCG GCCACCTGCC GCCGGAGCCG
CCGGCGGATC AGGAGATCAT CATGCCCTTC GATCGGCCGC TCTATCCCGA GGGTCACCTG
GCAATCCTGC GGGGCAACCT GGCTGAAGAG GGTGCCGTGG CCAAGGTCAG TGGCATCCAA
CAGCGCCGTA TCAGCGGACC GGCCCGCGTG TTCGATTCCG AGGAGGAGTG CCTGGAGGCC
ATCCTGGCCG ACGGGGTGCA GGCCGGTGAC GTGGTGATCG TCCGCTACGA GGGGCCCAAG
GGAGGGCCCG GCATGCGCGA GATGCTGGCG CCCACCTCGG CGATCATCGG CAAGGGGCTC
GGCGATGCGG TGGGGCTGAT CACTGACGGC CGTTTCTCCG GCGGGACCTA CGGGATGGTG
GTCGGGCACG TGGCGCCTGA GGCCTACGAC GGCGGGACCA TCGCGCTGAT TGCCGAGGGC
GATACGGTGA CCATCGACGC CGACCAGAAC CTGCTTCAGG TGGAGGTGGA TGACGCGGAG
CTCGAGCGTC GCCGCAGCCA GTGGCAGGTG CCCCGGCCTC GCTACCGTCG CGGGGTGTTG
GGCAAGTATG CGCGCCTGGC GGCCTCGGCC AGTCGCGGTG CAGTGACCGA CGCCGACCTC
TTCCCCGAGC AGGAGTAG
 
Protein sequence
MADNRRSRVI TQGVARTPNR AMLRAVGFRD EDFEKPIIGV GNAHSTITPC NVGIGAMARR 
AEEALREVGA MPMKFGTITV SDGITMGTEG MKYSLVSREV IADSIETVGG GQRMDGMLAT
GGCDKNMPGA MMALARLDIP GIFVYGGTIK PGHYKGEDLT VVSAFEAVGQ YNAGNLSEAD
LKGVEENACP GAGACGGMFT ANTMSSAFEA MGMSLMGSST VSAEDDEARD VAAEASRVLM
DAVHHDRRPS SILTREAFEN AFAVVMALGG STNAVLHLLA IANTAEVPFD LDDVERIRRK
VPVLCDLKPS GRFVTSQFHE VGGTPQVMRI LLEQGLLHGH CQTITGQTIE ALIGHLPPEP
PADQEIIMPF DRPLYPEGHL AILRGNLAEE GAVAKVSGIQ QRRISGPARV FDSEEECLEA
ILADGVQAGD VVIVRYEGPK GGPGMREMLA PTSAIIGKGL GDAVGLITDG RFSGGTYGMV
VGHVAPEAYD GGTIALIAEG DTVTIDADQN LLQVEVDDAE LERRRSQWQV PRPRYRRGVL
GKYARLAASA SRGAVTDADL FPEQE