Gene Hhal_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1070 
SymbolprpD 
ID4709860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1161940 
End bp1163385 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content70% 
IMG OID639855541 
Product2-methylcitrate dehydratase 
Protein accessionYP_001002648 
Protein GI121997861 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.327536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGAG ACATCCGCAG CGCCCAGCGC CCCGACCCCG ATGCCGAGCT GGTGGCCATC 
GCCGACTACG TCAGCGGGAC CGCCATCGAC TCCGCCGAGG CCTACCAGAC CGCCCGCCAC
TGCCTGATGG ACAGCCTCGC CTGCGCCATG ATGGCGCTCG ACTACCCGGC CTGTACCAAA
CTGCTCGGCC CCATCGTCCC GGGGGCCGAG ATGCGCGACG GGGTGGTGGT GCCGGGCACC
GCCCATCGGC TCGATCCCGT GCAGGGGGCG TTCAATATCG GCGCGCTGAT CCGCTGGCTG
GACTTCAACG ATACCTGGCT GGCGGCGGAG TGGGGCCACC CTTCGGATAA CCTCGGTGGC
ATCCTGGCGG CGGCCGATTA CGAGAGCCGT CGGCGCCTGG CGCTGGGTCA GACACCGCTG
ACCATGCGCG AGGTGCTGAC TGCGGCCATC AAGGCCCACG AGATCCAGGG CGTGCTGGCG
CTGGAGAACA GCTTCAACCG TGTCGGCCTC GACCACGTCC TCCTGGTGCG ACTGGCGACC
ACCGCGGTGG CCACGCAGCT GCTCGGCGGC AGCCGCGAGG ACATCATCAA CGCCGTCTCC
AACGCCTGGA TCGACGGCGG CGCCCTGCGG ACCTATCGCC ACGCCCCGAA CACCGGCTCG
CGCAAGAGCT GGGCCGCCGG GGACGCCTCG GCCCGCGGGG TCCGGCTGGC GCTGATGAGC
CTCACCGGCG AGATGGGCTA CCCGTCGGCA CTCACCGCCC AGGGGTGGGG CTTCCAGGAC
GTGCTCTTCC GCGGGGAACC GATCCGCCGC TCCCAGGCGT ACGCCAGTTA CGTCATGGAG
CACGTGCTGT TCAAGATCTC CTTCCCCGCC GAGTTCCACG CTCAGACCGC GGTGGAGGCG
GCCCTGCAGC TGCACGCCGA GGTGGCGCCT CGACTCGACG AGGTCGACCG GGTGGTCATC
GAGACCCAGG AGGCGGGGGT GCGGATCATC GACAAGACCG GCCCGCTGAA CAATCCTGCC
GACCGCGACC ACTGTCTCCA GTACATGGTG GCGGTGCCGC TGATCTTCGG CCGCCTGACC
GCCGCCGACT ACGAGGATGC GGTGGCGGCC GACCCGCGTA TCGACGCCCT GCGCGAGCGC
ATGGAGGTCT CCGAGAACGA GGCCTTCTCC CGGGACTACA TGGACCCGGA GAAGCGCTCC
ATCGGGAACG CCGTGCAGGT CTTCTTCAAG GATGGCTCCA AGACCGAGCG CATCGCCGTG
GAGTACCCCA TCGGCCACCG GCGTCGCCGT GACGAGGGGA TCCCGGTGCT GGAGGAGAAG
TTCCGCAACG CCCTGGCGGC GCGGTTCGCC CCGCGGCAGG CTGGCGCCAT CGAGGCGGCA
CTGTCCGACC AGGCGGGTCT GGAACAGATG CCGGTCCACG CCTTTATGGA TCTGTGGCGG
GCCTGA
 
Protein sequence
MSGDIRSAQR PDPDAELVAI ADYVSGTAID SAEAYQTARH CLMDSLACAM MALDYPACTK 
LLGPIVPGAE MRDGVVVPGT AHRLDPVQGA FNIGALIRWL DFNDTWLAAE WGHPSDNLGG
ILAAADYESR RRLALGQTPL TMREVLTAAI KAHEIQGVLA LENSFNRVGL DHVLLVRLAT
TAVATQLLGG SREDIINAVS NAWIDGGALR TYRHAPNTGS RKSWAAGDAS ARGVRLALMS
LTGEMGYPSA LTAQGWGFQD VLFRGEPIRR SQAYASYVME HVLFKISFPA EFHAQTAVEA
ALQLHAEVAP RLDEVDRVVI ETQEAGVRII DKTGPLNNPA DRDHCLQYMV AVPLIFGRLT
AADYEDAVAA DPRIDALRER MEVSENEAFS RDYMDPEKRS IGNAVQVFFK DGSKTERIAV
EYPIGHRRRR DEGIPVLEEK FRNALAARFA PRQAGAIEAA LSDQAGLEQM PVHAFMDLWR
A