Gene Cpha266_0850 details


Gene Information

Locus tag: Cpha266_0850
Symbol:
ID: 4570444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 973107
End bp: 974165
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 52%
IMG OID: 639765448
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_911325
Protein GI: 119356681
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
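
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 973107
end_bp = 974165
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp)              # 1059
print(span_bp % 3)          # 0, i.e. a whole number of codons
print(expected_protein_aa)  # 352
```

Under these assumptions the coordinate span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.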


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.898844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAGA TTGTTTCAAT ACCGGGTGAC GGTATCGGCC CGGAAGTCGT TGCTGGCGCC 
GTTACCGTGC TCAGAAAAAT CTCTGAAAAG CACGGTTTCG AAATCCGCAT TGAGGAACAC
CCTTTTGGAG GCGCATCCTA CGACCTGCAC GGAACCATGC TTACCGATCA AACGCTTGAA
GCGTGCAAAA ACTGCGATGC CGTTCTGCTT GGAGCCGTAG GAGGACCGAA ATGGGAGAAT
CTCCCGCATG AGCACAAACC CGAAGCAGCT TTGCTCAAAC TCAGAAAGTC GCTCGGTCTC
TTCGCTAACC TGAGGCCGGC AAAAGTCTAT GATCCCCTTG TTGACGCTTC GTCTCTCAAG
GCAGAAGTCG TGCGGGGAAC AGATTTTCTT GTCTTCAGGG AGCTGATCGG CGGCATCTAT
TTCGGAGAGC CGAGAGGATA TGACGAAAAC AGAGGGTGGA ACACCATGGT CTATGAACGC
CATGAAGTTG AGCGCATAGC CCGCCTTGCC TTTGAAGCTG CCCAGAAGCG TGGCGGACGG
GTTATCTCCA TAGACAAAGC CAATGTGCTT GAAGTTTCCC AGTTCTGGAG AAATGTCGTA
CATGAGGTAC ACCGGGAGTT TCCCGACATA GAACTCAGCG ACATGTATGT TGACAACGCT
GCCATGCAGA TTGTCAGAAA CCCCTTGCAA TTTGACGTTA TCGTCACAGG AAACCTTTTT
GGTGACATAC TCAGCGACAT TGCGGGCATG ATCACCGGTA GTCTTGGAAT GCTTCCTTCG
GCCAGCATCG GAACAAGCCA TGCTCTCTAC GAACCTATTC ACGGCAGTGC GCCGGACATT
GCAGGAAAAA ACATTGCGAA CCCCATTGCG ACCATCGCAT CGGTAGCCAT GATGTTCGAA
CACAGCTTCT GCATGCCTGA TATAGCCGAA GAGATCAGCC AGGCTATTGT ATCGGCCCTT
GCGGCCGGCC TCAGAACCGC AGATATTGCC GGGGCGGGTG ACAGAATTGT TTCAACTACT
GAAATGACCG AAGCCATCGT CACCAGCCTC GGTTCATAA
 
Protein sequence
MYKIVSIPGD GIGPEVVAGA VTVLRKISEK HGFEIRIEEH PFGGASYDLH GTMLTDQTLE 
ACKNCDAVLL GAVGGPKWEN LPHEHKPEAA LLKLRKSLGL FANLRPAKVY DPLVDASSLK
AEVVRGTDFL VFRELIGGIY FGEPRGYDEN RGWNTMVYER HEVERIARLA FEAAQKRGGR
VISIDKANVL EVSQFWRNVV HEVHREFPDI ELSDMYVDNA AMQIVRNPLQ FDVIVTGNLF
GDILSDIAGM ITGSLGMLPS ASIGTSHALY EPIHGSAPDI AGKNIANPIA TIASVAMMFE
HSFCMPDIAE EISQAIVSAL AAGLRTADIA GAGDRIVSTT EMTEAIVTSL GS
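
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is shown in the coding orientation starting in frame, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI's TCAG codon order (third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and the final line of the gene sequence, copied from the record.
first_block = "ATGTATAAGA TTGTTTCAAT ACCGGGTGAC"
last_line = "GAAATGACCG AAGCCATCGT CACCAGCCTC GGTTCATAA"

print(translate(first_block))  # MYKIVSIPGD
print(translate(last_line))    # EMTEAIVTSLGS*
```

The first output matches the first ten residues of the protein sequence, and the second matches its final twelve residues followed by the stop codon.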