Gene Cpha266_0849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0849 
Symbol 
ID4570443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp971513 
End bp973084 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content52% 
IMG OID639765447 
Product2-isopropylmalate synthase 
Protein accessionYP_911324 
Protein GI119356680 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGAAA AAATCCTTGT ATTTGACACA ACACTTCGCG ACGGTGAACA GTCACCCGGA 
GCATCCCTGA ATGTCCAGGA AAAAGTAGAG ATCGCCAGGC AGCTTGAAAA GCTTGGCGTC
GATATCATTG AAGCCGGATT TCCGGCTTCA TCACCGTTAC AGTTTGAGGC CGTTCAAAAA
ATAGGCGCGG AATCCGGTGC CGTTGTCGCG GCACTGGCAA GGGCCGTTGA ACAGGACATA
ACCCGCGCAT GGCAATCACT CAGGGAGGCA AGGAAACCAA GAATCCACAC CTTCATCAGC
ACGTCCGACA TTCATATCAC GGGAAAGTTC GGAAGCAGCC GCTACGGAAC AAGCCTGAAG
GAAAAACGGG CCACAATCCT GAACATGGCG GTAAACGCCG TTACTTTTGC CCGTTCGCTT
GCCGGAGATA TTGAGTTTTC AGCCGAAGAT GCGGGAAGAA CAGACCCCGT TTATCTTGCT
GAAATAATAG AAGCCGTTAT AGAGGCGGGA GCCTCGACCG TCAATATACC CGACACCACA
GGATATACAT GGCCTTCGGA GTTCGGCAAA AAAATCAGGG ATCTCAAAAC GCGGGTCGGG
AACATCGAAA AAGCAATCAT CAGCGTTCAC TGCCACAACG ATCTTGGCCT TGCCGTAGCC
AACTCGCTCA GCGCGCTTGA ACAGGGAGCG CGACAGGTTG AATGTTCGAT CAACGGCATT
GGAGAACGGG CGGGAAACGC ATCACTTGAG GAGATCGTGA TGGCCCTGAA AGTCCGCAGC
GACCTGCACA ACTTCGAAAC CGGAATTATT ACCGAAGAGA TTTATAACAC CAGCAGGATG
GTCTCCTCGT TTACAGGAAT TATCATACAA CCCAACAAAG CAATCGTAGG CGATAACGCG
TTCTCGCACG AATCGGGCAT TCACCAGGAT GGCATGCTGA AAAACCGGGA GACTTATGAG
GTCATGACGC CACAATCCGT CGGTGTTCCC GAAACAAGCA TCGTCCTCGG ACGTCATTCC
GGCAAACACG GTCTCGCGTC CCGTCTGCTC TCGCTCGGCT ATATTCTTCA GGACAAGGAA
CTTGAAACGA TCTATCGACG TTTTGTTGAC ATTGCCGACA AGAAAAAAGA GGTCTACGAT
GATGACCTGC GCGTCATGAT GGGAGACGAG CTTTCCAGGC CCGCGAGCGT TTACGAACTC
GACTACCTCC ACATCAACAG CGGCACTGCT TCAATCCCGA CGGCAACGGT GCGAATCACG
CACAATCAAC GGACGTTTGA GGAGTCAGCG ACAGGCGATG GACCGGTCGA TGCCTGTTTC
AGGGCTATCG AAAGAGCGCT CGGCATCGAG TCGATGGTCA GTTCCTATTC GGTAAGATCC
ACGACGGCAG GACGGCAGGC ACTTGGTGAA GCACTGGTAC GAATCAGGGA CAGGAATGTC
TCCTTTAACG GAAGAGGCAT TTCAACCGAT ATTATCGAGG CAAGCGCAAA AGCTTACCTC
CAGGCACTCA GCCTGAGCCG GACATATTTT GAAACAGACA ACACTACAGA AACCATAGAT
AACGGGGTTT AA
 
Protein sequence
MREKILVFDT TLRDGEQSPG ASLNVQEKVE IARQLEKLGV DIIEAGFPAS SPLQFEAVQK 
IGAESGAVVA ALARAVEQDI TRAWQSLREA RKPRIHTFIS TSDIHITGKF GSSRYGTSLK
EKRATILNMA VNAVTFARSL AGDIEFSAED AGRTDPVYLA EIIEAVIEAG ASTVNIPDTT
GYTWPSEFGK KIRDLKTRVG NIEKAIISVH CHNDLGLAVA NSLSALEQGA RQVECSINGI
GERAGNASLE EIVMALKVRS DLHNFETGII TEEIYNTSRM VSSFTGIIIQ PNKAIVGDNA
FSHESGIHQD GMLKNRETYE VMTPQSVGVP ETSIVLGRHS GKHGLASRLL SLGYILQDKE
LETIYRRFVD IADKKKEVYD DDLRVMMGDE LSRPASVYEL DYLHINSGTA SIPTATVRIT
HNQRTFEESA TGDGPVDACF RAIERALGIE SMVSSYSVRS TTAGRQALGE ALVRIRDRNV
SFNGRGISTD IIEASAKAYL QALSLSRTYF ETDNTTETID NGV