Gene Cagg_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2034 
Symbol 
ID7269193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2495969 
End bp2497000 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content56% 
IMG OID643566869 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_002463358 
Protein GI219848925 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCAC CGCGCTTGAC CGATACAACC TTACGCGACG GTTCGCACCC AATGCGCCAC 
CAGTTTACCC GCGAACAAGT AGCCAAGATT GTGCAAGCGC TCGACCGGGC CGGCGTACCG
GTGATCGAGG TCAGTCATGG TGATGGGTTG GCCGGATCGT CGCTGCAATA TGGCTTTTCT
CATACATCAG AGTTTGATCT GATCGAAACT GCGCGTAGTT ATGCCGAACG GGCCAAGATT
GCTGCATTGA TGCTGCCGGG AATTGGCACA CGCCAAGAAT TGAAAGAAGC GGTAGCACGC
GGTATTCAGG TGGTACGAAT AGCTACTCAG TGTACAGAAG CGGATATTTC CGAACAGCAT
TTTGGTCTGG CAAAAGAGTT GGGTCTAGAA ACGGTCGGCT TTCTCATGAT GGCGCATATG
CGTCCACCAG AAGCGTTAGT CGAACAGGCA AAACTGATGG AATCGTATGG CGCCGACTGC
GTCTACATTG TTGATTCGGC GGGGGCGATG TTACCGCACG ACGCTGCGGC ACGGGTTCGG
GCGCTGAAAG AAGCGTTGTC GGTACAAGTT GGCTTTCATG CACACAATAA TCTCGGGTTA
GGCATCGGGA ATACCCTCGC AGCACTTGAA GCAGGCGCCG ATCAGATTGA TGGTTGTTTG
CGCGGATTAG GAGCCGGTGC CGGTAATGCT GCGACCGAGC TTCTGGCTGC CGTGCTCGAC
CGGCTGGGGC TGAATCCGGG GCTTGATGTC TTTGGGCTGA TGGATGCCGC CGAATATATT
GTGGCTCCGA TCATGCCATT TCAGCCCTTT CCTGACCGTG ATGCGATCAC CATCGGATAT
GCCGGAGTCT ACTCAACCTT CCTCCTACAC GCAAAACGTG CAGGTGAGCA GTACGGCATC
GATCCACGTG AGATCCTCGT CGAACTTGGC CGGCGGCAAG CAGTAGCCGG ACAAGAAGAC
TGGATTATCG ACGTAGCCCT CGATTTGAGT CGCCGGCGGG GCGCAGGGAC ACGCAAGGAG
GCGCACGGAT GA
 
Protein sequence
MKAPRLTDTT LRDGSHPMRH QFTREQVAKI VQALDRAGVP VIEVSHGDGL AGSSLQYGFS 
HTSEFDLIET ARSYAERAKI AALMLPGIGT RQELKEAVAR GIQVVRIATQ CTEADISEQH
FGLAKELGLE TVGFLMMAHM RPPEALVEQA KLMESYGADC VYIVDSAGAM LPHDAAARVR
ALKEALSVQV GFHAHNNLGL GIGNTLAALE AGADQIDGCL RGLGAGAGNA ATELLAAVLD
RLGLNPGLDV FGLMDAAEYI VAPIMPFQPF PDRDAITIGY AGVYSTFLLH AKRAGEQYGI
DPREILVELG RRQAVAGQED WIIDVALDLS RRRGAGTRKE AHG