Gene Cagg_3600 details

Gene Information

Locus tag: Cagg_3600
Symbol:
ID: 7269744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4376283
End bp: 4377281
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 60%
IMG OID: 643568408
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_002464874
Protein GI: 219850441
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase; [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases
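
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; the variable names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
start_bp = 4376283
end_bp = 4377281
gene_length_bp = 999
protein_length_aa = 332

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop equals protein length
```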


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.177438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATCT GTGTCATTCC CGGTGATGGG ATCGGTTCTG AAGTAATGGC TGCGGCAACG 
GCGGTGTTGC GCGTGTTGGC ACCAGACCTC ACCCTCTGTG AAGCCGAGGC GGGGTGGGGC
GTCTTCCAGC GCACCGGCGT CGCCCTTCCC GCCGAAACAC TCGCAGCAGC GCGTGAGGCG
ACTGCTATTC TCTTCGGCGC AGTTGCCTCA CCGAGCCACC CGGTGCCCGG ATACCGTAGT
CCGATTGTCG AATTGCGCCG TACCCTTGAT CTCTACGCCA ACATTAGGCC AACGATTGGC
CCCGGCGTTG ATCTGGTTGT GGTACGCGAG AATACCGAAG ATTTGTACAG TGGGCGTGAA
CGGCTCGAAG ACGACGGTAA CACAGCAATT GCCGAGCGGG TGATTACGCG CACCGCTTCG
GCCCGAATTG TGCGTACCGC CTGCGAACTG GCGCGCACCC GCCAAGCCGC CGGCCGTCCC
GGCAAGATCA CGATTGTTCA TAAAGCTAAC GTGCTGCGGG TCAGTGACGG CCTCTTCCGT
ACCGTAGCGC TAGAGGTCGC TGCCGACTTC CCCGAATTGA CCTTTGAAGA GCGGTTGGTT
GACGTGGCCG CGATGCAACT CGCCGCCCAA CCGCAACGGT TTGATGTGAT CGTTACCACT
AACATGTTCG GTGATATTCT GTCGGATATT GCCTGCATTC ACGGCGGTGG TTTAGGCGTA
GCGGCCAGCA GCAATCTCGG CCATGGCCGC GCACTGTTTG AACCGGTGCA TGGTGCCGCA
CCCGATATTG CCGGGCGCGG GATTGCCAAC CCAACCGCCG CTCTCAACTG TGTGGTGATG
CTGCTTGAGT GGATCGACCG GACTCACACA GCCGAACGGC TCCGCCGCGC GATGGACGCC
GTCGCTAGCG AAGGAATTCG CACACCGGAT GTTGGCGGTG ATGCGACAAC ACGTGAAGTA
ACTGAAGCCA TTCTTGAACA GATAGCAACA CAAGCCTGA
 
Protein sequence
MQICVIPGDG IGSEVMAAAT AVLRVLAPDL TLCEAEAGWG VFQRTGVALP AETLAAAREA 
TAILFGAVAS PSHPVPGYRS PIVELRRTLD LYANIRPTIG PGVDLVVVRE NTEDLYSGRE
RLEDDGNTAI AERVITRTAS ARIVRTACEL ARTRQAAGRP GKITIVHKAN VLRVSDGLFR
TVALEVAADF PELTFEERLV DVAAMQLAAQ PQRFDVIVTT NMFGDILSDI ACIHGGGLGV
AASSNLGHGR ALFEPVHGAA PDIAGRGIAN PTAALNCVVM LLEWIDRTHT AERLRRAMDA
VASEGIRTPD VGGDATTREV TEAILEQIAT QA