Gene Cagg_3602 details

Gene Information

Locus tag: Cagg_3602
Symbol:
ID: 7269746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4377807
End bp: 4379060
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 62%
IMG OID: 643568410
Product: 3-isopropylmalate dehydratase large subunit
Protein accession: YP_002464876
Protein GI: 219850443
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
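
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length counts the stop codon while the protein length does not; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4377807, 4379060)
print(length)                  # 1254
print(protein_length(length))  # 417
```

Both results agree with the Gene Length and Protein Length fields listed above.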


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.187655
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCAACGA TGAGCGAGCA AATCTTAAGC CGTGTTGCCG GACGAATGGT GCGCGCCGGC 
GATGTTGTGA CGGCAAATGT TGATCTGGTG ATGGTACACG ATAGCCTGGC GCCCGGTATT
ATCCGTATTC TCCACAACGA ACTGGGTGCC GAACGGGTGT GGGATCCTCA GCGCATTGCG
GTTGTGATCG ATCACGTCGC CCCGGCTGCC AGTGTACAGA CCGCAGAAAA GCAGCAAGAA
GTGCGGCGTT GGGTCAAGGC GCAAGGTATT CCCAACTTGT TCGATGTCGG GCGCGGTATT
TCGCACCCGG TGTTGGTGGA AGAGGGGTTG GCCCAGCCGG GTATGTTGAT TTTGGGTAGC
GATAGCCACA GTACGGCGTA TGGCTGTGTC GGAGCGTTTG GCACCGGCAT GGGCAGCACC
GACATCGCAC TCGCATTGGC TACGGGTAAG ACGTGGTTGC GCGTGCCGGA GACCACCGTG
GTGCGAGCAC GCGGTGAGTT TGGGTTTGGT GTGGGGCCGA AAGATTTGGC ACTCCGCGCT
GCCCGTCTGC TTCGCGCCGA TGGAGCAACA TATGCAGCCA TCGAGTGGCA CGGCGTCGAG
CACCTGAGCG TGATGGAGCG GATGACGCTG GCGACCCTCT CGATTGAAAT GGGGGCCAAG
GCTGGGATTA TTCCGCCGAC CGGCCTCGAT CTCACCGGCC CACTCGTACC GACCGTCGAT
GCCGACGCGC AGTATCAACA GGTGGTTGAG ATCGATCTTG AGCAACTGGA GCCACAAGTC
TCGGCACCAC ACTATGTTGA CAACGTTGCG AACCTCAGTG ATCTGGGGCG CGTCGCAGTT
GATGTGGTCT ATCTCGGCAC ATGCACGAAC GGCCATTACG AAGATATGGC AGTGGCAGCC
CAGATTCTGG CCGGACGACG TATCGCTCCC GGTGTGCGGA TGATCGTTGT GCCGGCTAGC
GCGCAGGCGC TGCATCGCGC CGCCGCCGAT GGCACCCTCG CAACCTTGCT CGCCGCCGGC
GCGACCATCG GCACGCCGGG GTGCGGCGCC TGCATTGGCC GCCACATGGG AGTGCTCGCC
CCCGGTGAGG TCTGTCTGTT CACCGGTAAT CGCAATTTCC GTGGCCGTAT GGGCAGCCCT
GAAGCGCAGA TCTATTTGGC TTCGCCGGCA GTGGCTGCCG CGACGGCCCT CACCGGTTAT
CTGACCGACC CGCGGATGGT GATGGATGGG CAGCCGGTTG GCTCACATTC GTGA
 
Protein sequence
MPTMSEQILS RVAGRMVRAG DVVTANVDLV MVHDSLAPGI IRILHNELGA ERVWDPQRIA 
VVIDHVAPAA SVQTAEKQQE VRRWVKAQGI PNLFDVGRGI SHPVLVEEGL AQPGMLILGS
DSHSTAYGCV GAFGTGMGST DIALALATGK TWLRVPETTV VRARGEFGFG VGPKDLALRA
ARLLRADGAT YAAIEWHGVE HLSVMERMTL ATLSIEMGAK AGIIPPTGLD LTGPLVPTVD
ADAQYQQVVE IDLEQLEPQV SAPHYVDNVA NLSDLGRVAV DVVYLGTCTN GHYEDMAVAA
QILAGRRIAP GVRMIVVPAS AQALHRAAAD GTLATLLAAG ATIGTPGCGA CIGRHMGVLA
PGEVCLFTGN RNFRGRMGSP EAQIYLASPA VAAATALTGY LTDPRMVMDG QPVGSHS
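
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, used as example input.
first_line = "ATGCCAACGA TGAGCGAGCA AATCTTAAGC CGTGTTGCCG GACGAATGGT GCGCGCCGGC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base window only
```

Passing the full gene sequence to `clean_sequence` and comparing `len()` against the Gene Length field is a quick way to confirm that nothing was lost when copying the record.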