Gene Cagg_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1142 
Symbol 
ID7267890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1411061 
End bp1412110 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content61% 
IMG OID643565985 
Producthypothetical protein 
Protein accessionYP_002462488 
Protein GI219848055 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID[TIGR00748] hydroxymethylglutaryl-CoA synthase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.448183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000356194 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAAAC CGAACCAACC TGTCGGCATT ATCGGCTATG GCGTGTACAT CCCACGTTAC 
CGGATCGCAG CGCGCGAAAT TGCTCGGATC TGGACAGACG GTCAGAATGG CGTCCCCGTG
GAGGCAAAGA GCGTTCCCGG CCCCGATGAA GACACGATTA CGATGGCAAT TGAAGCGGCG
CGTAATGCGC TGGTGCGTGC CGACATTCCG GCTAGCGCAC TCGGTGCGGT CTGGATCGGG
AGCGAAAGCC ATCCCTACAG CGTGAAACCA TCGGGGACGG TAGTAGCCGA CGCACTCGGC
GCCGGGCCAT GGGTGAGTGC CGCCGACTGG GAATTCGCAT GTAAGGCCGG CTCCGAAGCG
CTGACCGCGG CGATGGCACT GGTCGGCAGT GGGATGCAGC GCTACGCCTT GGCGATCGGC
GCCGACACTG CCCAGGGGCG TCCCGGTGAT GCGCTGGAAT ACACTGCTTC CGCCGGCGCA
GCAGCGTTGA TCGTTGGTCC TGCCACCGAA GCGTTGGCGA CCATCGATGC AACCGTCTCG
TATGTCACCG ATACCCCTGA CTTCTACCGC CGCGCCGACC GACCGTATCC GGTACACGGC
AACCGCTTCA CCGGCGAGCC GGCGTACTTC CACCAGATTC AATCGGCAGC CTCTGAATTA
TTACGTCAAC TCAACCGTAC TGCTGCCGAC TTTACCTATG CCGTCTTCCA TCAACCTAAT
GCGAAATTTC CCCAGACGGT TGCCAAACGA CTCGGCTTCA CCGATGCCCA AATCGCGCCG
GGATTGCTCA GTCCACAGAT CGGTAATACC TATTCGGGCG CCGCACTGCT AGGCCTGTGT
GCCATTCTCG ATGTCGCCAA ACCGGGCGAT ACCATCTTCG TAACGAGCTA CGGTAGTGGG
GCCGGTTCCG ACGCTTATGC CCTCACCGTC ACCGAAGCGA TTGTGGAGCG ACGCGAGCGA
GCGCCATTGA CGGCAGCGTA CCTCGCCCGC AAGGTGATGA TCGATTACGC AATGTATGCG
AAATGGCGCG GTAAGTTGGT GATGGGCTAG
 
Protein sequence
MMKPNQPVGI IGYGVYIPRY RIAAREIARI WTDGQNGVPV EAKSVPGPDE DTITMAIEAA 
RNALVRADIP ASALGAVWIG SESHPYSVKP SGTVVADALG AGPWVSAADW EFACKAGSEA
LTAAMALVGS GMQRYALAIG ADTAQGRPGD ALEYTASAGA AALIVGPATE ALATIDATVS
YVTDTPDFYR RADRPYPVHG NRFTGEPAYF HQIQSAASEL LRQLNRTAAD FTYAVFHQPN
AKFPQTVAKR LGFTDAQIAP GLLSPQIGNT YSGAALLGLC AILDVAKPGD TIFVTSYGSG
AGSDAYALTV TEAIVERRER APLTAAYLAR KVMIDYAMYA KWRGKLVMG