Gene Cagg_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1950 
Symbol 
ID7268866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2382288 
End bp2383475 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content57% 
IMG OID643566788 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_002463281 
Protein GI219848848 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00273343 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAAA AGCGTGAAGT CGTCGTACTG AGCGGCGTGC GCACGGCTAT TGGCACGTTT 
GGTGGCAGTC TCAAAGATAC CCCCCCAACC GAACTTGCTG CACTGGTAAC GCGCGAGGCA
GTAGCTCGCG CCGGTGTTCA ACCGGACGAA ATCGGCCACG TGGTTTTTGG TCACGTGATC
AATACCGAAC CACACGATAT GTACATGGCT CGCTATGCAG CAGTGCGTGG CGGTTTACCC
GTTGAGACGC CGGCACTCAC CCTCAACCGA TTATGCGGGA GTGGTTTGCA GGCGATTGTC
TCAGCCGCTC AGTATATTCT GCAGGGTGAT ATTGATGCAG CAGTGGCCGG TGGTGCCGAA
TGTATGAGCC GTGGTCCTTA CAGCGTGCCG GCGATGCGTT TTGGTGCTCG TATGAACGAC
ACCAAAGTCG TGGATATGAT GGTCGGTGCG CTGACCGATC CGTTCGACGA TTGCCATATG
GGAGTGACGG CTGAGAATGT CGCGGCAAAG TGGGGCATTA GCCGCGAAGA TCAAGACCAA
CTCGCCTACG AAAGCCATAT GCGTGCGGCG CGTGCTATTG ACGAAGGACG CTTCGCTGGC
CAAATCGTGC CGGTTGAGAT CAAGACCAAA GGTGGAACTG CCCAGTTTAT GGTCGATGAG
GGCGTGCGTC GTGATACCAC CATTGAGAAG CTGGCCAAGC TACGCCCTGT CTTCCTCAAG
GATGGGACGG TTACCGCCGG GAATGCCTCG AGTATCAACG ATGCTGCGGC TGCCGTTGTC
TTGATGGATC GGGCGACGGC TGAGCGTCGT GGCTACAAAC CATTGGCACG CCTGGTCGGT
TATAGCAACG CTGCTGTTGA GCCGAAGTAT ATGGGGATTG GACCGGTGCC GGCAGTGCGC
CGTCTGCTCG AGCGCACCGG TCTACGGATT ACCGATATCG ATCTCTTTGA AGTGAATGAA
GCGTTTGCCG CCCAAGCGTT GGCTGTGATC CGCGATCTGG GTCTGCCCAT GGATCGCACC
AATCCGAATG GCAGCGGTAT TTCGCTTGGT CACCCGATCG GCGCTACCGG TTGCATCCTG
ACCGTCAAGG CAATTCACGA GCTACACCGC ACCGGTGGCC GTTATGCGCT GGTGACGATG
TGTATCGGTG GCGGACAAGG TATCGCTGCG ATTTTTGAGC GGATGTAG
 
Protein sequence
MSEKREVVVL SGVRTAIGTF GGSLKDTPPT ELAALVTREA VARAGVQPDE IGHVVFGHVI 
NTEPHDMYMA RYAAVRGGLP VETPALTLNR LCGSGLQAIV SAAQYILQGD IDAAVAGGAE
CMSRGPYSVP AMRFGARMND TKVVDMMVGA LTDPFDDCHM GVTAENVAAK WGISREDQDQ
LAYESHMRAA RAIDEGRFAG QIVPVEIKTK GGTAQFMVDE GVRRDTTIEK LAKLRPVFLK
DGTVTAGNAS SINDAAAAVV LMDRATAERR GYKPLARLVG YSNAAVEPKY MGIGPVPAVR
RLLERTGLRI TDIDLFEVNE AFAAQALAVI RDLGLPMDRT NPNGSGISLG HPIGATGCIL
TVKAIHELHR TGGRYALVTM CIGGGQGIAA IFERM