Gene Cagg_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1143 
Symbol 
ID7267891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1412224 
End bp1413390 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID643565986 
Productacetyl-CoA acetyltransferase-like protein 
Protein accessionYP_002462489 
Protein GI219848056 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.525995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000104524 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAACG TCTATATTGC AGGTATGGGA GCGACTGCGG TTGGCGAGCA CTACCGGCGC 
AGCCTCGCCG ATCTCGTCAG TGAAGCAGCT CGTGCCGCGC TATCCAGCAC ACCGGGAATC
GCTCCGCATC AGATCGGCGC ACTCTATGTC GGTAGTGCCT ACAGCGAAGA GCTGTATGGG
CAAAGCCAAC TCGGCGCATA TCTCGCCGGT GTACTGGGCC TCTCCACCTC CATCCCCACA
CTCCGCGTCG AAGCCGCCGG CGCGAGTGGC GGATTAGCCC TCTACCAGGC AGTCCAGGCA
GTACAACATG GCTTGCCATT AGCACTGGTC ATCGGTGTCG AAAAGGTCAC CGATCAACTC
GAAGACGATA TTGAGGCCGC ACAAGCAATG GCAAGTGACG GCAATGAAGA GGCGTTACAC
GGCATCACCT TGACGGCACA GTGGGCAATG CTGATGCGTC GCTACATGTA CGAATACGGC
TATAGCGCTG AAGCGTTCGC GCCGTTTCCG ATCAATGCCC ATGCCAACGG GGCGAAAAAT
CCGCTTGCCC TTTACCGTTT TCCCATTGAC GCCAACAAGT ATCGCAAAGC GGCGATGGTT
GCATCACCGA TCAACCTGCT CGATTGCAGT ACCCTTGCCG ATGGCGCAGC AGCCCTTCTG
ATCGCCGGCG AGCACTTGGC CCGCGAACTA ACCGGGCCGC GCATTCGTAT TGCCGGGTCG
GCGGTTGCGA CCGACACGGT TGCCCTTCAC CGCCGACGTA ACCCGCTCGA ACTAACTGCC
GCGCGGGCCA GTGCGCACAT TGCGCTAGGC CGCGCCCACC TCGGCGTCGG CGATGTTCAC
GTCTGGGAGC TGACCGATCC GCACGGCATT GCCGCAACTT TGGCGCTGGA AGCGATTGGC
TGTTACGAAC CCGGTACAAC CCCACGCCAC GCCGCCGAAG GAGCGATTAC TCCGACCGGT
AAGACACCCA TTGCCACGGC CGGCGGCTAT AAAGCGCGCG GTGACGTAGG TGGAGCAAAC
GGAATCTATC AGGTGATCGA GCTTGCCCAT CAACTCTGCG GTACAGCCGG CGCAACTCAA
GTCGCCGATG CACGGATTGG GTTAGCGCAA ACACTTGGTG GCATTGGGGC GACGGCAGTG
ACCCATGTCC TGATTCGCGA ATCGTGA
 
Protein sequence
MTNVYIAGMG ATAVGEHYRR SLADLVSEAA RAALSSTPGI APHQIGALYV GSAYSEELYG 
QSQLGAYLAG VLGLSTSIPT LRVEAAGASG GLALYQAVQA VQHGLPLALV IGVEKVTDQL
EDDIEAAQAM ASDGNEEALH GITLTAQWAM LMRRYMYEYG YSAEAFAPFP INAHANGAKN
PLALYRFPID ANKYRKAAMV ASPINLLDCS TLADGAAALL IAGEHLAREL TGPRIRIAGS
AVATDTVALH RRRNPLELTA ARASAHIALG RAHLGVGDVH VWELTDPHGI AATLALEAIG
CYEPGTTPRH AAEGAITPTG KTPIATAGGY KARGDVGGAN GIYQVIELAH QLCGTAGATQ
VADARIGLAQ TLGGIGATAV THVLIRES