Gene Cagg_3789 details

Gene Information

Locus tag: Cagg_3789
Symbol:
ID: 7267863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4623288
End bp: 4625222
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 58%
IMG OID: 643568597
Product: acetate/CoA ligase
Protein accession: YP_002465061
Protein GI: 219850628
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02188] acetate--CoA ligase
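
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4623288
END_BP = 4625222
GENE_LENGTH_BP = 1935
PROTEIN_LENGTH_AA = 644


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1935
print(coding_codons(GENE_LENGTH_BP))  # 644
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.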


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.670559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00433941
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGAGA CTCGCGATGT GGCGCTGCCC GACACCGGTG AACTGTACTA CCCCGACCCG 
GCACTGGTTG AGCAATCTAA TGTGATGGCG TATGCTCGCA GCAAGGGCTT CAACTCCTAC
GATGAACTAT ACCAGTGGAC GATCACCCAC CGTGAAGAGT TTTGGGCCGA TATGGCCGGC
GAACTCGAGT GGTTCAAACC GTGGGAGAAG GTGCTCGATG ACAGCAACAA ACCATTCTAC
AAGTGGTTTG TTGGTGGGAA AACCAATATC GTCTACAATG CCATCGACCG CCACCTCAAG
ACGTGGCGCA AGAACAAGCT GGCCCTGATC TGGGAAGGTG AGGATGGAAG CCAGCGCACC
TATTCCTATT ACCAGCTCAA TTACGAAGTG TCGCGGATTG CCAACGTGCT GAAGAGCATG
GGGGTGAAGA AGGGCGATAT TGTTACCATT TACATGCCCC GCATCCCTGA GCTGATGTTT
AGCATGCTGG CCTGTGCTAA GATCGGCGCC GCTCACAGCG TGGTCTACGG CGGCTTTTCG
GAAGCGGCGC TTGCTGACCG CTTAGCCGAT GCCAAGAGCA AGGTGCTGAT TACTGCCGAT
GGCGGCTACA TGCGCGGCAA GATCGTCGAA CTGAAGAAGA TCGTGAACGA GGCGTTGGCC
CGGACTCCCA CCGTCCAAAC CTGTCTCGTC TTCCGCCATA CCAACCACGG TGCCCCGATG
GAGCAGGGGC GAGACTTCTG GATGCACGAT CTCCTCGGTT TGCCGATTGC CAACGGTCAT
TGCCCCACCG AAGAGATGGA TGCCGAGGAT ATGCTGTTTA TCCTCTACAC ATCGGGCACG
ACCGGTAAAC CCAAAGGTGT GGTACACACC CACGGCGGCT ATATGGTCGG TACCTATACG
ACGCTCAAGT TTGTGTTTGA CATCAAAGAC GAAGATCGCT ACTGGTGTGC CGCCGACCCA
GGCTGGATTA CCGGCCACTC GTTTATTGTC TATGCCCCCC TAATCAACGG CGCTACCTCG
TTTATGTACG AGGGCGCTCC CAACTATCCA TACCCCGATC GCTGGTGGAG CATGGTTGCC
AAGCACGGCA TTACCATCCT CTACACTGCC CCGACTGCTA TTCGCGGCTT GATGCGCTTC
GGCGATTTGT GGCCCTCGCG CCACGATCTT AGCACTCTGC GCTTGCTCGG TTCGGTCGGC
GAGCCGATTA ACCCTGAAGC GTGGAAGTGG TTCTACGAGA AGATCGGTCA TAATCGCTGC
CCCATCATGG ATACGTGGTG GCAGACTGAG ACCGGCCACT TTATGATTAC ACCGACCCCG
GCTGTACCGC TAAAGCCCGG TTCGGCGACC CGCCCCTTCC TCGGCATCGA AGTTGATGTG
GTGCATGAAG ATGGTACACC CTGCGCACCC GATGAAGACG GTCTGCTGGT GATCAAGACA
CCGTGGCCGG GTATGATGCG CACAATCTTG TACGATCCAC AGCGCTATGT CGAAGGCTAT
TGGCAAAAGG TGCCGCCGTA CTACGCTGCC GGTGACAGCG CACGTAAAGA CAAGGACGGC
TATATCTGGG TGATTGGCCG CCTCGATGAT GTGATCAAGG TGTCGGGCTA CCGGCTAGGC
ACCGCCGAAG TCGAGAGCGC ATTGGTCAGC CACCCCGCCG TTGCCGAGGC CGCCGCGATT
GGGTTGCCGC ACGAGGTGAA GGGCAACGCG ATCCACGCCT TCGTTATTCT GCGTGCCGGT
TACGAACCGA GCCACGAGCT GGAAGAAAAA TTGCGCGCAC ACGTCGGCCA CGAGCTTGGG
CCGATCGCTC GCCCCGACTC GATTACCTTC GTAACGTCGC TGCCCAAGAC GCGCTCCGGT
AAGATTATGC GCCGTGTGCT GCGTGCCCGT GCGTTGGGCC TGCCCGAAGG CGACATCTCG
ACGCTCGAAG AGTAG
 
Protein sequence
MTETRDVALP DTGELYYPDP ALVEQSNVMA YARSKGFNSY DELYQWTITH REEFWADMAG 
ELEWFKPWEK VLDDSNKPFY KWFVGGKTNI VYNAIDRHLK TWRKNKLALI WEGEDGSQRT
YSYYQLNYEV SRIANVLKSM GVKKGDIVTI YMPRIPELMF SMLACAKIGA AHSVVYGGFS
EAALADRLAD AKSKVLITAD GGYMRGKIVE LKKIVNEALA RTPTVQTCLV FRHTNHGAPM
EQGRDFWMHD LLGLPIANGH CPTEEMDAED MLFILYTSGT TGKPKGVVHT HGGYMVGTYT
TLKFVFDIKD EDRYWCAADP GWITGHSFIV YAPLINGATS FMYEGAPNYP YPDRWWSMVA
KHGITILYTA PTAIRGLMRF GDLWPSRHDL STLRLLGSVG EPINPEAWKW FYEKIGHNRC
PIMDTWWQTE TGHFMITPTP AVPLKPGSAT RPFLGIEVDV VHEDGTPCAP DEDGLLVIKT
PWPGMMRTIL YDPQRYVEGY WQKVPPYYAA GDSARKDKDG YIWVIGRLDD VIKVSGYRLG
TAEVESALVS HPAVAEAAAI GLPHEVKGNA IHAFVILRAG YEPSHELEEK LRAHVGHELG
PIARPDSITF VTSLPKTRSG KIMRRVLRAR ALGLPEGDIS TLEE
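
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It is not part of the original record and the helper names are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 differs mainly in permitting alternative start codons, which does not matter here because this gene begins with ATG. The 10-base blocks separated by spaces are treated as display formatting and removed before processing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGACCGAGA CTCGCGATGT GGCGCTGCCC"))  # MTETRDVALP
print(translate("ACGCTCGAAG AGTAG"))                  # TLEE
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked the same way.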