Gene Cag_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1685 
Symbol 
ID3746375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2186616 
End bp2188616 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content47% 
IMG OID637774223 
Productacetyl-CoA synthetase 
Protein accessionYP_379980 
Protein GI78189642 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACAG AAACACCATC ATCAAGCCAG CAAGAGGCTG CTGCTAATCC TGCCGAGGAT 
TCCATCAGTT CCGTGTTAAC TGAAAAACGC AAATTTCCGC CTCCTGCAAG CTTTTCAGAG
CAAGCGCACC TTTCCACTAT GGAGCAGTAT GAAAAGCTTT ATGCTGACGC AGCGGCTGAT
CCTGAAGGGT ATTGGGCTGG AATTGCTGAA CAATTTCACT GGTTTAAAAA ATGGGACTCG
GTACTTGAAT GGAATTCGCC CTATGCAAAA TGGTTTAATG GCGGTAAAAC GAACATTTGT
TACAATGCGC TTGATGTTCA TGTTAAAAGC TGGAGAAAAA ATAAAGCCGC AGTTATTTGG
GAGGGTGAAC AGGGCGACCA ACGTATTTTA ACCTATGGTG AATTACATCG TCAGGTATGT
AAGTTTGCCA ATGTGTTGAA AATTGCAGGC ATTAAGCCGG GCGATCGTAT TGCTATTTAC
ATGGGTATGG TGCCTGAACT TATGATTGCT GTGCTGGCTT GTGCGCGTGT TGGCGCGGTG
CATAACGTTA TTTTTGCTGG TTTCTCGGCT CATGCTATTA CGGAGCGTGT GAATGATTCT
CGTGCAAAAA TGGTGATTTG TGCTGATGGT ACTCGTCGTC GTGGTTCAAC CATCAACCTT
AAAAACATTG TTGATGAGGC AATTGTTAAT ACCCCATCGG TGCGCAATGT GATTGTGCTA
AAAACTACCG GTGAAACCAT TAAGATGCAC GATGGTATGG ATCATTGGTG GCACGATTTA
ATGGGACTTG CTGTAGATGA ATCGGAAGCT GTGGAGTTGG ATGCTGAACA TCCGCTTTTT
GTGCTTTATA CCAGCGGTTC AACGGGTAAG CCAAAAGGTA TTTTACACAC CACGGCAGGT
TACATGGTTC ACGCTGCAAG CTCTTTCCGC TATGTGTTCG ATATTAAAGA TGAGGATATT
TATTTCTGTA CAGCCGACAT TGGCTGGATT ACAGGGCATA GCTACATGGT GTATGGTCCG
CTCTTGAACG GTGCAACGCT GTTGATGTAC GAAGGTGCAC CAAACTATCC TCAGTGGGAT
CGCTTCTGGG ATATTATTAA TCGCCATAAG GTTACCATTC TCTACACGGC TCCAACGGCT
ATTCGTGCCT TTATTCGTGC TGGTAACGAA TGGGTAACCA AGCACAATCT TAACTCTCTC
CGCTTGCTTG GTACCGTAGG TGAGCCAATT AACCCCGAAG CGTGGATGTG GTACCACAAA
GTGGTTGGAC AGGAAAAATG CCCCATTGTG GATACGTGGT GGCAAACAGA AACAGGCGGC
ATTATGGTTT CTCCAATGCC GGGTGCAACG CCAACCAAAC CAGGCACCGC AACTCGTCCA
CTTCCAGGCA TTATGGTAGA TGTTGTGCAC AAAGATGGCA CGCCATGTGG CGCTAACGAA
GGTGGTTACC TTGTTATTAA AAAGCCATGG CCTTCCATGT TGCGTACCAT TTATGGTGAT
AACGAGCGTT ACGAAAAAAC CTACTGGTCG GAGTTTAAAG ATATGTACTT TACTGGCGAT
GGTGCCCGTA AGGATGACGA TGGCTATATT TGGATCATGG GTCGTGTTGA CGATGTGGTA
AACGTTTCGG GGCACCGCCT TGGTACCAGC GAAGTTGAAA GCGCTCTTGT GTCGCACGAA
GCGGTAGCTG AAGCGGCTGT GGTAAGTCGT CCTGACGATA TTAAAGGTAA CTCGTTGGTT
GCCTTTGTAA CGTTAAAGGA TGAGTATGAG GGCGATATGA AGCTTCGTGA ATCGCTCCGC
AACCATGTGG CTCGCGAAAT TGGACCTATT GCTAAGCCCG ATGAAATTCG CTGGGCAAAA
GCGCTGCCAA AAACTCGTAG CGGTAAAATT ATGCGCCGCT TGCTTCGTGA GCTTGCAACC
AGCAACGAAA TTAAGGGCGA TGTTACTACG CTCGAAGATT TTGGTGTGCT CGAAAATCTT
CGCGATCAAG AAAACGAATA A
 
Protein sequence
MATETPSSSQ QEAAANPAED SISSVLTEKR KFPPPASFSE QAHLSTMEQY EKLYADAAAD 
PEGYWAGIAE QFHWFKKWDS VLEWNSPYAK WFNGGKTNIC YNALDVHVKS WRKNKAAVIW
EGEQGDQRIL TYGELHRQVC KFANVLKIAG IKPGDRIAIY MGMVPELMIA VLACARVGAV
HNVIFAGFSA HAITERVNDS RAKMVICADG TRRRGSTINL KNIVDEAIVN TPSVRNVIVL
KTTGETIKMH DGMDHWWHDL MGLAVDESEA VELDAEHPLF VLYTSGSTGK PKGILHTTAG
YMVHAASSFR YVFDIKDEDI YFCTADIGWI TGHSYMVYGP LLNGATLLMY EGAPNYPQWD
RFWDIINRHK VTILYTAPTA IRAFIRAGNE WVTKHNLNSL RLLGTVGEPI NPEAWMWYHK
VVGQEKCPIV DTWWQTETGG IMVSPMPGAT PTKPGTATRP LPGIMVDVVH KDGTPCGANE
GGYLVIKKPW PSMLRTIYGD NERYEKTYWS EFKDMYFTGD GARKDDDGYI WIMGRVDDVV
NVSGHRLGTS EVESALVSHE AVAEAAVVSR PDDIKGNSLV AFVTLKDEYE GDMKLRESLR
NHVAREIGPI AKPDEIRWAK ALPKTRSGKI MRRLLRELAT SNEIKGDVTT LEDFGVLENL
RDQENE