Gene Acid345_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3520 
Symbol 
ID4072779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4163044 
End bp4165002 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content60% 
IMG OID637985543 
Productacetyl-coenzyme A synthetase 
Protein accessionYP_592595 
Protein GI94970547 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.877944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.990981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCAG CCACGCACTC CAATATTGAT TCCATTCTTC AGGAAAATCG CAAATTCGAA 
CCGCCTGCTG AGTTCAGCCG TCACGCCCAC ATCAAGTCGC TCGAGGAATA CGAGAAACTC
TACAAGCAGG CCGCTGACGA CCCTGAAGGA TTTTGGGCTG AAGTCGCGAA GGAACTGCAC
TGGTTCAAGC CGTGGACCAA AGTTCTGGAG TGGGACGCAC CCTGGGCGAA GTGGTTCGTC
GGCGCTGAGG CCAACCTTTC CTACAACTGT CTCGACCGCC ACGTGCTCGG CGGACGCCGC
CACAAGGCCG CGTTCATTTG GGAAGGCGAA CCCGGCGACG TGCGCACGCT GACCTATCAG
CAGCTCTGGC TCGAAGTGCA GAAGTTCGCC AACGTTCTGC TGGATCTTGG GATCAAGAAG
GGTGATCGCG TTGCGATTTA TATGGGTATG GTGCCGGAAC TGCCCGTTGC GATGCTCGCC
TGCGCGCGCA TCGGCGCGAC GCACTCCGTG ATCTTCGGTG GCTTTTCGGC GAACGCGCTG
GTGGACCGCA TCACCGACCA GCAGGCCGTC GCCGTCATTA CGCAGGATGG CTCATGGCGT
CGCGGCAACG AAGTGAAGTT GAAAGTCGCA GTAGACGAGG CGCTGGAAAA GTGCCCAACC
GTGAAACACG TCGTGGTCTA TAAGCGCACG GCAAGCGCCA TCAACATGAA AGAGGGTCGC
GACCACTGGT GGCACGATCT CATGGCGAAG GCGAAGGACC ACTGCCCCGC CGAGCCGCTC
GACGCCGAGC ATCCGCTCTA CATCCTCTAT ACGTCGGGGA CCACCGGCAA GCCGAAGGGA
ATCGTCCACA CCACTGGCGG CTACGCAGTC GGCACCTACT ACACGACCAA GATGGTCTTC
GATCTCAAGG AAGACGATAC CTTCTGGTGC ACTGCCGATA TCGGTTGGGT CACGGGCCAC
AGCTACATCG TTTACGGTCC GCTGCAAACC GGCGCCACGA CGGTGATGTA CGAAGGCGCG
CCGAACTTCC CGGACCTCGA TCGTTTCTGG GCGCTCGTCG CCAAGCACAA GGTCACCGTC
TTCTACACCG CCCCGACCGC GATTCGCACC TTCATGAAGT GGGGCGCGGA ATATCCCAAC
CGTCATGACA TGAGCACTCT AAGATTGCTC GGCAGCGTTG GTGAGCCGAT CAACCCCGAA
GCCTGGATGT GGTACCGCGA CGTCATCGGG AAAGATCGTT GTCCGATCGT TGATACCTGG
TGGCAGACCG AGACTGGCGC CATCATGATC TCGCCGCTGC CTGGCGCGAT CGCCACCAAG
CCGGGTTCGG CGACCAAGCC GCTGCCCGGA ATCATCGCGG AAGTCGTAAC CCGTGCCGGC
GAGAAAGTAC CGCTCGGCTC GGGCGGGTTC CTCGTCATCA AGAAACCGTG GCCCTCGATG
ATGCGCACCA TCTACGGCGA TCCCGAGCGC TACAAGCACC AGTATTGGTC TGATATTCCG
GGCGTGTACT TCACGGGTGA TGGTGCTCGC GAAGACAAGG ACGGCTACTT CTGGATCATG
GGTCGCGTGG ACGACGTGCT GAACGTCTCC GGCCATCGCC TGAGCACCAT GGAAATCGAG
TCCGCGCTGG TGGCACATCC GAAGGTCGCG GAAGCCGCGG TCGTTGGCCG CCCAGACGAG
ATGAAAGGTC AGGCAGTATC GGCGTTCGTC ACGCTGGAAT CCGGTAGCAA GCCCTCGCCT
GAACTGAAGG AAGAACTCCG CGCCTGGGTA GCCAAGGAAA TCGGTTCCAT GGCGAAGCCC
GATGACATCC GCTTTACGGA CACGCTCCCC AAGACCCGCA GCGGCAAGAT CATGCGCCGT
CTGCTCCGTG AACTGGCAAC GGGAGGCGAT GTAAAGGGCG ACACCACGAC CTTGGAAGAT
TTCACCGTCA TCGCCAAGCT CAAGGAAGAT GAACAGTAG
 
Protein sequence
MSSATHSNID SILQENRKFE PPAEFSRHAH IKSLEEYEKL YKQAADDPEG FWAEVAKELH 
WFKPWTKVLE WDAPWAKWFV GAEANLSYNC LDRHVLGGRR HKAAFIWEGE PGDVRTLTYQ
QLWLEVQKFA NVLLDLGIKK GDRVAIYMGM VPELPVAMLA CARIGATHSV IFGGFSANAL
VDRITDQQAV AVITQDGSWR RGNEVKLKVA VDEALEKCPT VKHVVVYKRT ASAINMKEGR
DHWWHDLMAK AKDHCPAEPL DAEHPLYILY TSGTTGKPKG IVHTTGGYAV GTYYTTKMVF
DLKEDDTFWC TADIGWVTGH SYIVYGPLQT GATTVMYEGA PNFPDLDRFW ALVAKHKVTV
FYTAPTAIRT FMKWGAEYPN RHDMSTLRLL GSVGEPINPE AWMWYRDVIG KDRCPIVDTW
WQTETGAIMI SPLPGAIATK PGSATKPLPG IIAEVVTRAG EKVPLGSGGF LVIKKPWPSM
MRTIYGDPER YKHQYWSDIP GVYFTGDGAR EDKDGYFWIM GRVDDVLNVS GHRLSTMEIE
SALVAHPKVA EAAVVGRPDE MKGQAVSAFV TLESGSKPSP ELKEELRAWV AKEIGSMAKP
DDIRFTDTLP KTRSGKIMRR LLRELATGGD VKGDTTTLED FTVIAKLKED EQ