Gene Acid345_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3521 
Symbol 
ID4072780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4165115 
End bp4167025 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content60% 
IMG OID637985544 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_592596 
Protein GI94970548 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.728516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.613909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGG TTGCGCCCGC ACCCGGCGTT CAGAGTTACG TAGCCAAAGA AGAGCAGTAC 
GACGCCCCGC AGATCGTGGT CAAAGAGGCC CTGCTCCAGC ACTGGGACGA GGAATACGCG
CGCTCGATCG CCGACAACGA CGCCTTCTGG GGCGAATACG CGAAGAACTT TCGCTGGACC
AAACCATTTC AAACAGTGAG CGAGGCGAAC GGCGCGCACC ACAAGTGGTT TCTCGGCGGC
AAGACTAACA TCACGCTAAA CGCCCTCGAT CGTCATGCGA AGTCGGAGCG CCGCAATCGA
GTTGCTTACA TCTGGCTCTG CGAAGACGGC TCAGAGCGGG TCGTGACTTA TGGCCAGCTC
TATCGCATGG TGTGCCGCTT CGCCAACGGC CTGCGTTCGA TTGACGTCAA CAAGGGCGAT
CGCGTTGTGA TCTACATGCC GCTCACCATC GAATGCATCG TCGCGATGCT GGCATGCGCC
CGTGTCGGTG CGATTCACTC GGTTGTCTAT GCAGGTCTGG GTCACCAGGC ACTTCGCGAT
CGCATTGAAG ATGCGCAGGC GAAGGTCGTC ATCGCCGGTG AATGTACCTA TCGTCGCGGT
AAGACGGTCG CGTTGAAACC GATCGTGGAT GAAGCGATTG ACGGCCTGGA GTTCGTTGAG
CACGTCGTCG TGTACCAGCG CAGCAAGGGG CAATTCGAAG CCGCGAGCAG ACGCGAGATC
GATTTCTTCG CGCTGATGAA GTTTTCCTCG GAATGTCCGG CGGAAGAGAT GGACGCCGAG
GATCCGCTGT TCATCCTCTA TACCTCGGGA ACCACGGGAA AACCGAAGGG TGTGGTCCAC
GTCCACGGCG GATTCATGGT TGGCACCACC TATCACCTGC GCAGCTTCCT CGATATCGGC
GAGCAGGACA TTTTCTGGAA CACTTCGGAC ACCGGCTGGA TCGTTGGCCA CTCCTACATC
GTGTATGCGC CGCTCTGCGC GGGTGTCACC ACTCTTTTGC GCGAAGGCGC GATTGATTAT
CCCGAACCCT CTGCGGCGTG GCAGATCATC GAGCGCTACG GCGTGACCAA GATGTTCACG
GCGCCGACAG CCATCCGCAT GTTTATGCGC TTCGGCGAAT CGCTGCCGTT GTCTTACGAC
CTGACAACAC TGCGCGTAGT CGCCTGCGCG GGCGAGCCGC TGAATCCCGA AGCTTGGCGC
TGGGCGCAGA CCTATATCGC CGGCGACGGC AAATGGGGAT ACGTCATTGA TAACTGGTGG
CAGACCGAAC TCGGCGGTCC GACCCTTGGT ACGCCCGTCA CCAAGGCCAT GCGCGCTGGT
AAAGCTGGAT TGCCGCTGCC CGGTGTCGAA GCCGACGTGG TGGACATGGA AGGGAAGCGT
TCGCCCGATG GTGTGCAGGG CCGATTGATC TTGAAACGAC CTTTCCCGCA CATGATGCGC
ACAATCTGGA AGAACGACGC CCGCTGGGAA CGCGAGTGGC AGGAGATCCC CGGCTGCTAC
ATGACCGGCG ACGTCGCCGT TCGCGACAAA GATGGCTACA TCGCGGTGAT CGGCCGCGCC
GACGACGTGC TCAACGTCGC AGGCCACCGT ATTGGTACCG CGGAAGTGGA AAGCGCCTTG
GTTTCGCACC CGGCGGTTGC GGAAGCCGCA GCGATTGGCA TCCCCGACGC GCTGAAGGGC
GAGTCCATCA AGGCTTTCGT GCAGCTCCGC GCCGGCCACA ACGCCAGCGA CAACCTGAAA
GCCGCGCTGG TGGACCACGT TCGCCGCGAA CTTGGCCCGA TCGCCACGCC GTCAGCGATT
GACTTCGTTC CATCACTACC GAAAACACGA AGCGGCAAGA TCATGCGCCG GTTGTTAAAG
GCGCGTGAAA CCGGAGCGGA CATCGGCGAT CTTTCGACAC TGGAGCAGTA G
 
Protein sequence
MSTVAPAPGV QSYVAKEEQY DAPQIVVKEA LLQHWDEEYA RSIADNDAFW GEYAKNFRWT 
KPFQTVSEAN GAHHKWFLGG KTNITLNALD RHAKSERRNR VAYIWLCEDG SERVVTYGQL
YRMVCRFANG LRSIDVNKGD RVVIYMPLTI ECIVAMLACA RVGAIHSVVY AGLGHQALRD
RIEDAQAKVV IAGECTYRRG KTVALKPIVD EAIDGLEFVE HVVVYQRSKG QFEAASRREI
DFFALMKFSS ECPAEEMDAE DPLFILYTSG TTGKPKGVVH VHGGFMVGTT YHLRSFLDIG
EQDIFWNTSD TGWIVGHSYI VYAPLCAGVT TLLREGAIDY PEPSAAWQII ERYGVTKMFT
APTAIRMFMR FGESLPLSYD LTTLRVVACA GEPLNPEAWR WAQTYIAGDG KWGYVIDNWW
QTELGGPTLG TPVTKAMRAG KAGLPLPGVE ADVVDMEGKR SPDGVQGRLI LKRPFPHMMR
TIWKNDARWE REWQEIPGCY MTGDVAVRDK DGYIAVIGRA DDVLNVAGHR IGTAEVESAL
VSHPAVAEAA AIGIPDALKG ESIKAFVQLR AGHNASDNLK AALVDHVRRE LGPIATPSAI
DFVPSLPKTR SGKIMRRLLK ARETGADIGD LSTLEQ