Gene Acid345_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3662 
Symbol 
ID4072265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4333444 
End bp4334724 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content61% 
IMG OID637985685 
ProductS-adenosyl-L-homocysteine hydrolase 
Protein accessionYP_592737 
Protein GI94970689 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0499] S-adenosylhomocysteine hydrolase 
TIGRFAM ID[TIGR00936] adenosylhomocysteinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.253183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACGA CATCTACTTC GAACGTCGCG TGCGACATCG CGAACATTGA ACTGGCGGAC 
CTGGGCAAAA AGCGCATCGA ATGGGCGAAC CAGTCGATGA AAGTGCTGCA GATCATCCGC
AAGGATTTCA TCAAGAACCA GCCGCTGAAG GGCTTCCGCA TCAGCGCCTG CCTGCACGTG
ACGGCCGAGA CCGCCAACCT GATGATCACG CTGCGCGACG GTGGCGCCGA GGCTGTTCTG
TGCGCCTCGA ACCCGCTCTC GACACAGGAT GACGTGGCTG CCTCACTCGT CCGCGACTAC
GGTATTCCCG TCTACGCGAT CAAGGGCGAG GACAACGATA CCTACTACTC GCACATCATG
GCGGCGCTCG ACCACAAGCC GCACATCACG ATGGATGACG GCGCCGACCT CGTGACGATC
GCCCTGACCA AGCGCAAGGA CGCGCTGGAG CACGTGATCG CAGGCACCGA AGAGACGACC
ACCGGCGTCA TCCGCCTGCG CGCGATGGCG AAGGACGGCA TGTTGAAGTA CCCGATCATC
GCGGTGAATG ATGCCGACAC CAAGCACATG TTCGACAACC GCTACGGCAC CGGTCAGTCC
ACGATTGACG GTATCGTGCG CGCGACGAAC TTCCTGCTCG CAGGCGCGAA GTTCGTGGTC
GCTGGCTACG GCTGGTGCGG ACGCGGTTTG GCTTCGCGTG CGCGCGGCCT TGGAGCCGAG
GTCATCGTGA CCGAAATCGA TCCCACGAAG GCGATCGAAG CCGTGATGGA CGGCTACCGC
GTGATGTCAA TGCACGAAGC GGCACAGCTT GGCGATGTGT TCTGCACCGT GACCGGCAAT
AAGAGCGTTC TGCGCAAGGA ACACTTCGAG TTGATGAAGG ACGGCGCGAT CATTTCGAAC
TCCGGCCACT TCAACGTCGA GATCGACATT CCGGCGCTGG AAAAGCTGTC GTCGTCGAAG
CGCACGACCC GCACGTTTGT GGATGAGTAC TCGCTGAAAG ATGGCCGCAA GATCAACCTG
CTGGGCGAAG GCCGCCTGAT CAACCTGGCC AGCGCGGAAG GCCATCCGCC GTCCGTGATG
GACATGAGCT TCGCCGACCA GGCGCTCTCG CTCGACTACC TGGTGAAACA CCACAAGACG
CTCGAGAAGA GCGTGTTCAA GGTTCCGGAA GAACTCGACA AGCGGGTTGC GAAGCTGAAG
CTGGAGTCGA TGGGCGTGAA GATCGACAAG CTGACGCCGG AGCAAGAAGA GTACCTGGCG
GGCTGGAGCG AAGGAACATA G
 
Protein sequence
MATTSTSNVA CDIANIELAD LGKKRIEWAN QSMKVLQIIR KDFIKNQPLK GFRISACLHV 
TAETANLMIT LRDGGAEAVL CASNPLSTQD DVAASLVRDY GIPVYAIKGE DNDTYYSHIM
AALDHKPHIT MDDGADLVTI ALTKRKDALE HVIAGTEETT TGVIRLRAMA KDGMLKYPII
AVNDADTKHM FDNRYGTGQS TIDGIVRATN FLLAGAKFVV AGYGWCGRGL ASRARGLGAE
VIVTEIDPTK AIEAVMDGYR VMSMHEAAQL GDVFCTVTGN KSVLRKEHFE LMKDGAIISN
SGHFNVEIDI PALEKLSSSK RTTRTFVDEY SLKDGRKINL LGEGRLINLA SAEGHPPSVM
DMSFADQALS LDYLVKHHKT LEKSVFKVPE ELDKRVAKLK LESMGVKIDK LTPEQEEYLA
GWSEGT