Gene Acid345_0726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0726 
Symbol 
ID4069798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp890319 
End bp891419 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID637982732 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_589805 
Protein GI94967757 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.642345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTCC GAAATGTTTT GATTGCTGCC GCTGCAGGAT CGGTGGCGGC CGGCGCCGTC 
GCGGGTGTAG GCGCGGTGCT GCTCGGACGC GCGATCTGGA AGGGCTTCCG CGAAATCAGC
CTGCGCGGCA AAATCGCACT GGTCACCGGC TCTTCGCGCG GACTCGGCTT GGCAATGGCG
CAGGAACTCG CGCGCGAAGG TGCGCGCCTC GTGATCTGCG CGCGCGATGA AGCAGAACTG
CGCTGGGCAC AAGAGGAACT GCAGGCCATC GGCGCCGAGG TTCTTGCAGT CCCCTGCGAC
GTTGGCGATC GCGAGCAAGT TCGCAAGATG TTCGAGCAAA TCCGTGCGCG CTATGGCGCA
CTCGACGTGC TCATTAACAA CGCGGGCGTG ATCCAGGTCG GCCCACTGCA CTCGCAGACG
ATCGAAGACT TCGAAGAAGC CATGCGCGTG ATGTTCTGGG GCTTGGTCTA TCCGACACTC
GAAGCGATTC GTGACATGCG TGCCATGGGC TCTGGGCACA TTGCAAACAT TACTTCAATT
GGCGGTCGGA TCGCGGTGCC GCATCTCGTT CCTTACTCGG CTGCGAAGTT TGCCGCTATC
GGTTTCAGTG AAGGTATCCA CGCGGAGCTT GCGAAAGACG GCATTACGGT CACGACCGTT
GTTCCCGGCC TGATGCGCAC GGGCTCTCAC GTGAATGCAC GCTACAAGGG CGACCACCGC
AAGGAATATG CGTGGTTCAG CCTGGGCGCG AGTACACCAG TCACGGCGAT GAGCGCTCAC
CGCGCCGCGC GCTGCGTGAT CACCGCGGTG AAACGCGGAC GTTCGAGCGT TACTCTGTCG
CCGCAGGCGA AGCTCGCGGC GTTAGCGCAC GGCATCGCTC CCGGACTCGT AAGTGATGCG
TTGGGCATGG TGAACCGTAT TATGCCGGGC AGTCGTTCCA CATCCACGGA GGGTTTCACC
GGAAAAGAAA GCCGCAGCGT GGTGTCGGAA TCCTTCGCGA CCAAACTCGG CCGTGATGCC
GCGGAGGAAC TGCATCAGAA TCCTGAGCAG CGGCGCACAG ACAACGCAAC CAGCGGCCCA
GAAGCGTTGC TCGCGGATTA A
 
Protein sequence
MRVRNVLIAA AAGSVAAGAV AGVGAVLLGR AIWKGFREIS LRGKIALVTG SSRGLGLAMA 
QELAREGARL VICARDEAEL RWAQEELQAI GAEVLAVPCD VGDREQVRKM FEQIRARYGA
LDVLINNAGV IQVGPLHSQT IEDFEEAMRV MFWGLVYPTL EAIRDMRAMG SGHIANITSI
GGRIAVPHLV PYSAAKFAAI GFSEGIHAEL AKDGITVTTV VPGLMRTGSH VNARYKGDHR
KEYAWFSLGA STPVTAMSAH RAARCVITAV KRGRSSVTLS PQAKLAALAH GIAPGLVSDA
LGMVNRIMPG SRSTSTEGFT GKESRSVVSE SFATKLGRDA AEELHQNPEQ RRTDNATSGP
EALLAD