Gene Acid345_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3044 
Symbol 
ID4071951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3615673 
End bp3616689 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content59% 
IMG OID637985063 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_592119 
Protein GI94970071 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.165067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA TGCGTGCTGT TCAAATTGCT GCGGCCCATG GCGCATTCGA ACTTGTCGAA 
CGTGATATCC CTCAACCCGG TGCCCGCGAA GTGCGAATCA AAGTGCAGGC TTGCGGCGTC
TGCCATAGCG ATTCAGTCGT AAAAGAAGGC ATCATGCCGA CCAGCTATCC GCGCGTTCCC
GGCCACGAAG TTGTGGGCGT GATTGATGCC CTCGGCAAAG ACGTGCCGCG CTGGAAGGTT
GGCGACCGAG TTGGCGTTGG CTGGAATGGC GGCTACTGCG GCTATTGCGA CAACTGTCGT
CGTGGCGATT TCTTCGCCTG CACTTCGGGC CCGTTCATCA CCGGACTGAC CTCAGATGGC
GGTTACGCGG ACTACATGAT CGCCCGCCCC GAAGCGCTTG CCCTCGTGCC CACAGATCTC
TCACCGGAAG ACGCTGCCCC TCTGATGTGT GCTGGCGTCA CCACCTACAA CTGCTTGCGC
AACAGCGGCG CCATACCGGG TGATCTCGTC GCTGTCCTCG GCATCGGCGG GCTCGGCCAT
CTCGCGGTGC AGTACGCAGC GAAATCTGGC TATCGCACCG CGGCCATCGC GCGCGGTGCC
GACAAAGCTG CGCTCGCGAA ACAACTCGGT GCGCATCATT ACATCGACAC CGAGAAAGAA
GATCCGTCTA AGGCTTTGCA AACGCTCGGC GGCGCGAAGG TGATCCTCTC CACTGTTACC
GCAGCCGATG CTATGGAAGC GACTCTCGGC GGACTCGCCA TTCGCGGCAA GTTCTTCCTG
ATTGGCGCAG TGCCCTCGAT GAAGATCAAT CCACTCCAGA TGCTCACGTT CCGCCAGGGC
GTGGAAGGTT GGTATTCGGG AACGTCGATT GATTCGCAGG ACACGCTGAA CTTCAGCGTG
CTCGAGAACG TCCGGTCAAT GAATGAGGTC TATCCGCTGG AGAAAGCCGC CGAAGGCTAT
GAGCGAATGC TGAGCGGCAA AGCGCGTTTC CGCGTCGTGC TAAAAACAGG AAATTAA
 
Protein sequence
MAKMRAVQIA AAHGAFELVE RDIPQPGARE VRIKVQACGV CHSDSVVKEG IMPTSYPRVP 
GHEVVGVIDA LGKDVPRWKV GDRVGVGWNG GYCGYCDNCR RGDFFACTSG PFITGLTSDG
GYADYMIARP EALALVPTDL SPEDAAPLMC AGVTTYNCLR NSGAIPGDLV AVLGIGGLGH
LAVQYAAKSG YRTAAIARGA DKAALAKQLG AHHYIDTEKE DPSKALQTLG GAKVILSTVT
AADAMEATLG GLAIRGKFFL IGAVPSMKIN PLQMLTFRQG VEGWYSGTSI DSQDTLNFSV
LENVRSMNEV YPLEKAAEGY ERMLSGKARF RVVLKTGN