Gene Acid345_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0334 
Symbol 
ID4070096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp364377 
End bp365234 
Gene Length858 bp 
Protein Length285 aa 
Translation table11 
GC content61% 
IMG OID637982337 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_589413 
Protein GI94967365 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.841681 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.545556 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCGA AAGACTTGCT TCAGGGAAAA CGCGCACTCA TCACGGGCGG CGGCACGGGC 
CTCGGCAAAG CGATGGCACG GCGGTTTCTC GAACTCGGCG CGACGGTGTA CATCTGCGGG
CGTCGCGAAG AAGTGCTGCG CGGAACCTGC GAAGAGCTGA CGTCAGCGAC TGGGGGCGAG
ATTCACGGCA TTCCCTGCGA TGTACGAGAT CTCGCGGCGG TAGACACAAT GATCACGCAG
ATCTGGAACG ATGGCCCGCT CGATATTCTT GTGAACAACG CCGCCGGGAA TTTTCTGGCG
AAAACCGAGG AACTCTCTCC GCGCGCGTGG GAGGCCGTGA TTGGAATCGT GCTCAACGGA
ACGATCAATC TGACGATGGC ATGCGGACGT CGCTGGCTGG CGGAGAAGAA ACCCGCGAAC
GTGCTGAGCA TCGTCGCCAC CTATGCTTCT ACCGGCTCAG GCTCGGGCTA TGTGGTTCCG
TCGGCGGTCG CGAAAGCCGG AGTGTTGGCG CTTATGCGCA GCCTCGCTGT CGAGTGGGGA
CCTCGGGGTA TTCGTCTCAA CGCGATCGCG CCGGGGCCGG TGCCGACGGA AGGCGCCTTC
TCGCGATTGA TTCCAAGCGA TCAACTAGAA GAAATCGCAA AGCAACGCGT GCCAATGCGG
CGCTTCGGAC GACCGGAAGA GATTGCAGAT CTCGCAGCGT TTCTCGTGAG CGACGGTGCC
GGCTACATCA ACGGCGAGGT TGTCACCATC GACGGTGGCG AGTGGCTGCA AGGCGCGGGC
GAGTTCAACT ACGTCGGACA GATGATGACC GACGAAATGT GGGCGATGTT TAAGCCCGGC
AAAAAGCGCC GCGAATAA
 
Protein sequence
MFAKDLLQGK RALITGGGTG LGKAMARRFL ELGATVYICG RREEVLRGTC EELTSATGGE 
IHGIPCDVRD LAAVDTMITQ IWNDGPLDIL VNNAAGNFLA KTEELSPRAW EAVIGIVLNG
TINLTMACGR RWLAEKKPAN VLSIVATYAS TGSGSGYVVP SAVAKAGVLA LMRSLAVEWG
PRGIRLNAIA PGPVPTEGAF SRLIPSDQLE EIAKQRVPMR RFGRPEEIAD LAAFLVSDGA
GYINGEVVTI DGGEWLQGAG EFNYVGQMMT DEMWAMFKPG KKRRE