Gene Acid345_1743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1743 
Symbol 
ID4072010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2115301 
End bp2116425 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content57% 
IMG OID637983751 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_590818 
Protein GI94968770 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.789152 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACTG CGACACAAAT TGGCAATGAG AAGGATGAGG CGATTCCCGC TTCGATGAAA 
GCTGCGGTGT ACCACGGCCT CAATGATGTC CGCCTCGAGA CTGTACCGGT GCCCGAGATC
GGGCGTGGGG AAGTGCTCAT CCGGGTTGCG TCTTGCGGTA TTTGCGGCAC CGACCTAAAG
AAGATCTCGA CGGGTTCCCA TTCCGCGCCC CGCATTTTTG GCCACGAGAC TGCCGGAGTC
ATTGCTGCAG CTGGCGACGG CGTGACAAAG TTTCAAGTCG GCGATCGCGT TCTCGCCTTC
CACCACATCC CATGCCAGGA GTGCTACTAC TGTCGACACA AAGTGTTTGC GCAGTGTCCG
ACATACAAAA AGGTCGGGGT CACGGCAGGC TACGAACCAA GCGGTGGGGG ATTCTCCGAG
TATGTGCGAG TAATGGACTG GATCGTGGAT CGGGGCGGCG TAGTGAAACT CCCTGATCAT
GTTTCGTATG ACCTCGCAAC CTTCGTGGAG CCGGTGAACA CCTGCCAGAA GGCTATTGAA
ACGATGGCGC TTAAGTCTGG GGAAACGGTG TTGGTGATCG GCCAAGGTGC AATTGGCATG
ATCTTGGCGT TCCTGGCACA AAGGGCCGGA GCAACGGTGA TTACCTCCGA TTTGTTCCCA
CAGAGGCTTA CAATAGGAGA GTCTCTGGGG CTGAAAAACG GGATAGATGC CAATCAGACA
GACGTTGTGG CCCATATGCG CGACCTTACG GAAGGACGCG GTGCCGATGC GTCGATTTTG
GCGGTTCCGG TGAACGGCTT GATCCGCACG GCTATGGACG CGGTCCGGCC CGGCGGTAGA
GTGATGTTAT TTGCGCACAC TCAGCGCACG GAGGCGAAGT TCGACCCTTC CGCGGTGTGT
ATGGATGAGA AAACGCTGCT GGGTTCGTAT AGCGCTTCAG TAGATTTACA GAAAGATTCG
GTAGATTTTG TATTCAGCCG CGAGATGGAC CTGGAAAAGT TGATCTCGCA TCGCTTCCCG
CTGGAGCAGG CGGTGGAAGC TTTGCAGTTG GCGGCGCGCC CGCAACCGGA TTCGCTCAAG
ATCATGATCG AACCTCGTAT GGCGTGGGAA GGACAGGCGA AGTGA
 
Protein sequence
MSTATQIGNE KDEAIPASMK AAVYHGLNDV RLETVPVPEI GRGEVLIRVA SCGICGTDLK 
KISTGSHSAP RIFGHETAGV IAAAGDGVTK FQVGDRVLAF HHIPCQECYY CRHKVFAQCP
TYKKVGVTAG YEPSGGGFSE YVRVMDWIVD RGGVVKLPDH VSYDLATFVE PVNTCQKAIE
TMALKSGETV LVIGQGAIGM ILAFLAQRAG ATVITSDLFP QRLTIGESLG LKNGIDANQT
DVVAHMRDLT EGRGADASIL AVPVNGLIRT AMDAVRPGGR VMLFAHTQRT EAKFDPSAVC
MDEKTLLGSY SASVDLQKDS VDFVFSREMD LEKLISHRFP LEQAVEALQL AARPQPDSLK
IMIEPRMAWE GQAK