Gene Acid345_4584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4584 
Symbol 
ID4071529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5431790 
End bp5432791 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content53% 
IMG OID637986624 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_593658 
Protein GI94971610 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.978307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCAT TCGTCGTCGA CCGTTACGGA AAAAAGGAGA GCGGACGAAT TGGCGAAATG 
CCGAATCCGA AGCTGCAGGA CGATGACGTA TTGGTCCAGG TGCATGCCGC TGGCGTAAAC
CTGCTGGATT CGAAAATCAA AAATGGCGAG TTCAAGCTGG TTTTGCCCTA TCGCTTACCG
CTGATTCTTG GCCATGACGT GGCTGGCGTG GTGGTTGAAG TCGGCCCGCG AGTACAACGT
TTCAAGCCTC GAGATGAGGT CTATTCGCGG CCAGATGACT TGCGTATCGG TGCATTCGCG
GAATTGATTG CGATCAAGGA AGATTCGCTG GCAATCAAAC CTAAAGCACT GACCATGGAA
GAAGCTGCTT CCATCCCGCT GGTCGGGTTA ACCGCATGGC AAGCGCTGGT CGACAAAGCG
AATCTGACGA AGGGACAAAA GGTCTTCATC CAGGCGGGCT CCGGGGGCGT GGGAACATTT
GCCATTCAGT TGGCGAAACA TTTGGGCGCC ACGGTGGCGA CAACCACGAG CGCCGCCAAT
TTTGATTTGG TGAAACAACT CGGCGCAGAT ATTGCGATCG ATTACAGGAA AGATGATTTC
GAAAAGATAT TGCGTGACTA TGACGTGGTG TTGAACAGCC AGGACAAGCC GACGCTGGAA
AAATCTCTAC GAGTGCTCAG ACCCGGTGGA AAGCTGATCT CGATCTCAGG TCCCCCCGAT
CCCGAGTTTG CGAAAGAAAG TGGATCGAAT TGGATCGTGA GATCGGTGAT GCGCGTTTTG
AGCTATGGAA TCAGGAAGCA GGCAAAGCGC CAGCGGGTCA GCTATTCATT TCTTTTCATG
AGAGCAAACG GGGACCAGTT GCGCGAGATT TCCGCCCTCG TGGATTCCGG GATCATACGC
CCGGTTGTGG ACCGCGTCTT TCCATTTGAA TCGACTGGGG AGGCTTTGGC TTACGTCGAA
AAAGGACGCG TAAAAGGCAA AGTCGTCGTT TCGTTGAGGT GA
 
Protein sequence
MKAFVVDRYG KKESGRIGEM PNPKLQDDDV LVQVHAAGVN LLDSKIKNGE FKLVLPYRLP 
LILGHDVAGV VVEVGPRVQR FKPRDEVYSR PDDLRIGAFA ELIAIKEDSL AIKPKALTME
EAASIPLVGL TAWQALVDKA NLTKGQKVFI QAGSGGVGTF AIQLAKHLGA TVATTTSAAN
FDLVKQLGAD IAIDYRKDDF EKILRDYDVV LNSQDKPTLE KSLRVLRPGG KLISISGPPD
PEFAKESGSN WIVRSVMRVL SYGIRKQAKR QRVSYSFLFM RANGDQLREI SALVDSGIIR
PVVDRVFPFE STGEALAYVE KGRVKGKVVV SLR