Gene Acid345_1535 details


Gene Information

Locus tag: Acid345_1535
Symbol:
ID: 4072926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1873660
End bp: 1875141
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 58%
IMG OID: 637983544
Product: aldehyde dehydrogenase (acceptor)
Protein accession: YP_590611
Protein GI: 94968563
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
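
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1873660, 1875141)
print(length_bp)                  # 1482
print(protein_length(length_bp))  # 493
```

Both values match the Gene Length and Protein Length fields listed for Acid345_1535.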


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.31089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.378909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACGG CTACGAAAAC CAGCACCAAA ACCTACCAGA TGTACATCGA TGGCAAGTTC 
GTCGAAAGCG CCACGGGAAA GTTGTTCCCC GTTTACGATC CTTCGACGGA AGAAGTGATT
GCTGAGTGTC CGGCAGGAAA CGCTGCCGAT GTTGACCGTG CCGTTGCCGC CGCCCGAACG
GCTTTTGACG ATGGCCCCTG GAACGCAACC ACCGCGCAGG AGCGCGGACG CGTCCTCTTC
CGCATTGCTG AAAAGATCCG CCAGCACACG GCTGAACTCG CCGAGATCGA GTGTCGCAAT
TCCGGCAAGC CCATCGTCGA AGCCGAGTAC GACATTGCCG ATTGCGCTAC CTGTTTCGAG
TACTACGGCG GGCTCGCGAC GAAGATCACC GGCCAGGTGA ATCCGGTTCC TGACAACGCA
GTTAGCCTTT CGCTAAAGGA ACCGATCGGG GTCGCAGGGC AGATCATTCC GTGGAACTAT
CCGCTGGCGA TGGCCGCGTG GAAGCTTGCT CCCGCCATTG CTGCGGGCTG CACCGTTGTC
ATCAAGCCTG CCGAACAGAC GCCCCTCACG TTGCTGGAAC TTGCGAAGCA CTTCGAAGAA
GTGGGGCTGC CGAATGGCGT AGTAAATGTT GTGACCGGAT ACGGTGAAGA AGCGGGTGCG
CCCATCGTCG CGCATAAAGA TGTGGACAAG ATTGCCTTCA CCGGCAGTGG AAGCGTGGGC
AAGCTGATTA TGCGCGAGGC GAGCGCCACA CTGAAGCGCG TCTCGCTCGA ACTCGGCGGT
AAATCGCCGA ATATTTTCTT CGCCGATGCC GACTTTGAAG CTGCGATTGA TGGCGCACTC
TTCGGCGTCT TCATCAACCA GGGCGAAGTC TGCTCCGCGG GCAGCCGAAT TCTTGTCGAG
CGCCCGATCT ATAAGAAGTT CGTGGAGGCG ATGACCGAAA AGGCCAAGAA GATAAAACTC
GGCGCGCCGC TCGATCGCGA TACGAAGATG GGTCCGTTGG TCAGCAAAGA CCAATATGAG
CGCGTTCGCG AGTATCAGGA GATCGGCAAG AAAGAAGCGA AGGTCGCATC CGGCGGCAAT
CGCCCCAGCG GCTTCGCTAA GGGCTATTAC GTCGAACCGA CGATCTTCTA TGACGTGGAC
AATAGCGCTC GCATTGCGCG CGAAGAAATC TTTGGACCGG TGGCGTCCGT CATCCCATTC
GACAACGAAG CTGATGGCCT GAAGATCGCC AACGACACGC CCTTCGGCCT CGCCGCTGCG
GTTTGGACGC GCGACATCTT CAAGGCATTC CGCATGGTGA AGAAGATCCG CGCCGGCATC
GTGTGGGTGA ACCACATGCA ACCCACGTAT TACGAGGCGC CGTGGGGCGG ATACAAGCAA
TCCGGCTTTG GACGCGAGCT CGGCCCGTGG GGAGTAGAGG AATACCTCGA GACCAAGCAG
GTGCACATCA ACCTCAGCGA ACAGCCGATT GGGTGGTACT GA
 
Protein sequence
MDTATKTSTK TYQMYIDGKF VESATGKLFP VYDPSTEEVI AECPAGNAAD VDRAVAAART 
AFDDGPWNAT TAQERGRVLF RIAEKIRQHT AELAEIECRN SGKPIVEAEY DIADCATCFE
YYGGLATKIT GQVNPVPDNA VSLSLKEPIG VAGQIIPWNY PLAMAAWKLA PAIAAGCTVV
IKPAEQTPLT LLELAKHFEE VGLPNGVVNV VTGYGEEAGA PIVAHKDVDK IAFTGSGSVG
KLIMREASAT LKRVSLELGG KSPNIFFADA DFEAAIDGAL FGVFINQGEV CSAGSRILVE
RPIYKKFVEA MTEKAKKIKL GAPLDRDTKM GPLVSKDQYE RVREYQEIGK KEAKVASGGN
RPSGFAKGYY VEPTIFYDVD NSARIAREEI FGPVASVIPF DNEADGLKIA NDTPFGLAAA
VWTRDIFKAF RMVKKIRAGI VWVNHMQPTY YEAPWGGYKQ SGFGRELGPW GVEEYLETKQ
VHINLSEQPI GWY
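
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first row of the gene sequence, so its GC fraction is not expected to equal the whole-gene GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, used only as a short demo input.
first_row = "ATGGATACGG CTACGAAAAC CAGCACCAAA ACCTACCAGA TGTACATCGA TGGCAAGTTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` and comparing `len(...)` with the Gene Length field is a quick way to check that no rows were lost in copying.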