Gene Acid345_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1459 
Symbol 
ID4069609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1761928 
End bp1763445 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content60% 
IMG OID637983468 
Productaldehyde dehydrogenase 
Protein accessionYP_590535 
Protein GI94968487 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA CTACCGCGGA AACCGCGAAG GCGACCGTTT ACAAGAACCT TATTGATGGC 
GAATGGGTCG AGTCGAAGTC CGGCCAGACC TTCGAGAACC TCAACCCAGC CGATACCCGC
GAAGTGGTTG GCATCTTCCA ACGCAGCGGC AAAGAAGACG TTGAACACGC CATTGACGCC
GCCAGCGAAG CCTATAAGAA GTGGCGTCTC GTGCCGGCAC CACGACGTGC CGAGCTCCTC
TTCAAGGCTG CCGCTATCCT CGAGCAGCGT AAGGAAAAGT ACAGCCAGGA AATGACCCGC
GAGATGGGCA AGGTCATTAA AGAGACCCGC GGCGACGTCC AGGAAGCCAT CGACGCTGGT
TACTACAACG CCGGCGAAGG TCGCCGCATG TTCGGACCGA CCACGCCGTC GGAGCTGCCG
AACAAGTTCG CCATGGCCGT CCGCCAGCCA CTGGGCGTCT GCGCCATGAT CACCCCGTGG
AATTTCCCGA TGGCGATCCC GTCATGGAAG CTCTTCCCCG CGCTCGTCTG CGGCAACACC
GCCGTGATCA AGCCAGCCCA GGACACGCCG CTTTCGACTT TCAACTTCGT CCAGGCGCTT
AACGATGCCG GCGTCCCCAA GGGGGTAATC AACATAGTTA CCGGCTACGG TGGCGAAGTC
GGCACCCCGA TGACCGAGCA CCCCGAAGTG AAGGCCGTAT CGCTAACCGG TTCCACAGCG
GTCGGTCGGA TCGTTGGCCA AGCCGCAGCG AAGAGCTTCA AGCACTGCTC GCTCGAACTC
GGCGGCAAGA ACCCCATGAT CGTCCTCGAC GACGCCAACC TCGATCTGGC TCTCGAAGGC
GCTCTCTGGG GATCGTTTGG CACCACCGGA CAGCGCTGCA CCGCGACCAG CCGCATCATT
CTTCAGAAGG GCATTTACAA GAAATTCGCC GACGAACTCG TCGCCCGTGC CAGGAAATTG
AAAGTTGGCA ATGGTCTCGA TGAGACCGTT GACATGGGAC CCGCAGTGAA CGAGAACCAG
ATGAACACCG ACCTCAAATA CATCGAGATC GGCAAGGGTG AAGGCGCCAA GCTCGCGCAC
GGCGGCCACC GCCTCGACAA GGGCGATTAC GCCCACGGCT GGTTCCTGGA GCCGACGATC
TTTACCGACG TCAACCACAA GATGCGGATT GCGCAGGAAG AGATCTTTGG GCCTGTCGTT
GCGCTGATCC CGTGCGACGA CCTCGATGAA GCCATCGAGA TCGCCAACGG CATCGAGTAC
GGCCTCTCGT CCGCGATCTA CACACGCGAC GTCAACAAGT CGTTCCGCGC CATGCGCGAT
CTCCATGCCG GCATCACCTA CGTCAATGCA CCGACGATCG GCGCGGAAGT TCACATGCCC
TTCGGCGGTG TAAAGGCTAC AGGAAATGGC CATCGCGAAG GCGGCATCGG CGCCCTCGAC
TTTTACACCG AGTGGAAGGC GATCTACGTG GATTACAGCG ACACGCTCCA GCGTGCACAG
ATCGACAACC GGGAGTAG
 
Protein sequence
MATTTAETAK ATVYKNLIDG EWVESKSGQT FENLNPADTR EVVGIFQRSG KEDVEHAIDA 
ASEAYKKWRL VPAPRRAELL FKAAAILEQR KEKYSQEMTR EMGKVIKETR GDVQEAIDAG
YYNAGEGRRM FGPTTPSELP NKFAMAVRQP LGVCAMITPW NFPMAIPSWK LFPALVCGNT
AVIKPAQDTP LSTFNFVQAL NDAGVPKGVI NIVTGYGGEV GTPMTEHPEV KAVSLTGSTA
VGRIVGQAAA KSFKHCSLEL GGKNPMIVLD DANLDLALEG ALWGSFGTTG QRCTATSRII
LQKGIYKKFA DELVARARKL KVGNGLDETV DMGPAVNENQ MNTDLKYIEI GKGEGAKLAH
GGHRLDKGDY AHGWFLEPTI FTDVNHKMRI AQEEIFGPVV ALIPCDDLDE AIEIANGIEY
GLSSAIYTRD VNKSFRAMRD LHAGITYVNA PTIGAEVHMP FGGVKATGNG HREGGIGALD
FYTEWKAIYV DYSDTLQRAQ IDNRE