Gene Acid345_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0685 
Symbol 
ID4068775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp842617 
End bp844074 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content60% 
IMG OID637982691 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_589764 
Protein GI94967716 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGTCGC TGCACAGCTA CATTGGCGGA CGCTTCGTGC CCGGCAAACG GGGGTTTGCG 
GACATTAATC CCGCAGATGG AAGTGTTGCT GCCACCGTGA CGGAAGCGTC AGCCGAGATG
GTGGACGAAG CTGTTGCTGC GGCGCGCAAG GCGCTGCATG GGGAGTGGGG CAAACTCGGC
GCGCGTGGTC GAGCGACTGT TCTCTACAAG ATTGCGGAAG GAATTGAGAA GCGCTTTGAC
TGCTTTGTAC AGGCGGAGGT CGCAGACACG GGCAAGCCCT TATCGCTGGC CTCCCGACTG
GATGTGCCAC GCGCCGCCGC GAACTTTCGA GTGTTTGCTG ATGTGATCAA AATCGCGGGG
CTGGAAGCGT TCGAGACCGA ATTGCCGGAC GGTCGCCGCG CGCTGAATTA CACCGTGCGC
AAGCCGATTG GTGTAGTTGG GATCGTGACT CCGTGGAACT TGCCGCTGCT GCTACTGACG
TGGAAGGTGG CTCCGGCACT GGCGTGCGGA AACGCGGTGG TGGTGAAGCC GTCGGAAGAG
ACGCCAGCGA CCGCGACACT GCTAGCGGAA GTGATGCAGG AAGCGGGCGT ACCCGATGGC
GTCTATAACG TGGTGCACGG GTTCGGGCCA AACTCGGCGG GTGAGTTTCT GGTCAGCCAT
CCGGGCGTGA ACGCGGTGAC TTTTACGGGG GAGTCGCAGA CGGGCGCCTC TATTATGCGG
GCTTGCGCGC CGACCGTGAA GCCGGTTTCG TTTGAGTTGG GCGGCAAGAA TGCCGCCGTC
ATTTTTGCTG ACTGTGATTT CGACGCGACC ATCGCGGGCA TGAGTGATGC GGTGTTCCTC
AACACCGGCC AGGTTTGTCT GTGTGCAGAG CGCGTGTACG TGGAAAGACG AATCTTCGAT
AGGTTTGTCG CGGCTCTGAC GGAGCGCGCG AAGAGTTATG ACCTGGGATG GCCGATGGAA
CCGGCTACAT CGATGGGGCC TCTGATTTCG AAGGTGCATC GCGAGAAAGT TCTGTCTTAT
TTCGACCTGG CGCGCGAAGA GGGCGCAACT GTCGTAATCG GCGGTGGCGT GCCGACGTTT
GGTGATGGCC GCGATAGCGG CTTCTATGTA CAGCCAACGA TCTTCACGGG ACTGAAGGAA
TCGGCGCGCT GCGTGAAAGA AGAAATCTTC GGACCGGTGT GCCACGTTGC ACCGTTCGAT
TCCGAAGAAG AAGCAGTGGC GCTGGCGAAC GATACGCGGT ATGGTTTGGC GGCTTCAATT
TGGACGAGTG ATTTGCAGAG GGCGCACCGC GTGGCGCCGC AGATGAATGC AGGCATCACG
TGGGTGAATT GCTGGTTCCT GCGCGATTTG CGCACGCCAT TCGGCGGAGT TGGGCTATCG
GGAATTGGGC GCGAGGGCGG GATGCACTCG CTGAATTTCT ATTCCGAGTT GAACAACATC
TGCATTCGGA CCGAGTAA
 
Protein sequence
MKSLHSYIGG RFVPGKRGFA DINPADGSVA ATVTEASAEM VDEAVAAARK ALHGEWGKLG 
ARGRATVLYK IAEGIEKRFD CFVQAEVADT GKPLSLASRL DVPRAAANFR VFADVIKIAG
LEAFETELPD GRRALNYTVR KPIGVVGIVT PWNLPLLLLT WKVAPALACG NAVVVKPSEE
TPATATLLAE VMQEAGVPDG VYNVVHGFGP NSAGEFLVSH PGVNAVTFTG ESQTGASIMR
ACAPTVKPVS FELGGKNAAV IFADCDFDAT IAGMSDAVFL NTGQVCLCAE RVYVERRIFD
RFVAALTERA KSYDLGWPME PATSMGPLIS KVHREKVLSY FDLAREEGAT VVIGGGVPTF
GDGRDSGFYV QPTIFTGLKE SARCVKEEIF GPVCHVAPFD SEEEAVALAN DTRYGLAASI
WTSDLQRAHR VAPQMNAGIT WVNCWFLRDL RTPFGGVGLS GIGREGGMHS LNFYSELNNI
CIRTE