Gene Acid345_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1531 
Symbol 
ID4073019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1868021 
End bp1869466 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content59% 
IMG OID637983540 
Productaldehyde dehydrogenase 
Protein accessionYP_590607 
Protein GI94968559 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA TCGTCGCGGC CCCGGCGGCA TCGTATCCGT TCCTGCTCAA CGGCGAGTGG 
ATCTCCGACG GTAGTCCCGT CGAGATCCAC TCGCCTTTTG ACCACAAAGT TATCGGCCAG
GTCTTCTACG GTTCTTCCGC TCACGTGGAA GCGGCCATCC GCGCAAGTGT TGAGGCGTTC
CAGATCACCC GAAAGCTGGG CAGCTACGAG CGCGAACGAA TCCTGAGTGC GATCTCCCAA
AAGCTTTCCG AACAGCGCGA AGACTTTGCG CACACGATTG CACTCGAAGC CGGCAAGCCA
ATCAAGACGG CTCGGCAGGA AGTCGAACGT GCGATTTACA CATTCAAGGT TGCGGCAGAA
GAAAGCACGC GCATCGAGGG AGAGTACCTT CACCTCGATA CGATTGAGGC GACAAAAGGG
CGATGGGGAA TTGTGCGGCG CTTCCCTATC GGCCCGATCT TTGCGATCAC GCCGTTCAAT
TTCCCGCTGA ACCTCGTGGC GCACAAGCTC GCGCCTGCAA TTGCCGCAGG GTGCCCAGTC
ATCCTAAAGC CCGCACCGCA GACGCCGATC ACCGCTCTGA AGCTCGCGCG CGTGATTCAC
GAATCAGGCT GGCCGGCAGG CGCACTCACC GTCATGCCGC TGTCGAACGA AGACGCAAGC
CTGCTCGTCA CCGACGAACG CATCAAGCTA CTCACGTTCA CTGGCAGCTC CATTGGTTGG
GACCTGAAGA GCAAAGCCGG CAAAAAGCGG GTGCTCCTCG AACTCGGCGG AAATGCCGCC
ATCATTATCC ATTCCGACGC TGATCTCAGG TTCGCTGCCG AGCGATGCGC GCACGGCGCA
TTCGGTTACG CCGGTCAAAG CTGCATTTCC GTTCAGCGGA TCCTGGTGGA AAAGAGCGTC
TATAGCGAGT TCCGCCAAAT GCTCGTAAAC GCAGCCGGAA AACTAAAGAC CGGAGACCCT
CTCGATGAGG CGACCGACGT CGGGCCACTC ATTCGCGAAT CGGACGCCTT GCGTGCGGAA
TCATGGGTGA AGGAAGCGGT GGCTCAGGGC GCGACCTTGC TCTGCGGTGG GACCCGCAAA
GGCAGCTTGC TCGAGCCCAC CGTGCTGACG AATACGCGCC CGGAGATGCT GGTCAATTGC
CGCGAAATCT TCGCCCCCGT GGTCACCGTG GAAGCATACG ACGACTTCAA CGAAGCCCTG
AGGCAAGTCA ACAATTCGCC ATTCGGTCTG CAAGCAGGCA TTTTGACTCG CGATGCGCAG
CGCATCTTCA CGGCCTTTAA CGGGCTCGAT GTTGGCGGGG TTGTGGCAGG CGACGTACCG
ACCTTCCGCA TTGACCACAT GCCCTACGGC GGGATCAAAG ATTCAGGTCT CGGACGCGAA
GGCGTGCGCT ATACGATTGA GGAAATGACC GAGCCGAAGT TACTCGTGAT GAACCTCGGC
GCGTAG
 
Protein sequence
MSEIVAAPAA SYPFLLNGEW ISDGSPVEIH SPFDHKVIGQ VFYGSSAHVE AAIRASVEAF 
QITRKLGSYE RERILSAISQ KLSEQREDFA HTIALEAGKP IKTARQEVER AIYTFKVAAE
ESTRIEGEYL HLDTIEATKG RWGIVRRFPI GPIFAITPFN FPLNLVAHKL APAIAAGCPV
ILKPAPQTPI TALKLARVIH ESGWPAGALT VMPLSNEDAS LLVTDERIKL LTFTGSSIGW
DLKSKAGKKR VLLELGGNAA IIIHSDADLR FAAERCAHGA FGYAGQSCIS VQRILVEKSV
YSEFRQMLVN AAGKLKTGDP LDEATDVGPL IRESDALRAE SWVKEAVAQG ATLLCGGTRK
GSLLEPTVLT NTRPEMLVNC REIFAPVVTV EAYDDFNEAL RQVNNSPFGL QAGILTRDAQ
RIFTAFNGLD VGGVVAGDVP TFRIDHMPYG GIKDSGLGRE GVRYTIEEMT EPKLLVMNLG
A