Gene Acid345_1381 details

Gene Information

Locus tag: Acid345_1381
Symbol:
ID: 4068916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1672429
End bp: 1673919
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 60%
IMG OID: 637983390
Product: aldehyde dehydrogenase (acceptor)
Protein accession: YP_590457
Protein GI: 94968409
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
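
The coordinates and lengths in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both are assumptions here; the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
# Values copied from the Gene Information fields above.
START_BP = 1672429
END_BP = 1673919
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1491
print(expected_protein_length(GENE_LENGTH_BP))  # 496
```

Under these assumptions both printed values match the Gene Length and Protein Length fields.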


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.514457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTCG TGTCGGCCGT AGAACTGAAC AGCAACGTCA GCCAATTCAT CACAAAGCCG 
CGAAAGATGC TGATCGGCGG GAACTGGATC GATAGTGCGT CGGGTAAATT CTTTGAGACC
CTGAATCCGG CGACTGGCGA AGTACTGGCG CGAGTTGCCG AAGGCGATCG TGCCGACATT
GATCTCGCCG TCGCCGCGGC GCGGAAGGCA TTTGAGAGCG GACCGTGGTC GAAAATGTCG
CCGTCGCAAC GGGGACGTCT TTTGTGGAAA CTCGCGGACC TGCTCGAGCA GCACCTGGAA
GAGTTCGCCG AGTTGGAATC GCTCGACAAT GGGAAGCCGC TGTCGGTGGC GCGGGTGGCC
GACGTTCCGC TCGCGGTAGA CCTTTTCCGT TACATGGCTG GCTGGGCGAC GAAGGTGGAA
GGCAACACGA TTCCGCTGGG CCCGCAGTTC CATGCTTATA CCTATCGCGA ACCGGTGGGC
GTAATCGGCC AGATCATTCC CTGGAACTTT CCGCTGCTGA TGGCCGCGTG GAAACTGGGT
CCGGCGCTGG CGGTTGGGTG CACGGTGGTG TTGAAGCCGG CGGAACAGAC ACCTCTCTCC
GCGCTGCGCC TGGGCGAACT GATCATGGAA GCAGGCTTCC CCGATGGCGT GGTGAACGTT
GTGCCGGGCT TCGGCGAAAC TGCCGGCGCT GCGCTGGCCG CCCATCCGGA CGTCGACAAG
ATTGCGTTCA CCGGATCGAC AGAAGTCGGC AAATTGATTG TGCAGGCTGC CGCAGGCAAC
CTGAAAAAAG TCTCTCTCGA ACTGGGCGGC AAGTCGCCGA ATATCGTGCT CGCTGATGCG
GACCTGGACA TTGCGATATC AGGCAGCGCG AACGCGATCT TTTTCAATCA CGGCCAGTGC
TGCTGCGCGG GCTCACGGCT GTTCGTACAC AAGAGCCAGT TCGACAAAGT GGTGGAGGGT
GTGGCCGAAG CCGCAAAGAA CATTCGCTTG GGATCTGGGC TTGATCCGGC AACCAACATG
GGTCCGCTGG TTTCGCAGGA GCAACTCGAT CGCGTGTGCG GGTATCTCGA ATCTGGGGTG
CAACAAGGAG CAAAACCCCT GGTTGGCGGG AAGAAACAGA CGGGGCCGGG CTACTTCGTG
GAGCCAACGG TGCTGGTGGA TGTGAAGCCG ACGATGAAAG TCGTTTGCGA AGAGATCTTC
GGACCCGTGG TCACGGCGAT CCCGTTCAAC AGCGTGGACG AGGTGTTGAA CTCAGCCAAT
GCGTCGAGCT ACGGTCTCGC GGCAGCGGTG TGGACGCGCG ACATTAACAA GGCGCATTCA
CTGGCGGCAA AGCTGCGCGC CGGCACAGTG TGGGTGAATT GTTACAACGT GTTCGACGCC
GCGCTGCCGT TTGGGGGTTA TAAGCAATCG GGCTGGGGAC GCGAGATGGG GCACGACGCA
CTCGAGCTCT ACACCGAGAC CAAAGCGGTC TGTGTGCGCC TGGAAAACTA A
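
The gene sequence is displayed in blocks of ten bases, so the spacing has to be removed before the sequence is used. The sketch below strips the whitespace and computes a GC percentage; the helper names are illustrative, and only the first displayed line is used as input to keep the example short.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATGTCAGTCG TGTCGGCCGT AGAACTGAAC AGCAACGTCA GCCAATTCAT CACAAAGCCG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.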
 
Protein sequence
MSVVSAVELN SNVSQFITKP RKMLIGGNWI DSASGKFFET LNPATGEVLA RVAEGDRADI 
DLAVAAARKA FESGPWSKMS PSQRGRLLWK LADLLEQHLE EFAELESLDN GKPLSVARVA
DVPLAVDLFR YMAGWATKVE GNTIPLGPQF HAYTYREPVG VIGQIIPWNF PLLMAAWKLG
PALAVGCTVV LKPAEQTPLS ALRLGELIME AGFPDGVVNV VPGFGETAGA ALAAHPDVDK
IAFTGSTEVG KLIVQAAAGN LKKVSLELGG KSPNIVLADA DLDIAISGSA NAIFFNHGQC
CCAGSRLFVH KSQFDKVVEG VAEAAKNIRL GSGLDPATNM GPLVSQEQLD RVCGYLESGV
QQGAKPLVGG KKQTGPGYFV EPTVLVDVKP TMKVVCEEIF GPVVTAIPFN SVDEVLNSAN
ASSYGLAAAV WTRDINKAHS LAAKLRAGTV WVNCYNVFDA ALPFGGYKQS GWGREMGHDA
LELYTETKAV CVRLEN
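
The protein sequence can be related to the gene sequence by codon translation. The sketch below assumes the record's translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The table layout and the `translate` helper are illustrative and not part of the source record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 20 bases and last 12 bases of the gene sequence shown above.
print(translate("ATGTCAGTCG TGTCGGCCGT"))  # MSVVSA
print(translate("CTGGAAAACT AA"))          # LEN
```

The two outputs match the first six and last three residues of the protein sequence in the record.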