Gene Paes_1918 details


Gene Information

Locus tag: Paes_1918
Symbol:
ID: 6460049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2099629
End bp: 2101056
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 54%
IMG OID: 642725903
Product: Aldehyde Dehydrogenase
Protein accession: YP_002016577
Protein GI: 194334717
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
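
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2099629
END_BP = 2101056


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the final codon is the stop, not a residue


length = gene_length(START_BP, END_BP)
print(length)                  # 1428, matching "Gene Length"
print(protein_length(length))  # 475, matching "Protein Length"
```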


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAAACC ACACAGTATG CAATAGCGGT GCCGCAGTTC AAACAGAGGC TGAAATCGCT 
GAACTCTGCG TGCATCTTCG CTCCGTTTTT CAAGAGCGAC GTACGTCCGG CTATGCCTGG
CGAAAAGAGC AGCTTATGCA GCTGCAGCGG TTTCTTGAGG AGCGTGAAGA GGATATTCTT
CAAGCGCTGC ATGAGGATTT CCGCAAGCCG CAGACTGAAA CCTGGTTTAC CGAAATTCAC
TATCTCTTAA CAGAGATTAC CGTTGCACTC AGGCATCTGC GACGCTGGAT GAAGCCCGTA
ACGGTTCACA CCCCTCTTCG CTACCAGCCA GGGCGGAGCT ATTACATTTG CGAGCCCTGC
GGTGTCGTGC TCAATATCGC AGCCTGGAAC TATCCGCTGC AGCTCAGCCT TGCACCTGCG
GTTGCCGCGA TTGCCGCGGG GAACTGCCTG GTGATCAAGC CGTCGGAAAT GGCCCCGGCA
ACAGCAGCCC TGCTTTCCGA TGGGCTCAAG GATTATCTCG ACAGCGACGC CATCAGGGTC
GTTCAGGGCG GGGCAGAGGT AACGGCATCC CTTCTGACAC ATCGTTTTGA TCATGTCTTC
TTCACCGGCA GCCAACAGGT TGGTCGGCTT GTTCTGTCGG CAGCGTCAAG GCACTTGACA
CCGGTGACGC TCGAACTCGG AGGGAAAAGT CCCTGTATTG TTGACAAGGG AACCAATATC
GATGTTGCGG CACGCAGGAT TGTCTGGGCG AAATATATCA ATGCCGGTCA GACATGCATT
TCTCCTGATT ATGTCCTTGT CCAGAGCGCT ATCCGCCAGG ATCTGCTCGA TGCGCTGCAG
CGGGCTATAG ATGCCATGTA TGGACCGGGA TCGAGAGAAC GCGGAGCGTA TGCCGGCATT
ATCAGCGAAG GTCACGTCAG GCGGTTGCAG GAGCTCATGA AGGGAGGAAG CATTGTTTGC
GGGGGAGGGA GTGATGCGCA AAGCCGCTAT GTGGAGCCGA CTATTCTCAC CGACGTTTCT
CTCTCATCAC CCCTGATGCA GGAGGAGATT TTCGGCCCGC TGCTTCCCGT GATTGCCTAC
GATACACCGG AAGAAGCCGT TGCGGTCGTC CGTTCAGCAG GAGATCCTCT TGCGCTCTAT
ATTTTTTCGC CTCAACGTCA TGTTTACGAA TATTTTATGG GGCATATCCA TTCGGGCGGG
GTCTGTATCA ACGATCTGCT TTTTCAAGCT GCTATCCCGG CGCTTCCGTT CGGTGGAAGA
GGTACAAGCG GCATTGGGCG CTATCACGGT CGTTCAGGTT TTGAAACCTT TTCACGGCTT
CGCAGTGTTC ATCGTAAAGG GACTTTTCCT GAAAACGCTC TTCGCTATCC TCCGTTCGGG
TCACTGAAGT TCAAACTGTT ACAGCAACTT TTCAAACGTT TTCACTGA
 
Protein sequence
MKNHTVCNSG AAVQTEAEIA ELCVHLRSVF QERRTSGYAW RKEQLMQLQR FLEEREEDIL 
QALHEDFRKP QTETWFTEIH YLLTEITVAL RHLRRWMKPV TVHTPLRYQP GRSYYICEPC
GVVLNIAAWN YPLQLSLAPA VAAIAAGNCL VIKPSEMAPA TAALLSDGLK DYLDSDAIRV
VQGGAEVTAS LLTHRFDHVF FTGSQQVGRL VLSAASRHLT PVTLELGGKS PCIVDKGTNI
DVAARRIVWA KYINAGQTCI SPDYVLVQSA IRQDLLDALQ RAIDAMYGPG SRERGAYAGI
ISEGHVRRLQ ELMKGGSIVC GGGSDAQSRY VEPTILTDVS LSSPLMQEEI FGPLLPVIAY
DTPEEAVAVV RSAGDPLALY IFSPQRHVYE YFMGHIHSGG VCINDLLFQA AIPALPFGGR
GTSGIGRYHG RSGFETFSRL RSVHRKGTFP ENALRYPPFG SLKFKLLQQL FKRFH
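
Both sequences above are printed in space-separated groups of 10, so they must be joined before any length or codon check. The following is a minimal sketch of that clean-up, not part of the record itself. The helper names are hypothetical, and the example string is the last line of the gene sequence as shown above. Translation table 11 uses TAA, TAG and TGA as stop codons. It also permits TTG as an alternative start codon, which fits a gene sequence starting with TTG and a protein starting with M.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Join a block printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def ends_with_stop(cds: str) -> bool:
    """True if the last three bases of the cleaned CDS form a stop codon."""
    return cds[-3:] in STOP_CODONS


# Last line of the gene sequence, copied from the record above.
last_line = "TCACTGAAGT TCAAACTGTT ACAGCAACTT TTCAAACGTT TTCACTGA"
tail = clean_sequence(last_line)
print(len(tail))             # 48
print(ends_with_stop(tail))  # True
```

Applied to the whole gene sequence block, `len(clean_sequence(...))` should equal the Gene Length field (1428) if the record is internally consistent.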