Gene Rpal_1399 details

Gene Information

Locus tag: Rpal_1399
Symbol:
ID: 6409056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1472331
End bp: 1473851
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 64%
IMG OID: 642711298
Product: Aldehyde Dehydrogenase
Protein accession: YP_001990414
Protein GI: 192289809
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
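
The coordinates and lengths listed above are consistent with each other. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1472331
end_bp = 1473851
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, span_bp == gene_length_bp)                # 1521 True
print(coding_codons, coding_codons == protein_length_aa)  # 506 True
```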


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.398004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCTCA TCACAACGAT GCATCCTGAC GCCAGCACAG TCTTCAAGGC CCGCTATGGC 
AACTTCATCG GCGGCCGCTG GGTTGCGCCG GTCGATGGCA GATACTTCGA CAACACCACG
CCGATCACCG GCGCAAAACT GACGGAAATT CCGCGTTCGC AGAAGGAAGA CGTCGAACTC
GCGCTTGATG CAGCGCACAC CGCGAGCGTC ACCTGGAGCA AGACCACGAC GACAGAGCGG
TCACTCATTC TCAATCGCAT CGCCGATCGG ATGGAAGCCA ATCTCGATCT GCTGGCGATC
GCCGAGACGC TCGACAACGG CAAGCCGATC CGCGAGACGC GGGCCGCCGA CCTGCCGCTG
GCGATCGATC ACTTCCGCTA TTTCGCAGGT GCACTGCGCG CGCAGGAAGG TTCGATCTCG
GAGATCGATC ACGACACCAT CGCGTATCAC TTCCACGAGC CGCTCGGCGT CGTCGGCCAG
ATCATTCCGT GGAACTTCCC GTTGCTGATG GCCGCCTGGA AGCTGGCGCC GGCACTCGCC
GCCGGCAACT GCGTGGTGCT GAAACCGGCC GAGCAGACCC CGGCCTCGAT CCTGGTATGG
GCCGAACTGG TCGGCGATCT GTTGCCTCCC GGCGTCGTCA ACGTCGTCAA CGGCTTCGGC
GTCGAAGCCG GCAAGCCGCT GGCGTCCAGC CCGCGCATCG CCAAGATCGC CTTCACCGGC
GAGACCTCGA CCGGGCGGCT GATCATGCAA TATGCAAGCG AGAATCTGGT GCCGGTGTCA
CTGGAACTCG GCGGCAAATC GCCGAACATT TTCTTCGGCG ACGTCACTGC CGAAGACGAC
CCATTCTTCG ACAAGGCGAT CGAAGGCTTC GTGATGTTCG CGCTCAATCA AGGCGAAGTC
TGCACCTGCC CGAGCCGCGC GCTGGTGCAG GAATCGATCT ACGATCGCTT CATGGAGCGG
GCGCTCGCCC GCGTCGCCGC GATCCGGCAG GGCGATCCGC GCGATCCGGC AACGATGATC
GGTGCGCAGG CGTCTCAAGA GCAGCTCGAC AAGATCCTGT CCTACATCGA CATCGGTCGG
CACGAAGGTG CCGAACTGCT GGCCGGCGGC GGGCGGGCGC AGCTGCCCGG CGATCTCGCC
GGTGGCTACT ACGTGCATCC GACCGTGTTC CGCGGCCACA ACCAGATGCG GATCTTCCAG
GAGGAGATCT TCGGCCCCGT GGTGTCGGTC ACCACCTTCA AGGACGAAGC CGAAGCGATC
GCCATCGCGA ACGACACCCA GTACGGACTC GGCGCAGGCG TCTGGACGCG GGACGGCACT
CGCGCCTATC GGTTCGGACG CGCCATCGCT GCGGGCCGGG TGTGGACCAA TTGCTACCAC
GCCTATCCGG CGCATGCCGC CTTCGGCGGC TACAAGCAGT CCGGCATCGG GCGTGAGACC
CACAAGATGA TGCTCGATCA CTATCAGCAC ACCAAGAACC TGCTGGTGAG CTACGGCACC
GGCCCACTCG GCTTCTTCTA G
 
Protein sequence
MNLITTMHPD ASTVFKARYG NFIGGRWVAP VDGRYFDNTT PITGAKLTEI PRSQKEDVEL 
ALDAAHTASV TWSKTTTTER SLILNRIADR MEANLDLLAI AETLDNGKPI RETRAADLPL
AIDHFRYFAG ALRAQEGSIS EIDHDTIAYH FHEPLGVVGQ IIPWNFPLLM AAWKLAPALA
AGNCVVLKPA EQTPASILVW AELVGDLLPP GVVNVVNGFG VEAGKPLASS PRIAKIAFTG
ETSTGRLIMQ YASENLVPVS LELGGKSPNI FFGDVTAEDD PFFDKAIEGF VMFALNQGEV
CTCPSRALVQ ESIYDRFMER ALARVAAIRQ GDPRDPATMI GAQASQEQLD KILSYIDIGR
HEGAELLAGG GRAQLPGDLA GGYYVHPTVF RGHNQMRIFQ EEIFGPVVSV TTFKDEAEAI
AIANDTQYGL GAGVWTRDGT RAYRFGRAIA AGRVWTNCYH AYPAHAAFGG YKQSGIGRET
HKMMLDHYQH TKNLLVSYGT GPLGFF
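
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch that checks this for the first 60 and the last 21 bases only, not the full gene. It relies on the fact that table 11 uses the same codon to amino acid assignments as the standard genetic code. The function and variable names are illustrative and not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First line of the gene sequence (60 bases) and its final 21 bases.
first_60 = "ATGAACCTCA TCACAACGAT GCATCCTGAC GCCAGCACAG TCTTCAAGGC CCGCTATGGC"
last_21 = "GGCCCACTCG GCTTCTTCTA G"

print(translate(first_60))  # MNLITTMHPDASTVFKARYG
print(translate(last_21))   # GPLGFF*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final six residues followed by the stop codon.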