Gene Rpal_4281 details

Gene Information

Locus tag: Rpal_4281
Symbol:
ID: 6411965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4607814
End bp: 4609355
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 67%
IMG OID: 642714163
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_001993252
Protein GI: 192292647
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
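
The coordinates and lengths in this record are mutually consistent, which a short arithmetic check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4607814, 4609355)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

Both values agree with the Gene Length and Protein Length fields of the record.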


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0364491
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATG CCGCCACGAA CTCTTCCGGA TTTCAGGCCA ATCTCGACCG TGCCGCGCCA 
CTGCTGAAGC AGCTCAAGGC CGATGGCATC GGCCATCTGA TCGGCGGCGA GATCGTCGCA
GCCTCATCCG GCGAAGTGTT CGAGACGGCC TCGCCGATCG ACAATTCGGT GCTGGCGCAG
GTGGCGCGCG GCACGGCGGC GGACATCGAC CGCGCTGCCA AGGCCGCTAA GGCTGCGTTT
CCGGCGTGGC GTGACATGGC GCCCGCCAAG CGGCGCAAAC TGCTACACGC AATCGCGGAT
GCGATCGAGG CGCGCGCCGA CGACATCGCG GTGCTGGAAT GCACCGACAC CGGCCAGGCG
CATCGCTTCA TGGCCAAGGC CGCAATCCGC GCTGCCGAGA ACTTCCGGTT CTTCGCCGAC
AAATGCGCCG AGGCGCGCGA CGGGCTGAAC ACGCCGAGCG ACGAGCATTG GAACGTCTCG
ACCCGGGTGC CGATCGGACC GGTCGGCGTG ATCACGCCGT GGAATACGCC GTTCATGCTG
TCGACCTGGA AGATCGCCCC CGCGCTCGCG GCCGGCTGCA CCGTGGTGCA CAAGCCGGCC
GAGTGGTCGC CGGTGACCGC CGACATGCTG GCGCGCATCT GCAAGGACGC CGGGCTGCCA
GACGGCGTGC TCAACACTGT GCACGGTTTC GGCGAAGAGG CCGGCAAGGC GCTGACCGAA
CATCCGGCTA TCAAAGCGAT CGCCTTCGTC GGCGAAACCG CCACCGGCGC CGCCATCATG
GCGCAGGGCG CGCCGACGCT GAAGCGGGTG CATTTCGAGC TCGGCGGCAA GAACCCGGTG
ATCGTGTTTG GCGACGCCGA TCTCGATCGC GCGCTCGATG CCGTGGTGTT CATGATCTAC
TCGCTGAACG GCGAGCGCTG CACCTCGTCG AGCCGGCTCT TGGTACAGTC CTCGATTGCC
GACAGCTTCA TCGAGAAGCT CGCCGCCCGG GTGCGCGCCC TCAAGGTCGG CCACCCGCTC
GATCCGGCCA CCGAAGTCGG TCCGCTGATC CATCAGCGCC ATCTCGACAA GGTGTGTTCC
TATGTCGACG TCGCCCGAAA AGACGGCGCC ACCATCGCGG TCGGCGGCGC GCCCTTCGAC
GGCCCAGGCG GCGGCCACTA TGTGCAGCCG ACGCTGGTGA CCAATGCGCG CAGCGACATG
CAGGTGGCGC AGGATGAGGT GTTCGGACCT TTCCTCACCG TGATCCCGTT CAAGGACGAA
GCGGACGCTG TCCGTATCGC CAATGACGTC CGCTATGGCC TCACCGGCTA TGTCTGGACC
GCCGACATGG GCCGCGCCCT GCGCGTCGCC GATGCGCTCG AAGCCGGCAT GATCTGGCTG
AACTCGGAGA ACGTCCGCCA TCTGCCGACT CCGTTCGGCG GCATGAAGCA GTCCGGCATC
GGCCGCGACG GCGGCGACTA CTCGTTCGAG TTCTACATGG AGACCAAGCA CGTCTCGCTG
GCCCGCGGCA CGCACAAGAT TCAGAAGCTG GGGGCGGTGT AG
 
Protein sequence
MADAATNSSG FQANLDRAAP LLKQLKADGI GHLIGGEIVA ASSGEVFETA SPIDNSVLAQ 
VARGTAADID RAAKAAKAAF PAWRDMAPAK RRKLLHAIAD AIEARADDIA VLECTDTGQA
HRFMAKAAIR AAENFRFFAD KCAEARDGLN TPSDEHWNVS TRVPIGPVGV ITPWNTPFML
STWKIAPALA AGCTVVHKPA EWSPVTADML ARICKDAGLP DGVLNTVHGF GEEAGKALTE
HPAIKAIAFV GETATGAAIM AQGAPTLKRV HFELGGKNPV IVFGDADLDR ALDAVVFMIY
SLNGERCTSS SRLLVQSSIA DSFIEKLAAR VRALKVGHPL DPATEVGPLI HQRHLDKVCS
YVDVARKDGA TIAVGGAPFD GPGGGHYVQP TLVTNARSDM QVAQDEVFGP FLTVIPFKDE
ADAVRIANDV RYGLTGYVWT ADMGRALRVA DALEAGMIWL NSENVRHLPT PFGGMKQSGI
GRDGGDYSFE FYMETKHVSL ARGTHKIQKL GAV
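
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them, assuming the text has been copied into a Python string: strip the whitespace, then compute the GC percentage. The helper names are hypothetical. The input shown is only the first line of the gene sequence, so its GC value is not expected to match the whole-gene figure of 67%.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGGCTGATG CCGCCACGAA CTCTTCCGGA TTTCAGGCCA ATCTCGACCG TGCCGCGCCA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Applying the same two helpers to the full gene sequence would give the length and GC content to compare against the Gene Length and GC content fields.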