Gene Information |
Locus tag | Rpal_4281 |
Symbol | |
ID | 6411965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4607814 |
End bp | 4609355 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714163 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001993252 |
Protein GI | 192292647 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
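The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record's own numbers agree with both assumptions, but the source does not state them. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Rpal_4281).
length = gene_length_bp(4607814, 4609355)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

With the record's values, 4609355 - 4607814 + 1 gives 1542 bp, and 1542 / 3 - 1 gives 513 aa, matching the Gene Length and Protein Length fields.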
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0364491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATG CCGCCACGAA CTCTTCCGGA TTTCAGGCCA ATCTCGACCG TGCCGCGCCA CTGCTGAAGC AGCTCAAGGC CGATGGCATC GGCCATCTGA TCGGCGGCGA GATCGTCGCA GCCTCATCCG GCGAAGTGTT CGAGACGGCC TCGCCGATCG ACAATTCGGT GCTGGCGCAG GTGGCGCGCG GCACGGCGGC GGACATCGAC CGCGCTGCCA AGGCCGCTAA GGCTGCGTTT CCGGCGTGGC GTGACATGGC GCCCGCCAAG CGGCGCAAAC TGCTACACGC AATCGCGGAT GCGATCGAGG CGCGCGCCGA CGACATCGCG GTGCTGGAAT GCACCGACAC CGGCCAGGCG CATCGCTTCA TGGCCAAGGC CGCAATCCGC GCTGCCGAGA ACTTCCGGTT CTTCGCCGAC AAATGCGCCG AGGCGCGCGA CGGGCTGAAC ACGCCGAGCG ACGAGCATTG GAACGTCTCG ACCCGGGTGC CGATCGGACC GGTCGGCGTG ATCACGCCGT GGAATACGCC GTTCATGCTG TCGACCTGGA AGATCGCCCC CGCGCTCGCG GCCGGCTGCA CCGTGGTGCA CAAGCCGGCC GAGTGGTCGC CGGTGACCGC CGACATGCTG GCGCGCATCT GCAAGGACGC CGGGCTGCCA GACGGCGTGC TCAACACTGT GCACGGTTTC GGCGAAGAGG CCGGCAAGGC GCTGACCGAA CATCCGGCTA TCAAAGCGAT CGCCTTCGTC GGCGAAACCG CCACCGGCGC CGCCATCATG GCGCAGGGCG CGCCGACGCT GAAGCGGGTG CATTTCGAGC TCGGCGGCAA GAACCCGGTG ATCGTGTTTG GCGACGCCGA TCTCGATCGC GCGCTCGATG CCGTGGTGTT CATGATCTAC TCGCTGAACG GCGAGCGCTG CACCTCGTCG AGCCGGCTCT TGGTACAGTC CTCGATTGCC GACAGCTTCA TCGAGAAGCT CGCCGCCCGG GTGCGCGCCC TCAAGGTCGG CCACCCGCTC GATCCGGCCA CCGAAGTCGG TCCGCTGATC CATCAGCGCC ATCTCGACAA GGTGTGTTCC TATGTCGACG TCGCCCGAAA AGACGGCGCC ACCATCGCGG TCGGCGGCGC GCCCTTCGAC GGCCCAGGCG GCGGCCACTA TGTGCAGCCG ACGCTGGTGA CCAATGCGCG CAGCGACATG CAGGTGGCGC AGGATGAGGT GTTCGGACCT TTCCTCACCG TGATCCCGTT CAAGGACGAA GCGGACGCTG TCCGTATCGC CAATGACGTC CGCTATGGCC TCACCGGCTA TGTCTGGACC GCCGACATGG GCCGCGCCCT GCGCGTCGCC GATGCGCTCG AAGCCGGCAT GATCTGGCTG AACTCGGAGA ACGTCCGCCA TCTGCCGACT CCGTTCGGCG GCATGAAGCA GTCCGGCATC GGCCGCGACG GCGGCGACTA CTCGTTCGAG TTCTACATGG AGACCAAGCA CGTCTCGCTG GCCCGCGGCA CGCACAAGAT TCAGAAGCTG GGGGCGGTGT AG
|
Protein sequence | MADAATNSSG FQANLDRAAP LLKQLKADGI GHLIGGEIVA ASSGEVFETA SPIDNSVLAQ VARGTAADID RAAKAAKAAF PAWRDMAPAK RRKLLHAIAD AIEARADDIA VLECTDTGQA HRFMAKAAIR AAENFRFFAD KCAEARDGLN TPSDEHWNVS TRVPIGPVGV ITPWNTPFML STWKIAPALA AGCTVVHKPA EWSPVTADML ARICKDAGLP DGVLNTVHGF GEEAGKALTE HPAIKAIAFV GETATGAAIM AQGAPTLKRV HFELGGKNPV IVFGDADLDR ALDAVVFMIY SLNGERCTSS SRLLVQSSIA DSFIEKLAAR VRALKVGHPL DPATEVGPLI HQRHLDKVCS YVDVARKDGA TIAVGGAPFD GPGGGHYVQP TLVTNARSDM QVAQDEVFGP FLTVIPFKDE ADAVRIANDV RYGLTGYVWT ADMGRALRVA DALEAGMIWL NSENVRHLPT PFGGMKQSGI GRDGGDYSFE FYMETKHVSL ARGTHKIQKL GAV
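The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how a reader might normalize the gene sequence, compute its GC fraction, and check that it looks like a complete coding sequence. The helper names are hypothetical, and the start-codon set is an assumption covering only the common bacterial start codons (ATG, GTG, TTG); translation table 11 permits a few others.

```python
# Common bacterial start codons (assumption: not the full table 11 set).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 with a start and a stop codon."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# First two and last two blocks of the gene sequence above, for illustration.
print(normalize("ATGGCTGATG CCGCCACGAA"))
print(looks_like_complete_cds("ATG GGGGCGGTGT AG"))
```

Pasting the full Gene sequence field into `gc_fraction` should give a value that can be compared against the GC content field, and `len(normalize(...))` can be compared against the Gene Length field.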