Gene RPC_0454 details


Gene Information

Locus tag: RPC_0454
Symbol:
ID: 3970216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 489560
End bp: 491053
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 65%
IMG OID: 637923570
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_530348
Protein GI: 90421978
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
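
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 489560
end_bp = 491053
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
expected_aa = span_bp // 3 - 1

print(span_bp == gene_length_bp)         # coordinate span matches Gene Length
print(span_bp % 3 == 0)                  # length is a whole number of codons
print(expected_aa == protein_length_aa)  # codon count matches Protein Length
```

Under these assumptions all three checks print `True`: the span is 1494 bp, which is 498 codons, or 497 residues plus one stop codon.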


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACCA CCGCTGCCGC CCGTTCCCAA TCCACTGCGC TTTCGCTGCG CGATCGTCTG 
AAGCATCCCG CGCTGTTGCG CGAGGCCTGC TACATCGACG GGCAATGGAC CGGCACGCCT
GAGACCGTTG TCAGCAACCC GGTCAACGAC CTCGAACTCG GCCGGGTGCC GAAGCTCGGA
GCGACCGAAG CCACGCAGGC GGTGGAAGCG GCGCAACGTG CGTTTCCGGC CTGGGCTAAA
CTCACCGCCA AGCAGCGCTC CAACATCATG CGCAAATGGT ACGAACTGAT CGTCGCCAAC
CGCGAAGATC TGGCGCTGAT CCTCACTTCC GAACAAGGCA AGCCGCTGAC CGAAGCACTC
GGCGAAGTCG ACATCGGCGC CGCCTATGTG GAGTTCTTCG CCGAGGAAGC CCGCAGGGTT
TATGGCGAGA CCATTCCGAC GCAGCGGCCG GATGCCCGGC TGATCGCCAT CAAGCAGCCG
ATCGGGGTGT GCGGCGCGAT CACGCCGTGG AATTTTCCCA ATTCGATGAT CACCCGCAAG
GTGTCGCCGG CGCTCGCCGC CGGCTGCACC GTGGTACTGA AGCCCGCCAA CGAGACGCCG
TTCTCGGCGC TGGCGCTCGC CGCCTTGGCG GAGCAGGCCG GACTGCCGAA CGGCGTGTTC
AACATCGTCA CCGGCCACGC CTCGGCGATC GGCAAGGTGT TGTGCGAGCA TCCGGCGGTG
CGCTTCGTCG GCTTCACCGG CTCCACCGAA GTCGGCAAGA TCCTGTATCA GCAGGCCGCG
GTGGGCGTGA AGAAGCTCGG GCTCGAGCTC GGCGGCAACG CGCCGTTCAT CGTGTTCGAC
GACGCCGATG TCGATGCCGC GGTGGACGGC GCGATGGTGT CGAAATATCG CAACATGGGC
CAGACCTGCG TCTGCGCCAA CCGGATCTAC GTCCAGGACG GCGTCTATGA CGCCTTTGTC
GAGAAACTCG CCGCCAAGGT CGGCGCCATG ACAATCGGCG ACGGCACCGA GCCCGGCGTC
ACCCAAGGCC CGCTGATCAA TCAGGCCGCG GTGGAGAAGA CCGAGCGCCA CATCGCCGAC
GCCGTTGCCA ACGGCGCCAC CATCGTGATC GGCGGCAAGC GCCATGCGCG CGGCGGCACG
TTCTTCGAGC CGACCGTGCT CGCCAACGTC AAGCCCGACG CGCTGGTGGC GCATGAGGAA
ACTTTTGGCC CGCTGGCGCC GGTGTTCCGC TTCAAAACCG AAGAGGAAGT GATCAAGCTC
GCCAACGACT CGCCTTTCGG GCTCGCCTCC TACTTCTACG CCCGCGATCT CGGCCGGGTG
TGGCGCGTCG CTGAAGCGCT GGAGGCCGGC ATGGTCGGCG TCAATTCCGG GCTGATCACC
ACCGAAGTGG CGCCGTTCGG CGGCGTCAAG GAAAGTGGCC TCGGCCGCGA AGGCTCGCAT
CACGGCATGG AGGACTATGT CGAGATCAAA TACGTGATGA TGGCGGGGAT TTGA
 
Protein sequence
MSTTAAARSQ STALSLRDRL KHPALLREAC YIDGQWTGTP ETVVSNPVND LELGRVPKLG 
ATEATQAVEA AQRAFPAWAK LTAKQRSNIM RKWYELIVAN REDLALILTS EQGKPLTEAL
GEVDIGAAYV EFFAEEARRV YGETIPTQRP DARLIAIKQP IGVCGAITPW NFPNSMITRK
VSPALAAGCT VVLKPANETP FSALALAALA EQAGLPNGVF NIVTGHASAI GKVLCEHPAV
RFVGFTGSTE VGKILYQQAA VGVKKLGLEL GGNAPFIVFD DADVDAAVDG AMVSKYRNMG
QTCVCANRIY VQDGVYDAFV EKLAAKVGAM TIGDGTEPGV TQGPLINQAA VEKTERHIAD
AVANGATIVI GGKRHARGGT FFEPTVLANV KPDALVAHEE TFGPLAPVFR FKTEEEVIKL
ANDSPFGLAS YFYARDLGRV WRVAEALEAG MVGVNSGLIT TEVAPFGGVK ESGLGREGSH
HGMEDYVEIK YVMMAGI
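
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool the database uses. It assumes the sense-strand sequence is given 5' to 3' as printed above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced here.

```python
# Codon table built in the conventional TCAG order; amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGTCGACCA CCGCTGCCGC CCGTTCCCAA"
print(translate(first_groups))  # MSTTAAARSQ
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.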