Gene RPB_3136 details


Gene Information

Locus tag: RPB_3136
Symbol:
ID: 3910937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3584160
End bp: 3585611
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 67%
IMG OID: 637885038
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_486743
Protein GI: 86750247
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
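
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3584160
end_bp = 3585611
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as an amino acid.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # gene length is consistent
print(computed_protein_length == protein_length_aa)  # protein length is consistent
```

Under these assumptions both comparisons hold: 3585611 - 3584160 + 1 = 1452 bp, and 1452 / 3 - 1 = 483 aa.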


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.362143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATGC CGAGTGATCC TGGCCTGTTG GCCGAGCGCT GCCTGATTGG CGGCAAATGG 
TGTGGTGAGC CGGTCGATGC TGTGTTCAAC CCGGCGACGG GCGAGCGCAT CGGCGCGGTG
CCGCACTTCG GTGCGGACGA GGCCGACGAG GCGGTTCGCG CCGCCGGGCT TGCATTCCGG
CAGTGGTCGA AGCTGCTGGC GAAACAGCGC GCGGCGATGC TGCGCGGCTG GTTCGATCAG
ATCATCGCCC ATCGCGACGA TCTCGCACGG CTGCTGACGT CGGAGCAGGG CAAGCCGCTC
GCCGAGGCAC TGGGTGAAAT CGACTATGCG GCATCGTTCG TCGAGTTCTA TGCCGAAGAG
GCGCGCCGCA TTTACGGCGA GACGATTCCG GCGCACCGCG CCGATTCCAG AACCATGGTG
ATCCGTCAGC CGGTCGGCGT CGTCGCGGCG ATCACGCCGT GGAATTTTCC GGCCGCGATG
ATCACTCGCA AAGTGGCGCC GGCGCTCGCC GCCGGCTGCA CGGTGGTGAT CAAGCCGGCG
CCGGAAACGC CGCTGACGGC GCTTGCACTG GGCGTGCTGG CGCAGCGCGC AGGGTTTCCG
GCAGGCGTGC TCAACATCAT CACCGGCGAC GCGCCTGCGA TCGGCAAGGC GTGGACCGAG
CATCCGACGG TGCGGGCGAT CAGCTTCACC GGCTCCACCG AGGTCGGCAA GATCCTGATG
CGGCAGGCCG CGTCGACCGT GAAGAAGGTC GGGCTCGAGC TCGGCGGCAA TGCGCCGTTC
ATCGTGTTCG ACGATGCAGA TCTCGACGCC GCGGTCGACG GCGTGATCGC CTCGAAGTTT
CGCAACATGG GGCAGACCTG CGTCTGTGCC AACCGGATCT ACGCGCAGAA CGGGATCTAC
GACGCCTTCG TCGAGAAGCT CGCCGCCAAG GTTGCCACGT TGCGGGTCGG CAACGGACTC
GACGACGGCG TCACTCAGGG GCCGCTGATC ACCAAGGAGG CGGTGGCCAA GGTCGAGCAT
CATCTCGCCG ACGCCGTCGC CAAGGGGGCC AAGATCGTGC TCGGCGGCAA GCGTCATGCG
CTCGGACAAA CTTTCTTCGA GCCGACCGTC GTCACCGGCG TCACCGCCGA CATGGTCGTC
ACGCGCGAGG AGACCTTCGG TCCGGTCGCT CCGGTGTATC GCTTCCGCGG CGAAGCCGAC
GTGATCGCGC AGGCCAACGA CTCGCCGTTC GGACTGGCGG CGTATTTTTA CGCCCGCGAC
CTCGGCCGGG TGTTCAGGGT CGCCGAAGCG CTGGAGTCCG GGATGGTCGG CGTCAATTCG
GCGCTGCTCG GAGCCGACGT GGTGCCGTTC GGCGGCGTCA AGGAATCCGG CCTCGGCCGC
GAAGGCTCGC GCCACGGCAT CGAGGAATAT GTCGAGACCA AATACATCCT GCTCGGCGGT
CTCGATCGCT GA
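
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. As a rough sketch, the hypothetical helper below (`gc_percent` is an illustrative name, not part of the record) computes the GC percentage of such a block-formatted string. Applied to the full sequence, the result can be compared with the GC content field; the example only uses the first line.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "ATGCTGATGC CGAGTGATCC TGGCCTGTTG GCCGAGCGCT GCCTGATTGG CGGCAAATGG"
n_bases = len("".join(first_line.split()))
print(f"{gc_percent(first_line):.1f}% GC over {n_bases} bases")
```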
 
Protein sequence
MLMPSDPGLL AERCLIGGKW CGEPVDAVFN PATGERIGAV PHFGADEADE AVRAAGLAFR 
QWSKLLAKQR AAMLRGWFDQ IIAHRDDLAR LLTSEQGKPL AEALGEIDYA ASFVEFYAEE
ARRIYGETIP AHRADSRTMV IRQPVGVVAA ITPWNFPAAM ITRKVAPALA AGCTVVIKPA
PETPLTALAL GVLAQRAGFP AGVLNIITGD APAIGKAWTE HPTVRAISFT GSTEVGKILM
RQAASTVKKV GLELGGNAPF IVFDDADLDA AVDGVIASKF RNMGQTCVCA NRIYAQNGIY
DAFVEKLAAK VATLRVGNGL DDGVTQGPLI TKEAVAKVEH HLADAVAKGA KIVLGGKRHA
LGQTFFEPTV VTGVTADMVV TREETFGPVA PVYRFRGEAD VIAQANDSPF GLAAYFYARD
LGRVFRVAEA LESGMVGVNS ALLGADVVPF GGVKESGLGR EGSRHGIEEY VETKYILLGG
LDR
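
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons) and that translation begins at the first base and stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence above.
print(translate("ATGCTGATGC CGAGTGATCC TGGCCTGTTG"))  # MLMPSDPGLL
print(translate("CTCGATCGCT GA"))                     # LDR
```

The two outputs match the first ten and last three residues of the protein sequence listed above. The final line can be translated on its own only because the 1440 bases before it are a multiple of three, so it starts in frame.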