Gene RPC_3493 details

Gene Information

Locus tag: RPC_3493
Symbol:
ID: 3972860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3880600
End bp: 3882051
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 68%
IMG OID: 637926605
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_533352
Protein GI: 90424982
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
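
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which agrees with the reported gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3880600
END_BP = 3882051


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1452 483
```

Both results match the Gene Length (1452 bp) and Protein Length (483 aa) fields.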


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.348419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGC CGAGTGATCC GGGCCTGTTG GCCGAGCGCT GCCTGATCGG CGGCGAGTGG 
AGCGGTGAGC CGGTCGACGC CGTGTTCAAC CCGGCCACCG GCGAACGCAT CGGCGCGGTG
CCGCGGTTCG GCGCCGACGA AGCCGAGGAT GCGGTGCATG CCGCGGGCGT GGCGTTCCGG
CAATGGTCGA AGCTGCTGGC CAAGCAACGC GCCGCACTGC TGCGCGTCTG GTTCGACCTG
ATCATCGCGC ACCGCGACGA GCTGGCGCGG CTGTTGACCG CGGAGCAGGG CAAGCCATTG
GCCGAAGCGC TGGGCGAAAT CGATTACGCC GCGTCGTTCG TGGAGTTCTA CGCCGAGGAG
GCGCGGCGCA TCTACGGTGA AACCATCCCG TCGCACCGCG CCGATTCCCG CACGCTGGTG
ATCCGGCAGC CGGTCGGCGT GGTCGCCGCG ATCACGCCGT GGAATTTTCC CGCCGCGATG
ATCACCCGCA AGGTCGCGCC GGCGCTCGCC GCCGGCTGCA CCGTGGTGGT CAAGCCGGCG
CCGGAGACGC CGTTCACCGC GCTGGCGCTC GGCGTGCTGG CGCAGCGCGC CGGAATTCCG
CCGGGCGTGA TCAACATCAT CACCGGCGAC GCGCCGGCGA TCGGCCGTGT GTTGACCGAG
CATCCTTTGG TGCGCGCGAT CAGCTTCACC GGCTCGACCG CGGTCGGCAA GATTTTGATG
CGGCAGGCCG CGTCCACCGT GAAGCGAGTC GGGCTCGAAC TCGGCGGCAA CGCGCCGTTC
ATCGTGTTCG ACGATGCCGA TCTCGACGCC GCGGTCGAAG GCGTGCTGGT GTCGAAGTTT
CGCAACATGG GACAGACCTG CGTTTGCGCC AATCGAATCT ACGCGCAGGA CAGCATCTAC
GACGCCTTCG TGCAGAAATT GACCGAAAAG GTCGCGGCGC TGAAGGTTGG CAACGGCCTC
GAGGCCGGCG TCACCCAGGG TCCGCTGATC AACAAGCAGG CGGTCGACAA GGTCGAGCGG
CACATCGCCA ATGCCACCGC CAACGGCGCC AAGGTGCTGT TCGGCGGCAA GCGGCACGCA
CTCGGCCGCA CCTTCTTCGA GCCGACGGTT CTATCGGGCG TCACCACCGA CATGGTGATC
ACCCACGAGG AAACCTTCGG CCCGGTGGCG CCGGTGTATC GCTTCAGCGA TGAGGCCGAC
GTGATCGCCA AGGCCAACGC CTCGCCGTTC GGGCTGGCGG CCTATTTCTA CGCCCGCGAT
CTCGGCCGCG TATTCCGGGT CGCCGAGGCG CTGGAGGCCG GCATGGTCGG CGTCAACTCG
GCGCTGCTCG GCGCCGACGT GGTGCCGTTC GGCGGCGTCA AGGAGTCGGG GCTGGGCCGC
GAAGGCTCGC ATCACGGCAT CGAGGAATAT GTCGACATCA AATACATCAT GCTCGGCGGG
CTTGACCGCT GA
 
Protein sequence
MLKPSDPGLL AERCLIGGEW SGEPVDAVFN PATGERIGAV PRFGADEAED AVHAAGVAFR 
QWSKLLAKQR AALLRVWFDL IIAHRDELAR LLTAEQGKPL AEALGEIDYA ASFVEFYAEE
ARRIYGETIP SHRADSRTLV IRQPVGVVAA ITPWNFPAAM ITRKVAPALA AGCTVVVKPA
PETPFTALAL GVLAQRAGIP PGVINIITGD APAIGRVLTE HPLVRAISFT GSTAVGKILM
RQAASTVKRV GLELGGNAPF IVFDDADLDA AVEGVLVSKF RNMGQTCVCA NRIYAQDSIY
DAFVQKLTEK VAALKVGNGL EAGVTQGPLI NKQAVDKVER HIANATANGA KVLFGGKRHA
LGRTFFEPTV LSGVTTDMVI THEETFGPVA PVYRFSDEAD VIAKANASPF GLAAYFYARD
LGRVFRVAEA LEAGMVGVNS ALLGADVVPF GGVKESGLGR EGSHHGIEEY VDIKYIMLGG
LDR
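
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a minimal sketch such as the following. It assumes the sequence is pasted as plain text with the spaces and line breaks shown in this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that does not matter here because this gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# which translation table 11 also uses for elongation codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("ATGCTGAAGC CGAGTGATCC G"))  # MLKPSDP
# Last 12 bases, including the TGA stop codon.
print(translate("CTTGACCGCT GA"))  # LDR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the reported GC content field.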