Gene Rsph17025_0213 details


Gene Information

Locus tag: Rsph17025_0213
Symbol:
ID: 5082152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 206007
End bp: 207485
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 67%
IMG OID: 640481768
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_001166428
Protein GI: 146276269
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
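
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 206007
END_BP = 207485
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the span covers 1479 bp, which is 493 codons: 492 residues plus one stop codon.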


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.786998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.906241
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACT CCGTCACCGA CCTGCGTTCG ATACTCAAGG ATCCGTCGCT GCTCGAAACC 
CGCGCCTTCG TGGCGGGCGA GTGGGTGGCG GCCGATGACG GTGCCAGCTT CGAGGTAACG
AACCCCGCCC GCGGCGACGT GATCTGCACC ATCCCGGACC TCGGCCGGGC CGAGACCGCG
CGTGCCATCG CCGCCGCGGC CGAGGCGATG AAGGACTGGG CCGCCCGCAC CGGCAAGGAA
CGCGCGCAGG TGATGCGCAA GTGGTTCGAC CTGATGATGG CCAACCAGGA CGACCTCGGC
GCGATCCTGA CGGCCGAGAT GGGCAAGCCC CTCGCCGAGG CCAAGGGCGA GATCGCCTAT
GGCGCCTCCT TCATCGAATG GTTCGGCGAA GAGGCCAAGC GCGTCTATGG CGAGACCATC
CCCGGCCACA TGCGCGACAA GCGCATCACC GTGCTCAAGC AGCCGATCGG CGTCGTGGGC
TCGATCACGC CCTGGAACTT CCCGAACGCG ATGATCACCC GCAAATGCGG CCCGGCGCTG
GCCGCGGGCT GCGGCTTCGT CGCCCGCCCC GCCGCCGAAA CCCCGCTGTC GGCGCTGGCG
CTGGCGGTGC TGGGCGAGCG CGCGGGGCTG CCGAAGGGCA TCCTGTCGGT CATCACCTCG
AGGAAATCCT CGGACATCGG CAAGGAGTTC TGCGAGAATC CGGCCGTCCG CAAGCTGACC
TTCACGGGCT CGACCGAGGT GGGGCGCATC CTGCTCAGGC AGGCGGCGGA TCAGGTGATG
AAATGCTCGA TGGAGCTGGG CGGCAACGCG CCCTTCATCG TTTTCGACGA CGCCGACCTC
GACGCGGCGG TGCAGGGCGC CATGGCGTCG AAGTTCCGCA ACAACGGCCA GACCTGCGTC
TGCGCGAACC GCATCTATGT GCAGGCCGGC GTCCATGACG CCTTCGCCGA GAAGCTGGCG
GCGGCGGTGA AGAAGCTGCG CGTCGGCGAC GGGCTGGTGG AGGGGACCGA GGCCGGCCCG
CTCATCAACG AGAAGGCGGT CGAGAAGGTC GAGGAACATA TCCGCGACGT GCTCGACGGC
GGCGGCGAGG TGTTGACCGG CGGCAAGCGG CACGAGCTGG GCGGCACCTT CTTCGAGCCG
ACCGTCGTGA CCGGCGTCAC GCAGGAGATG AAGGTCTCAA ACGAAGAGAC CTTCGGGCCG
CTCGCGCCGC TCTTCCGGTT CGACACCGAA GAGGAGGTGA TCGGCTATGC GAACGACACG
ATCTTCGGCC TTGCCTCCTA CTTCTACGCT CGCGACGTGG GCCGCATCAC CCGCGTCCAG
GAGGCGCTGG AATATGGCAT CGTCGGCGTC AACACCGGCA TCATCTCGAC CGAGGTCGCA
CCCTTCGGCG GGGTGAAGCA GTCCGGCCTC GGCCGCGAGG GCTCGCGGCA CGGCATCGAG
GATTATCTCG AGATGAAATA CATCTGCCTC TCGATCTGA
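
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the record. The helper names are hypothetical, and only the first displayed row is used as sample input, so the printed percentage describes that row rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First displayed row of the gene sequence, copied with its block spacing.
first_row = "ATGCTCGACT CCGTCACCGA CCTGCGTTCG ATACTCAAGG ATCCGTCGCT GCTCGAAACC"

row = clean_sequence(first_row)
print(len(row))                   # 60 bases in one full display row
print(round(gc_percent(row), 1))  # GC percentage of this row only
```

Pasting the full sequence into `first_row` instead would allow a comparison against the reported gene length and GC content.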
 
Protein sequence
MLDSVTDLRS ILKDPSLLET RAFVAGEWVA ADDGASFEVT NPARGDVICT IPDLGRAETA 
RAIAAAAEAM KDWAARTGKE RAQVMRKWFD LMMANQDDLG AILTAEMGKP LAEAKGEIAY
GASFIEWFGE EAKRVYGETI PGHMRDKRIT VLKQPIGVVG SITPWNFPNA MITRKCGPAL
AAGCGFVARP AAETPLSALA LAVLGERAGL PKGILSVITS RKSSDIGKEF CENPAVRKLT
FTGSTEVGRI LLRQAADQVM KCSMELGGNA PFIVFDDADL DAAVQGAMAS KFRNNGQTCV
CANRIYVQAG VHDAFAEKLA AAVKKLRVGD GLVEGTEAGP LINEKAVEKV EEHIRDVLDG
GGEVLTGGKR HELGGTFFEP TVVTGVTQEM KVSNEETFGP LAPLFRFDTE EEVIGYANDT
IFGLASYFYA RDVGRITRVQ EALEYGIVGV NTGIISTEVA PFGGVKQSGL GREGSRHGIE
DYLEMKYICL SI
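
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a minimal sketch, the code below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not handled here), and translates the first and last codons of the gene. The function name and the simplified handling of start codons are assumptions for illustration, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon; '*' marks a stop codon."""
    sequence = "".join(dna.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[sequence[i:i + 3]] for i in range(0, len(sequence), 3)
    )


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCTCGACT CCGTCACCGA CCTGCGTTCG"))  # MLDSVTDLRS
print(translate("TCGATCTGA"))                          # SI*
```

The two outputs match the start (MLDSVTDLRS) and the end (SI, followed by the stop codon) of the protein sequence.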