Gene Rsph17029_2672 details


Gene Information

Locus tag: Rsph17029_2672
Symbol:
ID: 4897069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2817736
End bp: 2819214
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 68%
IMG OID: 640113273
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_001044546
Protein GI: 126463432
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
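
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function name check_lengths is hypothetical and not part of any database API.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates, gene length and protein length agree.

    Assumes inclusive coordinates and one terminal stop codon that is
    not counted as an amino acid.
    """
    span = end_bp - start_bp + 1      # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                 # a CDS should be a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa   # subtract the stop codon


# Values copied from the record above.
span = 2819214 - 2817736 + 1          # 1479 bp
consistent = check_lengths(2817736, 2819214, 1479, 492)
print(span, consistent)               # 1479 True
```

Here 1479 bp corresponds to 493 codons, and removing the stop codon leaves the 492 residues listed as the protein length.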


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.582581
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACT CCGTCACCGA CCTGCGCTCG CTGCTCAAGG ACCCGTCGCT GCTCGAAACC 
CGCGCCTTCG TGGCCGGCGA GTGGGTCGAT GCCGACGACG GCGCAACCTT CGAGGTGACG
AACCCCGCCC GCGGAGACGT GATCTGCACG GTGCCCGACC TCGGCCGGGC CGAAACGGCG
CGCGCCATCG CCGCCGCCGA GGAGGCCATG AAGGAGTGGG CCGCCCGCAC CGGCAAGGAG
CGCGCGCAGG TCATGCGCAA GTGGTTCGAC CTGATGATGG CGAACCAGGA CGACCTCGGC
GCGATCCTCA CCGCCGAGAT GGGCAAGCCG CTCGCCGAGG CGAAGGGCGA GATCGCCTAC
GGCGCCTCCT TCATCGAATG GTTCGGCGAA GAGGCCAAGC GCATCTATGG CGAGACCATC
CCCGGCCACA TGCGCGACAA GCGCATCACG GTTCTGAAGC AGCCGATCGG CGTGGTGGGC
TCGATCACGC CGTGGAACTT CCCGAACGCC ATGATCACCC GCAAATGCGG GCCCGCGCTG
GCGGCGGGCT GCGGCTTCGT CGCCCGTCCG GCGGCCGAGA CGCCGCTCTC GGCGCTGGCG
CTCGCGGTTC TGGGCGAGCG GGCCGGGCTG CCCAAGGGCA TCCTCTCGGT CATCACCTCG
AGCCGCTCCT CGGACATCGG CAAGGAGATG TGCGAGAACC CGATCGTCCG CAAGCTCACC
TTCACCGGCT CGACCGAGGT GGGCCGCATC CTGCTGCGGC AGGCGGCCGA TCAGGTGATG
AAATGCTCGA TGGAGCTCGG CGGCAACGCG CCCTTCATCG TCTTCGACGA TGCCGATCTC
GACGCCGCGG TGCAGGGCGC CATGGCCTCG AAGTTCCGCA ACAACGGCCA GACCTGCGTC
TGCGCGAACC GGATCTACGT CCAGTCGGGC GTCTATGACG CCTTCGCCGA AAAGCTCGCC
GCCGCCGTGA AGAAGCTGAA GGTGGGCGAC GGGCTCGTCG AGGGCACCGA GGCCGGCCCG
CTCATCAACG AGAAGGCGGT GGCCAAGGTC GAGGAACATA TCCGCGACGT GCTCGACGGC
GGCGGTCAGG TGCTGACCGG CGGCAAGCGC CACGCGCTCG GCGGCACCTT CTTCGAGCCG
ACGGTCGTGA CCGGCGTGAA GCAGGAGATG AAGGTTTCGA CGGAAGAGAC CTTCGGCCCG
CTCGCCCCTC TCTTCCGCTT CGAGACCGAG GAAGAGGCGG TGGGCTACGC CAACGACACG
ATCTTCGGCC TCGCCTCCTA CTTCTATGCG CGCGACGTGG GCCGCATCAC CCGCGTGCAG
GAGGCGCTGG AATATGGCAT CGTCGGCGTG AACACCGGCA TCATCTCGAC CGAGGTGGCC
CCCTTCGGCG GCGTGAAGCA ATCCGGCCTC GGCCGCGAGG GCTCGCGCCA CGGGATCGAG
GATTACCTCG AGATGAAATA CATCTGCCTC TCGATCTGA
 
Protein sequence
MLDSVTDLRS LLKDPSLLET RAFVAGEWVD ADDGATFEVT NPARGDVICT VPDLGRAETA 
RAIAAAEEAM KEWAARTGKE RAQVMRKWFD LMMANQDDLG AILTAEMGKP LAEAKGEIAY
GASFIEWFGE EAKRIYGETI PGHMRDKRIT VLKQPIGVVG SITPWNFPNA MITRKCGPAL
AAGCGFVARP AAETPLSALA LAVLGERAGL PKGILSVITS SRSSDIGKEM CENPIVRKLT
FTGSTEVGRI LLRQAADQVM KCSMELGGNA PFIVFDDADL DAAVQGAMAS KFRNNGQTCV
CANRIYVQSG VYDAFAEKLA AAVKKLKVGD GLVEGTEAGP LINEKAVAKV EEHIRDVLDG
GGQVLTGGKR HALGGTFFEP TVVTGVKQEM KVSTEETFGP LAPLFRFETE EEAVGYANDT
IFGLASYFYA RDVGRITRVQ EALEYGIVGV NTGIISTEVA PFGGVKQSGL GREGSRHGIE
DYLEMKYICL SI
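
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, assuming the amino-acid assignments of translation table 11 (which are the same as those of the standard code; table 11 differs only in the start codons it allows). The helper names clean, translate and gc_percent are hypothetical, and the two sequence fragments are the first 30 bases and the last line of the gene sequence as listed in this record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would be read here as
    V or L, not M; that special case is not handled in this sketch.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_bases = "ATGCTCGACT CCGTCACCGA CCTGCGCTCG"
last_line = "GATTACCTCG AGATGAAATA CATCTGCCTC TCGATCTGA"
print(translate(first_bases))   # MLDSVTDLRS
print(translate(last_line))     # DYLEMKYICLSI
```

Both outputs match the start and end of the protein sequence listed above. Passing the complete gene sequence to gc_percent gives the value that can be compared with the GC content field.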