Gene Rsph17025_2311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2311 
Symbol 
ID5084961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2354614 
End bp2356065 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID640483874 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_001168505 
Protein GI146278346 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCCC AGCCCGCCGC CAGCCATTTC GTTGACGGCC GCCCGCTCGA GGATGCGGCC 
GGCGCGCCGA TCCCCGTGAT CCACCCGGCC AATGGCGAGG AGATCGCGCG CCTGCACGAG
GCCACGCCCG CCGTGATCGA GGCGGCGCTG GCGTCCGGCG CGCGGGCGCA GAAGGACTGG
GCCGCGCTGC GCCCCGTCGA GAGGGCCCGC ATCCTGCGCC GTGCCTCGGA CCTGATCCGC
GCGCGCAACG AGGAGCTGAG CATCCTCGAG ACGCTCGACA CCGGCAAGCC GCTGCAGGAG
ACCCTGGTGG CGGACTGGCC CTCGGGCGCG GACGCGCTGG AGTTCTTTGC GGGCCTCGCG
CCCGCGGTGA CGGGCGAGAC GGTCCCGCTG GGGCAGGATT TCGTCTACAC GATCCGCGAG
CCGCTGGGGC TCTGCGTGGG CATCGGGGCC TGGAACTACC CCAGCCAGAT CGCCTGCTGG
AAGGCGGCGC CGGCGCTGGC CCTCGGCAAT GCGATGGTGT TCAAGCCGTC CGAGGTGACG
CCGCTCGGCG CGCTGAAGCT GGCCGAGATC CTGATCGAGG CGGGCCTGCC GCCGGGGCTG
TTCAACGTGG TGCAGGGCCG GGGTGCGGTG GGCGCTGCGC TGGTGAGCGA CAGCCGGGTG
GCCAAGGTCT CGCTGACGGG CTCGGTGCCC ACGGGTCGGC GCGTCTATGC CGCTGCGGCC
GAGGGCGTGC GCCATGTCAC GATGGAACTG GGCGGCAAGT CGCCGCTGAT CGTCTTTGAC
GACGCCGATC TCGAAAGCGC CATCGGGGCG GCGATGCTGG GCAACTTCTA TTCGTCGGGC
CAGATCTGCT CGAACGGGAC GCGGGTCTTC GTGCAGAAGG GCATCAAGGA GGCGTTCCTC
GCGCGTCTGG CCGAGCGTGC CGATGCGATC CAAATGGGCG ATCCGCTCGA CCCCGAGGTG
CAGATGGGCC CGCTGGTGTC GCCGGCGCAG CTCGAGAAGG TGCTAAGCTA TATCGAGAAG
GCCCGCGCCG AGGGGGGGCG GCTGGTCTGC GGCGGCGAGG CTTCGGTCAG CCCCGGCTGT
TACGTCCAGC CCACGGTCTT TGCGGATGTG ACCGACGGCA TGACGCTCGC GCGCGAGGAG
GTCTTCGGGC CGGTGATGGC GGTCCTCGAC TTCGAGACCG AGGAAGAGGT GATCGCGCGG
GCGAACGCCA CCGACTTCGG GCTTGCCGCC GGGGTCTTCA CGGCCGATCT CACGCGGGCG
CACCGGGTGG TGGCGCAGCT GCAGGCCGGG ACCTGCTGGA TCAACGCCTA CAACCTGACG
CCGGTCGAGG CGCCCTTCGG CGGGGTCAAG ATGTCGGGCG TGGGCCGCGA GAACGGCCGC
GCGGCCGTCG AGCACTACAC GCAGGTGAAG TCGGTCTATG TCGGCATGGG GCCGGTGGAC
GCCCCCTACT GA
 
Protein sequence
MRAQPAASHF VDGRPLEDAA GAPIPVIHPA NGEEIARLHE ATPAVIEAAL ASGARAQKDW 
AALRPVERAR ILRRASDLIR ARNEELSILE TLDTGKPLQE TLVADWPSGA DALEFFAGLA
PAVTGETVPL GQDFVYTIRE PLGLCVGIGA WNYPSQIACW KAAPALALGN AMVFKPSEVT
PLGALKLAEI LIEAGLPPGL FNVVQGRGAV GAALVSDSRV AKVSLTGSVP TGRRVYAAAA
EGVRHVTMEL GGKSPLIVFD DADLESAIGA AMLGNFYSSG QICSNGTRVF VQKGIKEAFL
ARLAERADAI QMGDPLDPEV QMGPLVSPAQ LEKVLSYIEK ARAEGGRLVC GGEASVSPGC
YVQPTVFADV TDGMTLAREE VFGPVMAVLD FETEEEVIAR ANATDFGLAA GVFTADLTRA
HRVVAQLQAG TCWINAYNLT PVEAPFGGVK MSGVGRENGR AAVEHYTQVK SVYVGMGPVD
APY