Gene RSP_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2183 
SymbolbetB 
ID3719653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp788659 
End bp790110 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID640070352 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_352236 
Protein GI77462732 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGCCC AGCCCGCCGC CAGCCATTTC GTCGACGGTC GTCCGCTCGA GGATGAGACC 
GGCGCGCCGA TCCCCGTGAT CTATCCCGCC ACCGGCGAGG AGATCGCCCG CCTTCACGAG
GCCACGCCCG CCGTGATCGA GGCGGCTTTG GCCTCGGGCG CCCGCGCGCA GGCGGCCTGG
GCCGCGATGC GGCCCGTCGA GCGGGCGCGG ATCCTGCGCC GGGCCTCGGA CCTGATCCGG
GCGCACAACG AGGAGCTGAG CCTTCTCGAG ACGCTCGACA CCGGCAAGCC GCTGCAGGAG
ACGCTGGTGG CCGACTGGGC CTCGGGGGCG GATGCGCTGG AATTCTTCGC CGGTCTGGCG
CCCGTCGTCA CCGGCGAGAC CGTGCCGCTG GGGCAGGATT TCGTCTATAC GATCCGCGAG
CCGCTGGGCC TCTGCGTGGG CATCGGCGCC TGGAACTACC CGAGCCAGAT CGCCTGCTGG
AAGGCCGCGC CCGCGCTCGC GCTCGGCAAT GCGATGGTGT TCAAACCCTC GGAAGTGACG
CCGCTCGGCG CGCTGAAGCT GGCCGAGATC CTGATCGAGG CGGGCCTGCC GCCCGGGCTC
TTCAACGTGG TGCAGGGCCG CGGCGCGGTG GGGGCGTCGC TCGTCACCGA CAGCCGGGTG
GCCAAGGTCT CGCTCACGGG CTCGGTGCCG ACGGGGCGGC GCGTCTATGC GGCTGCGGCC
GAGGGCGTGC GCCATGTCAC GATGGAGCTC GGCGGCAAGT CGCCCCTGAT CGTCTTCGAC
GATGCCGATC TGGAGAGCGC CATCGGCGCG GCGATGCTCG GCAACTTCTA TTCCGCGGGC
CAGATCTGCT CGAACGGGAC GCGGGTCTTC GTGCAGAAGG GGATCAAGGA AGCGTTCCTC
GCCCGGCTGG CCGAGCGGGC CGATGCCATC CGCATGGGCG ATCCGCTCGA CCCCGAGGTC
CAGATGGGGC CGCTCGTCTC GCAGGCGCAG CTCGAGAAGG TGCTGGCCTA TATCGAGAAG
GCCCGCGCCG AGGGCGGCCG CCTCGTCTGC GGCGGCGAGG CCTCGGTCAG CCCCGGCTGC
TATGTCCAGC CCACGGTCTT CGCCGATGTG ACGGACGCCA TGACCCTCGC CCGCGAGGAG
GTGTTCGGTC CGGTGATGGC GGTGCTCGAT TTCGAGACCG AGGAGGAGGC GATCGCGCGG
GCGAATGCCA CCGACTTCGG CCTCGCCGCG GGGGTCTTCA CCGCGGATCT CACGCGGGCG
CACCGGGTGG TGGCGCAGCT GCAGGCCGGG ACCTGCTGGA TCAACGCCTA CAACCTTACG
CCGGTCGAGG CGCCCTTCGG CGGGGTGAAG CTGTCGGGCG TGGGCCGCGA GAACGGCCGG
GCCGCCGTCC AGCACTATAC GCAGGTGAAG TCGGTCTATG TCGGCATGGG GCCGGTGGAC
GCCCCCTACT GA
 
Protein sequence
MRAQPAASHF VDGRPLEDET GAPIPVIYPA TGEEIARLHE ATPAVIEAAL ASGARAQAAW 
AAMRPVERAR ILRRASDLIR AHNEELSLLE TLDTGKPLQE TLVADWASGA DALEFFAGLA
PVVTGETVPL GQDFVYTIRE PLGLCVGIGA WNYPSQIACW KAAPALALGN AMVFKPSEVT
PLGALKLAEI LIEAGLPPGL FNVVQGRGAV GASLVTDSRV AKVSLTGSVP TGRRVYAAAA
EGVRHVTMEL GGKSPLIVFD DADLESAIGA AMLGNFYSAG QICSNGTRVF VQKGIKEAFL
ARLAERADAI RMGDPLDPEV QMGPLVSQAQ LEKVLAYIEK ARAEGGRLVC GGEASVSPGC
YVQPTVFADV TDAMTLAREE VFGPVMAVLD FETEEEAIAR ANATDFGLAA GVFTADLTRA
HRVVAQLQAG TCWINAYNLT PVEAPFGGVK LSGVGRENGR AAVQHYTQVK SVYVGMGPVD
APY