Gene Rsph17029_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0855 
Symbol 
ID4897794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp872410 
End bp873861 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID640111440 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_001042738 
Protein GI126461624 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCCC AGCCCGCCGC CAGCCATTTC GTCGACGGTC GTCCGCTCGA GGATGAGACC 
GGCGCGCCGA TCCCGGTGAT CTATCCCGCC ACCGGCGAGG AGATCGCCCG CCTTCACGAG
GCCACGCCCG CCGTGATCGA GGCGGCTTTG GCCTCGGGCG CCCGCGCGCA GGCGGCCTGG
GCCGCGATGC GGCCCGTCGA GCGGGCGCGG ATCCTGCGCC GCGCCTCGGA CCTGATCCGG
GCGCGCAACG AGGAGCTGAG CCTTCTCGAG ACGCTTGACA CCGGCAAGCC GCTGCAGGAG
ACGCTGGTGG CCGACTGGGC CTCGGGGGCG GATGCGCTGG AATTCTTCGC CGGTCTGGCG
CCCGCCGTCA CCGGCGAAAC CGTGCCGCTG GGGCAGGATT TCGTCTATAC GATCCGCGAG
CCGCTGGGCC TTTGCGTGGG CATCGGCGCC TGGAACTACC CGAGCCAGAT CGCCTGCTGG
AAGGCTGCGC CCGCGCTCGC GCTCGGCAAT GCGATGGTGT TCAAGCCCTC GGAGGTGACG
CCGCTCGGCG CGCTGAAGCT GGCCGAGATC CTGATCGAGG CGGGCCTGCC GCCCGGGCTC
TTCAACGTGG TGCAGGGCCG CGGCGCGGTG GGGGCGGCGC TCGTCACCGA CAGCCGGGTG
GCCAAGGTCT CGCTCACGGG CTCGGTGCCG ACGGGGCGGC GCGTCTATGC GGCTGCGGCC
GAGGGCGTGC GCCATGTCAC GATGGAGCTC GGCGGCAAGT CGCCCCTGAT CGTCTTCGAC
GATGCCGATC TGGAGAGCGC CATCGGCGCG GCGATGCTCG GCAACTTCTA TTCCGCGGGC
CAGATCTGCT CGAACGGGAC GCGGGTCTTC GTGCAGAAGG GGATCAAGGA GGCGTTCCTC
GCCCGGCTCG CCGAGCGGGC CGATGCCATC CGCATGGGCG ATCCGCTCGA CCCCGAGGTG
CAGATGGGTC CGCTCGTCTC GCAGGCGCAG CTCGAGAAGG TGCTGGCCTA TATCGAGAAG
GCCCGCGCCG AGGGCGGCCG CCTCGTCTGC GGCGGCGAGG CCTCGGTCAG CCCCGGCTGC
TATGTCCAGC CCACGGTCTT CGCCGATGTG ACGGACGCCA TGACCCTCGC CCGCGAGGAG
GTGTTCGGTC CGGTGATGGC GGTGCTCGAT TTCGAGACCG AGGAGGAGGC GATCGCGCGG
GCGAATGCCA CGGACTTCGG CCTCGCCGCG GGCGTCTTCA CCGCGGATCT CACGCGGGCG
CACCGGGTGG TGGCGCAGCT GCAGGCCGGG ACCTGCTGGA TCAACGCCTA CAACCTCACG
CCGGTCGAGG CGCCCTTCGG CGGGGTGAAA CTGTCGGGCG TGGGCCGCGA GAACGGCCGC
GCCGCCGTCG AGCACTATAC GCAGGTAAAG TCGGTCTATG TCGGCATGGG GCCGGTGGAC
GCCCCCTACT GA
 
Protein sequence
MRAQPAASHF VDGRPLEDET GAPIPVIYPA TGEEIARLHE ATPAVIEAAL ASGARAQAAW 
AAMRPVERAR ILRRASDLIR ARNEELSLLE TLDTGKPLQE TLVADWASGA DALEFFAGLA
PAVTGETVPL GQDFVYTIRE PLGLCVGIGA WNYPSQIACW KAAPALALGN AMVFKPSEVT
PLGALKLAEI LIEAGLPPGL FNVVQGRGAV GAALVTDSRV AKVSLTGSVP TGRRVYAAAA
EGVRHVTMEL GGKSPLIVFD DADLESAIGA AMLGNFYSAG QICSNGTRVF VQKGIKEAFL
ARLAERADAI RMGDPLDPEV QMGPLVSQAQ LEKVLAYIEK ARAEGGRLVC GGEASVSPGC
YVQPTVFADV TDAMTLAREE VFGPVMAVLD FETEEEAIAR ANATDFGLAA GVFTADLTRA
HRVVAQLQAG TCWINAYNLT PVEAPFGGVK LSGVGRENGR AAVEHYTQVK SVYVGMGPVD
APY