Gene Rsph17029_3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3057 
Symbol 
ID4899069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp68255 
End bp69376 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content72% 
IMG OID640113659 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001044929 
Protein GI126463816 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0713063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACG ACCTCACCCG ACCGATTACC CTTCTTCGGC CTGCCGCCGT GCATTTCGGC 
GAAGGCAGCC TTGCCCGCCT GCCAGAGTGG GTGGCCGCGC GCGGCTTTCG CGCGCCCTTC
GTCATCGCCG ATGCGGTGAA TGCGCAGCGG CTGGACCGGC TGGGGCTCGG ATCGGTCGGC
TGCTTCGGGA CCGTCGTGCC CGAGCCCGAC ACCGCCAATC TGCAGGCCGC CGTGGCCGCG
GCCGAAGGGG CCGACCTGAT CGTGGGATTC GGCGGCGGCT CGGCCATGGA CCTGGCCAAG
CTCGTGGCTG TGCTCGTGGG AACCGGCCTC GCGCTTTCGG ACATCTCCGG TCCCGGACGG
GCGCCGGCCC GGCGCGTGGG CCTCGTGCAG GTGCCGACCA CCGCCGGGAC CGGCTCGGAA
GTGGGCACGC GCGCCCTCGT GACGGATCCC GCGAGCCTTG CCAAGATCGC GACCGAAAGC
GCCGAGATGC TTGCCGACAT GGCGATCGTG GACCCTGCGC TCACGCTCAG CGTGCCGCCC
GCGGTCACGG CCGCAACCGG GGTCGACGCC ATGGCCCATT GCGCCGAGGC CCTGACCTCG
AAACGGGCGC ATCCGCTGGT CGACGCCTAT GCTCTGGAGG GGATCGCGCT CGTCGGCCGC
TTCCTGCGTC GCGCGGTCGA GGACGGGCAG GATGTCGAAG CCCGGGCAGG CCTGTCGCTC
GCGGCCTTCT ATGGCGGCAT CTGCCTCGGC CCCGTGAACA CGACGGCGGG CCATGCGCTC
TCCTATCCGC TCGGCACGCG CCACAAGCTG CCCCACGGGA TCGCGAATGC GCTGATCTTC
CCGCATGTGC TGGCGGCCAA CGCCTCGGCC GCGCCGGAGA AGACGGCCCG GATCTGCGCG
GCTCTGGGCT TTGCTGCGGG CGCCGAGGAG ACGGTGCGGG CCGGTGCGCT CGCCTTCTGC
GCCGGGCTCG GGCTCGACAT GCGGCTGCGG GCACATGGCG TGCCGTCCGA GGATCTGCCG
GTCATGGCGG CGGAGGCGCA TGGCATCCGC CGCCTGCTCG ACTGGAACCC GCGCGACCTG
AGCGTGGCCG AGATCGAGGC GATCTACCGC CGCGCCTACT GA
 
Protein sequence
MADDLTRPIT LLRPAAVHFG EGSLARLPEW VAARGFRAPF VIADAVNAQR LDRLGLGSVG 
CFGTVVPEPD TANLQAAVAA AEGADLIVGF GGGSAMDLAK LVAVLVGTGL ALSDISGPGR
APARRVGLVQ VPTTAGTGSE VGTRALVTDP ASLAKIATES AEMLADMAIV DPALTLSVPP
AVTAATGVDA MAHCAEALTS KRAHPLVDAY ALEGIALVGR FLRRAVEDGQ DVEARAGLSL
AAFYGGICLG PVNTTAGHAL SYPLGTRHKL PHGIANALIF PHVLAANASA APEKTARICA
ALGFAAGAEE TVRAGALAFC AGLGLDMRLR AHGVPSEDLP VMAAEAHGIR RLLDWNPRDL
SVAEIEAIYR RAY