Gene Rsph17029_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3621 
Symbol 
ID4898100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp713403 
End bp714446 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content74% 
IMG OID640114229 
Productalcohol dehydrogenase 
Protein accessionYP_001045483 
Protein GI126464370 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGC TCGTCCTCCA CGCCCCCCAC GACCTGCGGC TCGACGAGAT CGCCGCGGCG 
GCCGATCCCG GTCCCGGCGA GGTGCGCGTC GCGGTGAGCC ACGGCGGGAT CTGCGGCTCG
GACCTGCATT ACTATCACCA CGGCGGCTTC GGCACCGTGC GGCTGCGCGA GCCGATGGCG
CTCGGCCATG AGGTGTCCGG CATCGTGACG GCGCTCGGCG CCGGCGTGAC GGACCTCCGC
GAGGGCGACC GCGTGGCGGT CAACCCCTCG CGGCCCTGCG GACGCTGCGA CTATTGCCGC
CGCGGCCTCG CGCACCATTG TCTCGACATG CGCTTCAACG GCTCGGCCAT GCGCTTTCCG
CACGAACAGG GCCTGTTCCG CGCGGCGGTG ACGCTGCCCG CCGCCCAGGC CGTGCGTCTG
CCTGCGGAGA CCGACCTCGC GCTTGCCGCC ATGTCGGAGC CGCTGGCCGT CTGCCTTCAT
GCCGTGGCGG GCGCGGGCAG CCTGATCGGC AAGCGGGTGC TCGTGTCGGG CTGCGGCCCG
ATCGGCTGCC TGACGATCCT CGCCGCACGC GCGGCCGGGG CAGAAGAGAT CGTGGCCTCC
GACATCGCGG CGCCCGCACT GGCAGCCGCC CGCGCGGTGG GCGCGGACAG GGTGCTCGAC
CTTGCGGCCG AGTCCGAGGC GCTCGAGCCG TTCGCGGAAG GCAAGGGCCG GATCGACGTG
GTGCTCGAAT GTTCGGGCGC CCCGCCCGCG CTTCTGGCGG CCCTCCGGGT GCTGCGTCCG
CAGGGCCTTC TGGTCGCCGT GGGCCTCGGC CCCGAGGTCG CGCTGCCCGT GACCGCGCTC
GTTGCCCGCG AGATCCGCCT GCAGGGCAGC TTCCGCTTCG ATGCGGAGTT CGCCACCGCC
GCCCGGGCCA TCGCCTCGGG CCGCATCGAT GTGTCGCCGC TGCTCACCCG GGTGCTGCCC
GTGACCGAAG CCGCGGACGC CTTCGCCCTC GCCTCGGACA AGAGCCGGGC GATGAAGGTG
CAGATTGCCT TCCCGCCCCC GTGA
 
Protein sequence
MKALVLHAPH DLRLDEIAAA ADPGPGEVRV AVSHGGICGS DLHYYHHGGF GTVRLREPMA 
LGHEVSGIVT ALGAGVTDLR EGDRVAVNPS RPCGRCDYCR RGLAHHCLDM RFNGSAMRFP
HEQGLFRAAV TLPAAQAVRL PAETDLALAA MSEPLAVCLH AVAGAGSLIG KRVLVSGCGP
IGCLTILAAR AAGAEEIVAS DIAAPALAAA RAVGADRVLD LAAESEALEP FAEGKGRIDV
VLECSGAPPA LLAALRVLRP QGLLVAVGLG PEVALPVTAL VAREIRLQGS FRFDAEFATA
ARAIASGRID VSPLLTRVLP VTEAADAFAL ASDKSRAMKV QIAFPPP