Gene Rsph17029_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2057 
Symbol 
ID4896601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2180079 
End bp2181365 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content71% 
IMG OID640112650 
Producthomoserine dehydrogenase 
Protein accessionYP_001043932 
Protein GI126462818 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCC CCCTGCGTCT TGGTATTGCC GGTCTCGGCA CCGTCGGTAT CGGGGTGGTC 
AAGATCATCC AGCGTCACGC CGACCTGATC GCCGCCCGCG CAGGACGCCC GGTCATCATC
ACCGCCGTCT GCGCCCGCGA CCGGTCGAAG AACCGCGATG CCGATCTCTC GGGCTACGCC
TGGGAGACGG ATGCGGTGGC GCTGGCACAG CGCGCGGATA TCGACGTGTT CGTCGAGGTC
ATGGGCGGCT CGGAAGGGGC GGCGCGCGCC TCGACCGAGG CCGCGCTGGC CGCCGGCAAG
GATGTGGTGA CCGCGAACAA GGCGCTTCTT GCCCATCACG GGCAGGCGCT GGCCGAGATG
GCCGAAGCTG CGGGCCTCGC CATCCGGTTC GAGGCGGCCG TCGCGGGCGG CATCCCGGTC
ATCAAGGCGC TGACCGAGGG GCTCGCCGGC AACCAGATGC GCCGGGTGAT GGGCGTCATG
AACGGCACCT GCAACTACAT CCTGACCCGG ATGGAGACCG CGGGCCTGCC CTACGACCAT
GTCTTCGAGG AGGCCCGCCA GCTCGGCTAT CTCGAGGCCG ACCCGAACCT CGATGTCGGA
GGGATCGACG CGGGCCACAA GCTCTCGCTT CTGGCCGCGA TCGCCTTCGG CACCCGCGTC
AGCTTCGACG ACGTGCAGCT CGAGGGGATC GGCCAGATCT CGATCGACGA CATCCGCCAT
GCGGGCGACC TCGGCTTCCG CATCAAGCTT CTGGGCGTGG CGCAGGTGTC GGGCCGCGGG
CTCGAGCAGC GGATGACCCC CTGCCTCGTG CCCGCCGACA GCCCGCTCGG CCAGCTTCAG
GGCGGCACCA ACATGGTGGT GCTCGAAGGA GACGCGGTGG GCCAGATCGT GCTGCGCGGC
CCCGGCGCGG GCGAAGGCCC CACCGCCAGC GCCGTGATGG GCGACGTGAT CGACCTCGCG
CGCGGACTGC GCCTGCCCAC CTTCGGCCGC CCCGCCACGA GCCTCGTCGA GGCGGTGCCC
GCCAAGGTCG CGGCGCCCGC GCCTTGGTAT CTGCGCATGA CGCTCCTCGA CAAGCCGGGG
GCGCTGGCCA AGATCGCGAC CGCGCTCGGC GAGGCCGGCA TCTCCATCGA CCGGATGCGC
CAGTATGGCC ACGAGGGCGG CCACGCCCCG GTGCTGATCG TCACCCACAA GGCCTCTCGC
GACGACATTT CCCACGCGAT CAGCCGGTTC GGGGCCACGG GCGTCCTCGT CGGCGAACCT
GTGGCGATCC GCATCGAAGA GGTTTGA
 
Protein sequence
MAAPLRLGIA GLGTVGIGVV KIIQRHADLI AARAGRPVII TAVCARDRSK NRDADLSGYA 
WETDAVALAQ RADIDVFVEV MGGSEGAARA STEAALAAGK DVVTANKALL AHHGQALAEM
AEAAGLAIRF EAAVAGGIPV IKALTEGLAG NQMRRVMGVM NGTCNYILTR METAGLPYDH
VFEEARQLGY LEADPNLDVG GIDAGHKLSL LAAIAFGTRV SFDDVQLEGI GQISIDDIRH
AGDLGFRIKL LGVAQVSGRG LEQRMTPCLV PADSPLGQLQ GGTNMVVLEG DAVGQIVLRG
PGAGEGPTAS AVMGDVIDLA RGLRLPTFGR PATSLVEAVP AKVAAPAPWY LRMTLLDKPG
ALAKIATALG EAGISIDRMR QYGHEGGHAP VLIVTHKASR DDISHAISRF GATGVLVGEP
VAIRIEEV