Gene TM1040_2164 details


Gene Information

Locus tag: TM1040_2164
Symbol:
ID: 4076763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2273080
End bp: 2274366
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 64%
IMG OID: 638007484
Product: homoserine dehydrogenase
Protein accession: YP_614158
Protein GI: 99082004
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
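
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon is not translated


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2273080, 2274366)
residues = protein_length_aa(length)
print(length, residues)
```

With the coordinates listed in this record, the computed values agree with the Gene Length and Protein Length fields.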


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0414769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAC CGCTTCGACT GGGGATTGCA GGTTTGGGCA CGGTCGGCAT TGGCGTGGTG 
AAGATCATTC GCCGCCATGC TGCCCTGTTG GAGGCGCGAA CTGGCCGCCC GGTGGTGATC
ACTGCCGTTT CGGCCCGCGA CGCCACCAAG GATCGCGGCG TGTCGCTCAA GGACTATGCG
TGGGAAACCG ATCCAGTCGC CCTTGCCACG CGCGATGATG TCGATGTGTT TGTCGAACTG
ATGGGCGGGC ACGAAGGCCC GGCCCGGCTT GCAACGGAGG CTGCGCTGGC AGCCGGCAAG
GATGTGGTCA CCGCAAACAA GGCGCTCCTG GCGATCCACG GCCAGGATCT GGCCGAACGC
GCAGAGGCCA ATGGCAGCGT CATCCGCTTT GAGGCGGCGG TTGCAGGTGG CATCCCCGTG
ATCAAATCCA TGACCGAGAG CCTTGCAGGC AATGAAATCA CCCGCGTCAT GGGCGTGATG
AACGGCACCT GCAACTATAT CCTCACGCAG ATGGAAGCCA CAGGCCGGGG TTATAACGCT
CTCTTTGAGG AATGCGGCAA GCTTGGCTAC CTGGAGGCCG ACCCGCTGCT GGACGTGGGC
GGTATTGATG CCGGCCACAA GCTGGCGCTC CTGGCCTCTA TCGCGTTTGG GACCAAACCG
GCCTTTGACG ATGTCAAACT CGAAGGCATT CAGCGCATCG CCATTGAAGA CATCCGCCAC
GCCGCCGATA TGGGCTATCG GATCAAGCTT CTGGGCGTTG CACAGCGTTC GGCGCGCGGG
CTTGAGCAGC GCATGACCCC CTGCCTGGTG CCCGCGAATT CTCCGCTCGG GCAGCTTGAG
GGCGGCACCA ACATGGTGGT GATCGAGGGC GACGCCATCG AACAAGTGGT GCTGCGCGGC
CCCGGCGCGG GCGAAGGCCC CACCGCCAGT GCGGTGATGG GCGATGTGCT CGACATTGCG
CGCGGCCTGC GGATCTCGAC CTTTGGCCAG CCGGCCACGA CGCTCTCGAA AGAACCAGCC
GCACAAACCG GCCTGCCTGC GCCCTATTAT GTGCGTATGG CGCTGCAGGA CAAACCCGGC
GCGCTGGCCA AAGTCGCCGC AGCATTGGGG GATGCGGGGG TCTCTATCCA CCGGATGCGC
CAGTATGATC ACGCCACCAC AGTGGCTCCG GTGTTGATCG TGACTCACAA ATGCACGTCT
GCCATGCTGG AACAGGCCCT TGAGGCGCTG GCCGCAACAG GCGTGGTTGA AGGCGCCCCC
GTGGCGCTGC GCATCGAAGA GCTGTGA
 
Protein sequence
MTEPLRLGIA GLGTVGIGVV KIIRRHAALL EARTGRPVVI TAVSARDATK DRGVSLKDYA 
WETDPVALAT RDDVDVFVEL MGGHEGPARL ATEAALAAGK DVVTANKALL AIHGQDLAER
AEANGSVIRF EAAVAGGIPV IKSMTESLAG NEITRVMGVM NGTCNYILTQ MEATGRGYNA
LFEECGKLGY LEADPLLDVG GIDAGHKLAL LASIAFGTKP AFDDVKLEGI QRIAIEDIRH
AADMGYRIKL LGVAQRSARG LEQRMTPCLV PANSPLGQLE GGTNMVVIEG DAIEQVVLRG
PGAGEGPTAS AVMGDVLDIA RGLRISTFGQ PATTLSKEPA AQTGLPAPYY VRMALQDKPG
ALAKVAAALG DAGVSIHRMR QYDHATTVAP VLIVTHKCTS AMLEQALEAL AATGVVEGAP
VALRIEEL
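
The sequences above are printed in blocks of 10 characters, 60 per row, so the spaces and line breaks have to be removed before the text can be used in other tools. The following is a minimal sketch of that cleanup with a simple GC-fraction helper; the function names are hypothetical and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGACTGAAC CGCTTCGACT GGGGATTGCA GGTTTGGGCA CGGTCGGCAT TGGCGTGGTG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to all rows of a listing, joined together, yields one continuous string whose length can be compared with the Gene Length or Protein Length field.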