Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2164 |
Symbol | |
ID | 4076763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2273080 |
End bp | 2274366 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638007484 |
Product | homoserine dehydrogenase |
Protein accession | YP_614158 |
Protein GI | 99082004 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0414769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC CGCTTCGACT GGGGATTGCA GGTTTGGGCA CGGTCGGCAT TGGCGTGGTG AAGATCATTC GCCGCCATGC TGCCCTGTTG GAGGCGCGAA CTGGCCGCCC GGTGGTGATC ACTGCCGTTT CGGCCCGCGA CGCCACCAAG GATCGCGGCG TGTCGCTCAA GGACTATGCG TGGGAAACCG ATCCAGTCGC CCTTGCCACG CGCGATGATG TCGATGTGTT TGTCGAACTG ATGGGCGGGC ACGAAGGCCC GGCCCGGCTT GCAACGGAGG CTGCGCTGGC AGCCGGCAAG GATGTGGTCA CCGCAAACAA GGCGCTCCTG GCGATCCACG GCCAGGATCT GGCCGAACGC GCAGAGGCCA ATGGCAGCGT CATCCGCTTT GAGGCGGCGG TTGCAGGTGG CATCCCCGTG ATCAAATCCA TGACCGAGAG CCTTGCAGGC AATGAAATCA CCCGCGTCAT GGGCGTGATG AACGGCACCT GCAACTATAT CCTCACGCAG ATGGAAGCCA CAGGCCGGGG TTATAACGCT CTCTTTGAGG AATGCGGCAA GCTTGGCTAC CTGGAGGCCG ACCCGCTGCT GGACGTGGGC GGTATTGATG CCGGCCACAA GCTGGCGCTC CTGGCCTCTA TCGCGTTTGG GACCAAACCG GCCTTTGACG ATGTCAAACT CGAAGGCATT CAGCGCATCG CCATTGAAGA CATCCGCCAC GCCGCCGATA TGGGCTATCG GATCAAGCTT CTGGGCGTTG CACAGCGTTC GGCGCGCGGG CTTGAGCAGC GCATGACCCC CTGCCTGGTG CCCGCGAATT CTCCGCTCGG GCAGCTTGAG GGCGGCACCA ACATGGTGGT GATCGAGGGC GACGCCATCG AACAAGTGGT GCTGCGCGGC CCCGGCGCGG GCGAAGGCCC CACCGCCAGT GCGGTGATGG GCGATGTGCT CGACATTGCG CGCGGCCTGC GGATCTCGAC CTTTGGCCAG CCGGCCACGA CGCTCTCGAA AGAACCAGCC GCACAAACCG GCCTGCCTGC GCCCTATTAT GTGCGTATGG CGCTGCAGGA CAAACCCGGC GCGCTGGCCA AAGTCGCCGC AGCATTGGGG GATGCGGGGG TCTCTATCCA CCGGATGCGC CAGTATGATC ACGCCACCAC AGTGGCTCCG GTGTTGATCG TGACTCACAA ATGCACGTCT GCCATGCTGG AACAGGCCCT TGAGGCGCTG GCCGCAACAG GCGTGGTTGA AGGCGCCCCC GTGGCGCTGC GCATCGAAGA GCTGTGA
|
Protein sequence | MTEPLRLGIA GLGTVGIGVV KIIRRHAALL EARTGRPVVI TAVSARDATK DRGVSLKDYA WETDPVALAT RDDVDVFVEL MGGHEGPARL ATEAALAAGK DVVTANKALL AIHGQDLAER AEANGSVIRF EAAVAGGIPV IKSMTESLAG NEITRVMGVM NGTCNYILTQ MEATGRGYNA LFEECGKLGY LEADPLLDVG GIDAGHKLAL LASIAFGTKP AFDDVKLEGI QRIAIEDIRH AADMGYRIKL LGVAQRSARG LEQRMTPCLV PANSPLGQLE GGTNMVVIEG DAIEQVVLRG PGAGEGPTAS AVMGDVLDIA RGLRISTFGQ PATTLSKEPA AQTGLPAPYY VRMALQDKPG ALAKVAAALG DAGVSIHRMR QYDHATTVAP VLIVTHKCTS AMLEQALEAL AATGVVEGAP VALRIEEL
|
| |