Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2701 |
Symbol | |
ID | 4077008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2842959 |
End bp | 2844122 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638008026 |
Product | D-3-hydroxyaspartate aldolase |
Protein accession | YP_614695 |
Protein GI | 99082541 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3616] Predicted amino acid aldolase or racemase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.75052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.463179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCAC CCGCAAATTT CGACAGCCTC GAAGTGGGCT TTGACGTCCC CGCCCTGCCC GGCATGGATG AGGCCGACAT CCAGACCCCC TGTCTGGTGC TCGACCTCGA CGCGCTGGAG CGCAACATCA AGAAGATGGG CGATTACGCC CGCGCGCACG GCATGCGCCA CCGCGTGCAT GGCAAGATGC ATAAATCGGT GGATGTGGCC AAACTCCAAG AGCGTCTTGG TGGCGCAATC GGGGTCTGCT GCCAGAAGGT CTCTGAGGCC GAAGTCTTTG CACGAGGCGG CATAAAGGAC GTGTTAGTGT CTAATCAGGT GCGCGATCCG GCCAAGATCG ACCGGCTGGC GCGGCTCCCG AAACTGGGCG CGCGTACCAT CGTCTGTGTG GATGATCCGG CCAATGTCGC GGATCTGTCA GCAGCGGCAC AGAAGCATGA CACCGAGATC GAGTGCTTTG TGGAGATCGA CTGTGGCGCT GGACGCTGCG GCGTGACCAC CACGGAAGAC GTGGTCGAGA TCGCGCAGGC CATTGATGCC GCCCCGGGGC TAAAGTTCAC CGGCATCCAA GCCTACCAGG GCGCCATGCA GCACCTCGAC AGCTACGAGG CGCGCAAGGA AAAGCTCGAC GTTGCAATCG CTCAGGTGCG TGATGCGGTC GAGGGGCTGA AGGCTGCGGA TCTTGCGCCG GAATTGGTCT CTGGAGGCGG TACAGGCTCT TACTACTTCG AGTCTAACTC AGGGGTTTAT AACGAATTGC AGTGTGGCTC CTACGCGTTC ATGGATGCCG ACTATGGCCG AATTCTGGAC CAGGACGGCA AGCGGATCGA CCAGGGCGAG TGGGAGAATG CCTTCTTCAT TCTGACCCAG GTCATGAGCC ACGCCAAAGC TGACAAGGCG ATCTGTGATG CGGGCCTCAA GGCGCAATCC GTGGACAGCG GACTGCCGTT CATCTTTGGT CGCACGGATG TCGAATACGT CAAATGCTCG GACGAGCATG GGGTGATAGC TGACCCCGAT GGCGTGCTGA AGGTGGGCGA GAAACTGAAG CTGGTCCCCG GCCATTGTGA TCCCACCGCG AATGTCCACG ACTGGTACGT CGGCGTCCGC AACGGCAAGG TCGAAACCCT CTGGCCAGTG TCCGCGCGCG GCAAGGCCTA CTGA
|
Protein sequence | MNAPANFDSL EVGFDVPALP GMDEADIQTP CLVLDLDALE RNIKKMGDYA RAHGMRHRVH GKMHKSVDVA KLQERLGGAI GVCCQKVSEA EVFARGGIKD VLVSNQVRDP AKIDRLARLP KLGARTIVCV DDPANVADLS AAAQKHDTEI ECFVEIDCGA GRCGVTTTED VVEIAQAIDA APGLKFTGIQ AYQGAMQHLD SYEARKEKLD VAIAQVRDAV EGLKAADLAP ELVSGGGTGS YYFESNSGVY NELQCGSYAF MDADYGRILD QDGKRIDQGE WENAFFILTQ VMSHAKADKA ICDAGLKAQS VDSGLPFIFG RTDVEYVKCS DEHGVIADPD GVLKVGEKLK LVPGHCDPTA NVHDWYVGVR NGKVETLWPV SARGKAY
|
| |