Gene Information |
Locus tag | TM1040_0097 |
Symbol | |
ID | 4078763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 102371 |
End bp | 103426 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638005384 |
Product | deoxyribose-phosphate aldolase |
Protein accession | YP_612092 |
Protein GI | 99079938 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0274] Deoxyribose-phosphate aldolase |
TIGRFAM ID | [TIGR00126] deoxyribose-phosphate aldolase |
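The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 102371
END_BP = 103426


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1056 351
```

Both values agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields of the record.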
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGCC AAACCGCAGA AGACGGCACG CAGCCGCCGA AGACCTCCGC TGCCAAACCC TCCGCCAAGG CCAAGCCGAC CGCAGCCGCC ACGCAGGACC ATGAACACAG CGACCTGCCC CGCAATCCGG GTATCGCGCT TGATCTCGAC TGGGCCCTGA GCCTGGAGGC CAATACCTCC GCGATTGAAC GCCGCTGCGC GAGCCTGCCG GGTCGGCGCT CGGTCAAGAA GGACTATCAG GCCGCTTGGC TCTTGAAGGC GATCAGCCTC ATCGATCTCA CCACGCTCTC GGGCGATGAT ACCGAGGCCC GCGTGCGCCG TCTCTGCGCC AAGGCGCGCC AGCCGGTGCG AGCGGATATT CTCTCGGCAC TTGGCATCGA CGGGATCACG ACCGGAGCCG TCTGTGTCTA TCACGACATG ATCCCCGCCG CCGTGCGCGC GCTGCATGGC ACCCATATCC CGGTTGCCGC CGTCTCTACG GGCTTTCCGG CGGGGCTGTC GCCCTTCCAC CTGCGGCTCG AGGAAATCCG CGAAAGCGTG CGGGCCGGCG CCAAGGAAAT CGACATTGTG ATCTCGCGCC GCCATGTGCT CTCGGGGGAC TGGCAGGCGC TTTATGACGA GATGAAAGCC TTCCGCGAGG CCTGCGGAGA TGCCCACGTC AAGGCGATCC TCGCCACTGG CGAGCTTGGC ACTCTGCGCA ATGTCGCCCG CGCCTCGATG ATCTGCATGA TGGCGGGTGC CGACTTCATC AAGACGTCCA CTGGCAAAGA GAGCGTCAAC GCCACCCTGC CCGTCTCGTT GGTGATGATC CGCGCCATTC GCGAGTATTA CGAGCGCACC GGCTTTCATG TGGGCTACAA GCCCGCTGGT GGCATCTCCA AGGCCAAAGA CGCGCTGGTG TACCTCAGCC TCATGAAAGA GGAACTTGGC AACCGCTGGC TGCAGCCGGA CCTGTTCCGC TTTGGCGCGT CCAGCCTTCT GGGCGACATC GAACGCCAGC TCGAACACCA TGTGACCGGC GCCTATTCCG CTGGCTACCG CCACCCGATG GCGTGA
|
Protein sequence | MDSQTAEDGT QPPKTSAAKP SAKAKPTAAA TQDHEHSDLP RNPGIALDLD WALSLEANTS AIERRCASLP GRRSVKKDYQ AAWLLKAISL IDLTTLSGDD TEARVRRLCA KARQPVRADI LSALGIDGIT TGAVCVYHDM IPAAVRALHG THIPVAAVST GFPAGLSPFH LRLEEIRESV RAGAKEIDIV ISRRHVLSGD WQALYDEMKA FREACGDAHV KAILATGELG TLRNVARASM ICMMAGADFI KTSTGKESVN ATLPVSLVMI RAIREYYERT GFHVGYKPAG GISKAKDALV YLSLMKEELG NRWLQPDLFR FGASSLLGDI ERQLEHHVTG AYSAGYRHPM A
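The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example and not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand (as listed above), ignores the spaces between 10-base groups, and uses the amino acid assignments of translation table 11, which match the standard genetic code. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies
# fastest). Translation table 11 shares these assignments with the standard
# code; the two tables differ only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence listed above.
    print(translate("ATGGACAGCC AAACCGCAGA AGACGGCACG"))  # MDSQTAEDGT
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields of the record.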