Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0030 |
Symbol | |
ID | 4076297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 30394 |
End bp | 31401 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005317 |
Product | aldose 1-epimerase |
Protein accession | YP_612025 |
Protein GI | 99079871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2017] Galactose mutarotase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.413428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.24254 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATG CGCAGATCGT GGAGCATGGT CTCCATCGCG GACATACGCT GAAGGAAGCT CGACTGCAAA GCGCCGGGCT TTCCATCAGT CTGTTGAACT TTGGGGCCGT GACCCGCGAT CTACGCCTTC TGGAGGAGAA CCGTCCGCTC ATTCTCGGCT TTCAGGACCC AGCAGACTAT CTGCTCAACC CCGGCTACCT CGGTGTAATC GCAGGCCGCG TCGCCGGACG GATCAAGAAC GCCCGTTTCA CACTCGGCAG GCAGAGGTTT CAGCTCAATC CCAACGAAGG CGATACCCTC CTGCACGGTG GTGCCAACGG GCTGTGTCAT GTGTTCTGGA ACCTTGAGGT CCTGTCAGAA AACACGGCTC GACTGCGCTA TCACTCGCCC GAAGGCGAGG GTGGTTTTCC CGGTGCAGCC GAGATAACCC TCACGGTAAT GCTTGAGGCA CAGGCGGTTG TCTATGACCT CACGGCCGAA GTAACGGCGC CAACGCCATT CAGCCTTGCT CAGCACAATT ATTACAACCT CATGGGGGGC GCTCAGTCGA TCCGGGAGCA TCGATTGCAG GTTGATGCCA CAAGCTATCT CGGACTGGAC GATGCAAATG TTCCAGATGG CAGGCTTTTG GCGCTTGACG GGTGTCACCA TGATTTCCGT TTGGGGCGCA GCTTTGCGGA ACTTGATCCG CAGACCAAAG GCAGTGACGT GGCCGTGGTG TTTGATGAGT GTCGCGACCC GGAGCAGCCA GTAGCCTCTC TCATTGCGCC GGACGGTCTG CAGATGCGGG TGATCAGCGA TCAGCCCTGC GCGCAGATCT ACACGGCCAG CGCTCTGCCA GAACAGCCCG GCGCTTTGCC GGGACAGCGG ATCGGGTCCG ACATGGGCGT CTGTATTGAG CCACAGGGCT ATGCCAACGC GGTAAACCTG CCGCAGTTTC CAAGCATGAT CGCAACGCCG GAACGGCCCT ACCGGCAACG TCTGCGCCTT GAGTTTGGGA GGATCTGA
|
Protein sequence | MKDAQIVEHG LHRGHTLKEA RLQSAGLSIS LLNFGAVTRD LRLLEENRPL ILGFQDPADY LLNPGYLGVI AGRVAGRIKN ARFTLGRQRF QLNPNEGDTL LHGGANGLCH VFWNLEVLSE NTARLRYHSP EGEGGFPGAA EITLTVMLEA QAVVYDLTAE VTAPTPFSLA QHNYYNLMGG AQSIREHRLQ VDATSYLGLD DANVPDGRLL ALDGCHHDFR LGRSFAELDP QTKGSDVAVV FDECRDPEQP VASLIAPDGL QMRVISDQPC AQIYTASALP EQPGALPGQR IGSDMGVCIE PQGYANAVNL PQFPSMIATP ERPYRQRLRL EFGRI
|
| |