Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4204 |
Symbol | |
ID | 5319099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 685750 |
End bp | 687639 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776009 |
Product | xylose isomerase domain-containing protein |
Protein accession | YP_001312942 |
Protein GI | 150376346 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.460434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT CGATTGCGAC TGTTTCGATC AGCGGCACTC TTTCGGAGAA ACTCGCCGCC GTCGCGGCGG CAGGCTTCGA CGGGGTGGAG ATATTCGAAA ACGATTTCCT GACCTTCGAC GGCTCGCCTG CGGAGGTCGG CCGCATTGTG CGCGATCATG GATTGAAGGT GACGCTTTTT CAGCCTTTTC GCGATTTCGA AGGCCTGCCG GAACCGCACC GGTCACGCGC TTTCGACCGG GCGGAGCGCA AGTTCGACGT GATGCAGGAG CTCGGGACGG AATTGATGCT GGTCTGCTCC AGCGTCTCGC CACTCGCCTT GGGCGGTATC GACCGCGCCG CCGACGATTT CAGGGAGCTG GGCGAAAGGG CGGCGCGGCG GAACCTGCGC GTCGGCTACG AGGCGCTTGC CTGGGGCCGC CATGTCAACG ACCACCGGGA CGCCTGGGAA ATCGTCCGCA GGGCCGACCA CCCGAACGTC GGGTTGATCC TCGACAGCTT TCATACGCTC GCGCGCAACA TCGACCTGCG GTCGATCCGC TCCATTCCGG GGGACCGCAT CTTCATCGTT CAACTCGCTG ACGCGCCGCG GATCGAGATG GACCTCCTCT ATCTCAGCCG CCACTTCCGC AATATGCCTG GGGAAGGCGA CCTTCCGCTC GTCGATTTCA TGTCCGCCGT CGCCGCGACC GGCTATGACG GCGCGATATC GCTCGAGATC TTCAACGATC AGTTCCGTGG CGGATCGCCC CAAGCGATCG CCGAGGATGG ACATCGTTCG CTGGTTTACC TGATGGATCG GGTGCGGCGA CACGAGCCGG CGGCGAAGAG CGCCGAAGCA CTGCCCGACC GGGTGGAGGT GCTGGGGGTC GAGTTCGTGG AGTTCACCGC CGATCAGACG GAGGCGGAAG CGCTCGGCTC GCTGCTTGGG GCCATGGGTT TTCGCGCCGT CGCCTCGCAT CGCGAGAAAG CGGTGACCCT TTGGCGACAG GGTTGCGATC ACCGAGGCAT AAACGTCGTC ATCAATACCG AGCAGAAGGG GTTCGCTCAT TCGAGCTATC TGGTGCACGG CGCATCGGCC TATGCGATTG GCCTGAAAGT ACCCGATGCC TCGGCCGCGG TCGAACGCGC GCGCGCCCTC GGCGCCGAAA TATTCCTGCC GGAGTCAGGT GAAGGCGAGG GGGCGATGGC AGCGATCCGC GGTATGGGCG GCGGCCTTAT TTATTTCGTG GACGAGATCG ACGACGTCTG GTCGCGCGAG TTCGTCGCGC CGGGAGATGA TCGGCAGCCT GCAGCGACCC TCGTCGCCAT AGACCATGTG GCGCAATCCA TGAAGGAAGA GCAGCTTCCG AGCTGGCTGC TTTTCTATAC ATCCATTCTT GATGCGGACA AGCTGGCACA GGTCGATATC GTCGATCCCG CCGGGCTGGT CCGCAGCCAG GTCGTGGAGA ATGCGAGCGG GACGCTCCGG TTGACGTTGA ACGGCGCGGA CAATCACCGC ACGCTCGCCG GCCACTTCAT CGCAGAGAGC TTCGGCTCGG GCGTCCAGCA TGTGGCCTTC CGCACCGACG ACATCTTCGA AGCTGCCGAA CACATGCGCC GCAGCGGCTT TCGCGCCTTG GCGATCTCGC GCAACTACTA TGATGACATC GAAGTGCGGT TCGCGCTCGA ACCGGGTTTT GTCGACAGGC TAAGAGACGA AAACATCCTC TATGATCGCG ATGAGGATGG CGAGTATTTC CAGATCTATG GCCCCACCTT TGGAGAGGGC TTCTTCTTCG AGATCGTCGA GCGCCGCTCC GGTTACCGTG GCTATGGTGC AGCCAACGCA CCTTTCCGTA TCGCCGCGCA GAAACGTTGC CTTCGGCCGG CGGGTATGCC TGCAGAATAG
|
Protein sequence | MKTSIATVSI SGTLSEKLAA VAAAGFDGVE IFENDFLTFD GSPAEVGRIV RDHGLKVTLF QPFRDFEGLP EPHRSRAFDR AERKFDVMQE LGTELMLVCS SVSPLALGGI DRAADDFREL GERAARRNLR VGYEALAWGR HVNDHRDAWE IVRRADHPNV GLILDSFHTL ARNIDLRSIR SIPGDRIFIV QLADAPRIEM DLLYLSRHFR NMPGEGDLPL VDFMSAVAAT GYDGAISLEI FNDQFRGGSP QAIAEDGHRS LVYLMDRVRR HEPAAKSAEA LPDRVEVLGV EFVEFTADQT EAEALGSLLG AMGFRAVASH REKAVTLWRQ GCDHRGINVV INTEQKGFAH SSYLVHGASA YAIGLKVPDA SAAVERARAL GAEIFLPESG EGEGAMAAIR GMGGGLIYFV DEIDDVWSRE FVAPGDDRQP AATLVAIDHV AQSMKEEQLP SWLLFYTSIL DADKLAQVDI VDPAGLVRSQ VVENASGTLR LTLNGADNHR TLAGHFIAES FGSGVQHVAF RTDDIFEAAE HMRRSGFRAL AISRNYYDDI EVRFALEPGF VDRLRDENIL YDRDEDGEYF QIYGPTFGEG FFFEIVERRS GYRGYGAANA PFRIAAQKRC LRPAGMPAE
|
| |