Gene Smed_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4204 
Symbol 
ID5319099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp685750 
End bp687639 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content63% 
IMG OID640776009 
Productxylose isomerase domain-containing protein 
Protein accessionYP_001312942 
Protein GI150376346 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.460434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCT CGATTGCGAC TGTTTCGATC AGCGGCACTC TTTCGGAGAA ACTCGCCGCC 
GTCGCGGCGG CAGGCTTCGA CGGGGTGGAG ATATTCGAAA ACGATTTCCT GACCTTCGAC
GGCTCGCCTG CGGAGGTCGG CCGCATTGTG CGCGATCATG GATTGAAGGT GACGCTTTTT
CAGCCTTTTC GCGATTTCGA AGGCCTGCCG GAACCGCACC GGTCACGCGC TTTCGACCGG
GCGGAGCGCA AGTTCGACGT GATGCAGGAG CTCGGGACGG AATTGATGCT GGTCTGCTCC
AGCGTCTCGC CACTCGCCTT GGGCGGTATC GACCGCGCCG CCGACGATTT CAGGGAGCTG
GGCGAAAGGG CGGCGCGGCG GAACCTGCGC GTCGGCTACG AGGCGCTTGC CTGGGGCCGC
CATGTCAACG ACCACCGGGA CGCCTGGGAA ATCGTCCGCA GGGCCGACCA CCCGAACGTC
GGGTTGATCC TCGACAGCTT TCATACGCTC GCGCGCAACA TCGACCTGCG GTCGATCCGC
TCCATTCCGG GGGACCGCAT CTTCATCGTT CAACTCGCTG ACGCGCCGCG GATCGAGATG
GACCTCCTCT ATCTCAGCCG CCACTTCCGC AATATGCCTG GGGAAGGCGA CCTTCCGCTC
GTCGATTTCA TGTCCGCCGT CGCCGCGACC GGCTATGACG GCGCGATATC GCTCGAGATC
TTCAACGATC AGTTCCGTGG CGGATCGCCC CAAGCGATCG CCGAGGATGG ACATCGTTCG
CTGGTTTACC TGATGGATCG GGTGCGGCGA CACGAGCCGG CGGCGAAGAG CGCCGAAGCA
CTGCCCGACC GGGTGGAGGT GCTGGGGGTC GAGTTCGTGG AGTTCACCGC CGATCAGACG
GAGGCGGAAG CGCTCGGCTC GCTGCTTGGG GCCATGGGTT TTCGCGCCGT CGCCTCGCAT
CGCGAGAAAG CGGTGACCCT TTGGCGACAG GGTTGCGATC ACCGAGGCAT AAACGTCGTC
ATCAATACCG AGCAGAAGGG GTTCGCTCAT TCGAGCTATC TGGTGCACGG CGCATCGGCC
TATGCGATTG GCCTGAAAGT ACCCGATGCC TCGGCCGCGG TCGAACGCGC GCGCGCCCTC
GGCGCCGAAA TATTCCTGCC GGAGTCAGGT GAAGGCGAGG GGGCGATGGC AGCGATCCGC
GGTATGGGCG GCGGCCTTAT TTATTTCGTG GACGAGATCG ACGACGTCTG GTCGCGCGAG
TTCGTCGCGC CGGGAGATGA TCGGCAGCCT GCAGCGACCC TCGTCGCCAT AGACCATGTG
GCGCAATCCA TGAAGGAAGA GCAGCTTCCG AGCTGGCTGC TTTTCTATAC ATCCATTCTT
GATGCGGACA AGCTGGCACA GGTCGATATC GTCGATCCCG CCGGGCTGGT CCGCAGCCAG
GTCGTGGAGA ATGCGAGCGG GACGCTCCGG TTGACGTTGA ACGGCGCGGA CAATCACCGC
ACGCTCGCCG GCCACTTCAT CGCAGAGAGC TTCGGCTCGG GCGTCCAGCA TGTGGCCTTC
CGCACCGACG ACATCTTCGA AGCTGCCGAA CACATGCGCC GCAGCGGCTT TCGCGCCTTG
GCGATCTCGC GCAACTACTA TGATGACATC GAAGTGCGGT TCGCGCTCGA ACCGGGTTTT
GTCGACAGGC TAAGAGACGA AAACATCCTC TATGATCGCG ATGAGGATGG CGAGTATTTC
CAGATCTATG GCCCCACCTT TGGAGAGGGC TTCTTCTTCG AGATCGTCGA GCGCCGCTCC
GGTTACCGTG GCTATGGTGC AGCCAACGCA CCTTTCCGTA TCGCCGCGCA GAAACGTTGC
CTTCGGCCGG CGGGTATGCC TGCAGAATAG
 
Protein sequence
MKTSIATVSI SGTLSEKLAA VAAAGFDGVE IFENDFLTFD GSPAEVGRIV RDHGLKVTLF 
QPFRDFEGLP EPHRSRAFDR AERKFDVMQE LGTELMLVCS SVSPLALGGI DRAADDFREL
GERAARRNLR VGYEALAWGR HVNDHRDAWE IVRRADHPNV GLILDSFHTL ARNIDLRSIR
SIPGDRIFIV QLADAPRIEM DLLYLSRHFR NMPGEGDLPL VDFMSAVAAT GYDGAISLEI
FNDQFRGGSP QAIAEDGHRS LVYLMDRVRR HEPAAKSAEA LPDRVEVLGV EFVEFTADQT
EAEALGSLLG AMGFRAVASH REKAVTLWRQ GCDHRGINVV INTEQKGFAH SSYLVHGASA
YAIGLKVPDA SAAVERARAL GAEIFLPESG EGEGAMAAIR GMGGGLIYFV DEIDDVWSRE
FVAPGDDRQP AATLVAIDHV AQSMKEEQLP SWLLFYTSIL DADKLAQVDI VDPAGLVRSQ
VVENASGTLR LTLNGADNHR TLAGHFIAES FGSGVQHVAF RTDDIFEAAE HMRRSGFRAL
AISRNYYDDI EVRFALEPGF VDRLRDENIL YDRDEDGEYF QIYGPTFGEG FFFEIVERRS
GYRGYGAANA PFRIAAQKRC LRPAGMPAE