Gene Saro_0757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0757 
Symbol 
ID3918581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp805250 
End bp806563 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content63% 
IMG OID640443489 
Productxylose isomerase 
Protein accessionYP_496038 
Protein GI87198781 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2115] Xylose isomerase 
TIGRFAM ID[TIGR02630] xylose isomerase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCCG ATTACTTCGC CGATTTCCAG ACGGTCCGCT ACGAAGGGCC GGACAGCGAC 
AATGACTTTG CCTATCGCTG GTACGACAAG GACCGCGTGA TCCTGGGCAA GCGTATGGAG
GATCACCTGC GCTTTGCCGT CTGCATGTGG CACACCTTCT GCTGGCCCGG CAGCGACGTG
TTCGGTGCAG GCACTTTTAC CCGCCCCTGG CTGCAAGGCC CGATGGACGC GAGGAACGCA
GCCGCCAAGC GCGAGGCTGC GCTCGCCTTC GTCGAGAAGC TCGATGTTCC CTTCTACTGC
TTCCATGACG TCGACGTGAT GGCCGAGGCC GAAGGCATTG GCGAATTCCG ATCGAGCTTT
GCCGAAGCGG TCGATCATCT CGAGGAGCTG CAGGGCAAGC ACGGCCGCAA GCTGCTGTGG
GGTACCGCCA ATCTGTTCGG TCACCCTCGC TACATGGCAG GCGCCGCGAC CAATCCCGAT
CCGGAAGTCT TCGCCTGGGG CGCAAGCCAG GTGCGCGACG CGCTGGAAGC GACCCATCGC
CTGGGCGGCG CGAACTACGT GCTGTGGGGC GGCCGCGAAG GCTATGACAG CATCCTCAAC
ACCGAGATCG GGATCGAGCA GGAGAACTTC GGGCGCTTCC TGTCGCTGGT CGTGGATCAC
AAGCATCGCA TCGGCTTCAA GGGCACGATC CTCATCGAGC CCAAGCCGCA CGAGCCGACC
AAGCACCAGT ACGATTTCGA CACCCAGACC GTATTCGGCT TTCTCAAGCG CTTCGGGCTG
GAAAGCGAAG TGAAGGTGAA CATCGAGGCG AATCATGCAA CGCTCTCGGG CCATACTTTC
GAACACGAAC TGGCCATGGC GCGCGCTCTC GGCATTCTCG GCTCGATCGA CGCCAACCGT
GGCGACCACC AGAACGGCTG GGATACCGAC CAATTCCCCA ATTCGGTGGA AGAACTGACG
CTTGCCATGC TTGAACTGAT CCGTGCGGGC GGCTTCACCG ATGGCGGCTT CAATTTCGAC
GCCAAGGTGC GCCGCCAGTC GATCGACGCG GCCGACCTGT TCCACGGCCA CATCGGCGGC
ATCGACACCA TCGCGCACGC GCTGGTCAAG GCGGCGGCGC TGATCGAGGA CGGCAAGCTT
GATGCCTTCC GCGCCGAACG CTACGCGGGG TGGCAGGGGG AACTCGGTCG CAAGATCCAC
GCAGACGGCA CCACGCTGGC CGACATCGCC GACATCGCGG TAGCGCGCGA CCTCGCGCCG
GTGCGCAGGT CGGGCAGGCA GGAGTGGTGT GAAAACCTGA TCAACCGCGT TTGA
 
Protein sequence
MSADYFADFQ TVRYEGPDSD NDFAYRWYDK DRVILGKRME DHLRFAVCMW HTFCWPGSDV 
FGAGTFTRPW LQGPMDARNA AAKREAALAF VEKLDVPFYC FHDVDVMAEA EGIGEFRSSF
AEAVDHLEEL QGKHGRKLLW GTANLFGHPR YMAGAATNPD PEVFAWGASQ VRDALEATHR
LGGANYVLWG GREGYDSILN TEIGIEQENF GRFLSLVVDH KHRIGFKGTI LIEPKPHEPT
KHQYDFDTQT VFGFLKRFGL ESEVKVNIEA NHATLSGHTF EHELAMARAL GILGSIDANR
GDHQNGWDTD QFPNSVEELT LAMLELIRAG GFTDGGFNFD AKVRRQSIDA ADLFHGHIGG
IDTIAHALVK AAALIEDGKL DAFRAERYAG WQGELGRKIH ADGTTLADIA DIAVARDLAP
VRRSGRQEWC ENLINRV