Gene Dshi_1996 details


Gene Information

Locus tag: Dshi_1996
Symbol: xylA
ID: 5712991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2114512
End bp: 2115816
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 65%
IMG OID: 641267920
Product: xylose isomerase
Protein accession: YP_001533336
Protein GI: 159044542
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2115] Xylose isomerase
TIGRFAM ID: [TIGR02630] xylose isomerase
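
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2114512
end_bp = 2115816

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1305
print(protein_length)  # 434
```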


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00412566
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.553751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATT TTTTCGCGGG GATCCCCGCC GTCACCTATG AAGGCCCCGA GGCCCGCAGC 
GACTTCGCCT TCCGGCACTA CAACCCCGAC GAGATGGTCA TGGGCAAGCG GATGGAGGAT
CAGCTGCGCT TCGCCGTGGC CTGGTGGCAT TCCTTTGCTT GGGAAGGGGG CGACCCGTTC
GGCGGTCCGA CCTTCCAGCG CCCGTGGTTC GGCGACACCC TGGATCATGC CCGCGCCAAG
GCCGATGCGG CGTTCGAGAT GTTCCGCATC CTGAACGTGC CGTTCTATTG CTTCCACGAC
GCCGATATCC GCCCCGAGGG CGCGAGTTTC GCCGAGACCA CCGCGAACCT TGAAGCGATG
GTGGACTACC TGGGCCAGAA GCAGGAGGCG AGCGGCAAGC GGCTGCTCTG GGGCACCGCG
AACCTGTTCT CGCACCGGCG CTACATGGCT GGGGCGGCGA CGAACCCGGA CCCGGATGTG
TTCGCCTATG CCGCCGCCAC CATCAAGACC TGCATGGACG CGACCCACAA ACTGGGCGGC
GCGAATTATG TCCTGTGGGG CGGGCGCGAG GGGTACGAGA CCCTGCTCAA CACCGATCTG
GGGCGCGAGC GGGAGCAGGC GGGCCGGATG CTGCAGATGG TTGTGGATTA CAAGCACAAG
ATCGGGTTCG AGGGCACGAT CCTGCTGGAG CCGAAACCGC AGGAGCCCAC GAAGCACCAA
TATGATTACG ACGTGGCCAC GGTGTTCAGC TTTCTGTCGG AGTTCGGCCT GCAGGACGAG
GTGAAGATGA ATATCGAGCA GGGCCATGCG ATCCTGGCCG GGCATTCCTT CGAGCATGAG
CTGGCGCTGG CGCGGGAGTT CGGGATCCTG GGCTCCATCG ACATGAACCG CAACGATTAC
CAGTCGGGCT GGGACACCGA CCAGTTCCCG AACAACATCC CCGAGGTGGC GCTGGCCTAT
TACGAGATCC TGCGCGCGGG CGGCTTCGAT ACCGGGGGCA CCAATTTCGA TTCCAAGCTG
CGGCGGCAAT CGCTGGACCC GGCGGACCTG ATCGCGGCCC ATGTGGCGGC GATGGATGTC
TGTGCCGCGG GGCTGAAGGC GGCGGCACGG ATGCTGGAGG ATGGCGAGTT GGAGCAGCGG
CGCGAGGATC GCTATGCGGG CTGGCGCGCG CCCTCGGCGG AAGCGATGCT GAACGGTGGC
AAGCTGGAGG ACTGCTTTGC CCATGTGATG GAGACCGGGC TTGATCCGCA GCCGGTCTCG
GGCGGGCAGG AACGGCTGGA GGCGCTGGTC GCGCGATACC TGTAA
 
Protein sequence
MSDFFAGIPA VTYEGPEARS DFAFRHYNPD EMVMGKRMED QLRFAVAWWH SFAWEGGDPF 
GGPTFQRPWF GDTLDHARAK ADAAFEMFRI LNVPFYCFHD ADIRPEGASF AETTANLEAM
VDYLGQKQEA SGKRLLWGTA NLFSHRRYMA GAATNPDPDV FAYAAATIKT CMDATHKLGG
ANYVLWGGRE GYETLLNTDL GREREQAGRM LQMVVDYKHK IGFEGTILLE PKPQEPTKHQ
YDYDVATVFS FLSEFGLQDE VKMNIEQGHA ILAGHSFEHE LALAREFGIL GSIDMNRNDY
QSGWDTDQFP NNIPEVALAY YEILRAGGFD TGGTNFDSKL RRQSLDPADL IAAHVAAMDV
CAAGLKAAAR MLEDGELEQR REDRYAGWRA PSAEAMLNGG KLEDCFAHVM ETGLDPQPVS
GGQERLEALV ARYL
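
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the Python below strips the spacing and translates a DNA fragment codon by codon. It builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose amino-acid assignments are the same as the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers for illustration, not part of any database tool, and only the first and last lines of the gene sequence are used to keep the example short.

```python
# Codon table in TCAG order; amino-acid assignments for translation table 11
# (identical to the standard code; the tables differ only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(grouped_dna):
    """Translate a DNA string, ignoring whitespace between 10-base blocks.

    Translation starts at the first base; trailing bases that do not
    form a complete codon are dropped. '*' marks a stop codon.
    """
    dna = "".join(grouped_dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
# Both begin on a codon boundary because each full line holds 60 bases.
first_line = "ATGAGCGATT TTTTCGCGGG GATCCCCGCC GTCACCTATG AAGGCCCCGA GGCCCGCAGC"
last_line = "GGCGGGCAGG AACGGCTGGA GGCGCTGGTC GCGCGATACC TGTAA"

print(translate(first_line))  # MSDFFAGIPAVTYEGPEARS
print(translate(last_line))   # GGQERLEALVARYL*
```

The two outputs match the first twenty and last fourteen residues of the protein sequence listed above, with the trailing `*` marking the TAA stop codon.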