Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1996 |
Symbol | xylA |
ID | 5712991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2114512 |
End bp | 2115816 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641267920 |
Product | xylose isomerase |
Protein accession | YP_001533336 |
Protein GI | 159044542 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2115] Xylose isomerase |
TIGRFAM ID | [TIGR02630] xylose isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00412566 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.553751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATT TTTTCGCGGG GATCCCCGCC GTCACCTATG AAGGCCCCGA GGCCCGCAGC GACTTCGCCT TCCGGCACTA CAACCCCGAC GAGATGGTCA TGGGCAAGCG GATGGAGGAT CAGCTGCGCT TCGCCGTGGC CTGGTGGCAT TCCTTTGCTT GGGAAGGGGG CGACCCGTTC GGCGGTCCGA CCTTCCAGCG CCCGTGGTTC GGCGACACCC TGGATCATGC CCGCGCCAAG GCCGATGCGG CGTTCGAGAT GTTCCGCATC CTGAACGTGC CGTTCTATTG CTTCCACGAC GCCGATATCC GCCCCGAGGG CGCGAGTTTC GCCGAGACCA CCGCGAACCT TGAAGCGATG GTGGACTACC TGGGCCAGAA GCAGGAGGCG AGCGGCAAGC GGCTGCTCTG GGGCACCGCG AACCTGTTCT CGCACCGGCG CTACATGGCT GGGGCGGCGA CGAACCCGGA CCCGGATGTG TTCGCCTATG CCGCCGCCAC CATCAAGACC TGCATGGACG CGACCCACAA ACTGGGCGGC GCGAATTATG TCCTGTGGGG CGGGCGCGAG GGGTACGAGA CCCTGCTCAA CACCGATCTG GGGCGCGAGC GGGAGCAGGC GGGCCGGATG CTGCAGATGG TTGTGGATTA CAAGCACAAG ATCGGGTTCG AGGGCACGAT CCTGCTGGAG CCGAAACCGC AGGAGCCCAC GAAGCACCAA TATGATTACG ACGTGGCCAC GGTGTTCAGC TTTCTGTCGG AGTTCGGCCT GCAGGACGAG GTGAAGATGA ATATCGAGCA GGGCCATGCG ATCCTGGCCG GGCATTCCTT CGAGCATGAG CTGGCGCTGG CGCGGGAGTT CGGGATCCTG GGCTCCATCG ACATGAACCG CAACGATTAC CAGTCGGGCT GGGACACCGA CCAGTTCCCG AACAACATCC CCGAGGTGGC GCTGGCCTAT TACGAGATCC TGCGCGCGGG CGGCTTCGAT ACCGGGGGCA CCAATTTCGA TTCCAAGCTG CGGCGGCAAT CGCTGGACCC GGCGGACCTG ATCGCGGCCC ATGTGGCGGC GATGGATGTC TGTGCCGCGG GGCTGAAGGC GGCGGCACGG ATGCTGGAGG ATGGCGAGTT GGAGCAGCGG CGCGAGGATC GCTATGCGGG CTGGCGCGCG CCCTCGGCGG AAGCGATGCT GAACGGTGGC AAGCTGGAGG ACTGCTTTGC CCATGTGATG GAGACCGGGC TTGATCCGCA GCCGGTCTCG GGCGGGCAGG AACGGCTGGA GGCGCTGGTC GCGCGATACC TGTAA
|
Protein sequence | MSDFFAGIPA VTYEGPEARS DFAFRHYNPD EMVMGKRMED QLRFAVAWWH SFAWEGGDPF GGPTFQRPWF GDTLDHARAK ADAAFEMFRI LNVPFYCFHD ADIRPEGASF AETTANLEAM VDYLGQKQEA SGKRLLWGTA NLFSHRRYMA GAATNPDPDV FAYAAATIKT CMDATHKLGG ANYVLWGGRE GYETLLNTDL GREREQAGRM LQMVVDYKHK IGFEGTILLE PKPQEPTKHQ YDYDVATVFS FLSEFGLQDE VKMNIEQGHA ILAGHSFEHE LALAREFGIL GSIDMNRNDY QSGWDTDQFP NNIPEVALAY YEILRAGGFD TGGTNFDSKL RRQSLDPADL IAAHVAAMDV CAAGLKAAAR MLEDGELEQR REDRYAGWRA PSAEAMLNGG KLEDCFAHVM ETGLDPQPVS GGQERLEALV ARYL
|
| |