Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2000 |
Symbol | |
ID | 5712995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2119423 |
End bp | 2120445 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267924 |
Product | D-xylose-binding periplasmic protein xylF |
Protein accession | YP_001533340 |
Protein GI | 159044546 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.121993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAG CAATTCTCGC GGCCGCCATC GTCGCGGCCG GTGTCACCAC ATCGGCCTAT GCCGATGTCA CGGTCGGTGT CAGCTGGTCG AATTTTCAGG AAGAGCGTTG GAAGACCGAC GAGGCCGCCA TCAAGGCCGC GCTCGAAGCC GCCGGCGCCA CTTATGTTTC GGCGGACGCG CAGTCGTCCT CGGCCAAGCA GCTGTCGGAT GTGGAGAGCC TGATTGCGCA GGGTGTCGAT GCCCTGATTA TTCTGGCCCA AGACAGCCAG GCCATCGGCC CAGCCGTGCA GGCCGCGGCC GACGAGGGGA TCCCGGTGGT TGGCTATGAC CGCCTGATCG AGGATCCGCG GGCCTTCTAC CTGACCTTCG ACAACGTGGA AGTGGGCCGG ATGCAGGCCC GCGCCGTGCT GGAGCAGGCC CCCGAGGGCA ATTACGTGAT GATCAAGGGC TCGCCCACGG ACCCGAACGC GGACTTCCTG CGCGGCGGGC AGCAGGAGAT CCTGCAGGAT GCCATTGACG CAGGCAAGAT CACCATCGTG GGCGAGGCCT ATACCGATGG CTGGCTGCCG GCGAACGCCC AGCGGAACAT GGAGCAGATC CTGACCGCCC AGGACAACCA GGTGGACGCG GTCGTGGCCT CCAACGACGG GACCGCGGGT GGCGTGGTCG CGGCCCTGAC CGCCCAGGGC ATGGAAGGGA TCCCGGTCTC GGGCCAGGAC GGTGATCATG CCGCGCTGAA CCGGGTGGCC AAGGGCACCC AGACCGTGTC CGTGTGGAAG GACGCGCGGG ATCTGGGCCG GGCCGCGGGT GAGATCGCCG TGGCCATGGC GAACGGCACC GCGATGGCGG ATATCGAGGG TGCGACCTCC TGGACCTCCC CCGGGGGGAC GGAGTTGACC GCCCGGTTCC TGGCGCCGGT GCCGGTGACC GCCGACAACC TTACCGCGGT GGTCGATGCC CAGTGGATCA CGCAAGAGAC CCTGTGCCAG GGCGTGACCG ACGGTCCGGC GCCCTGCAAC TGA
|
Protein sequence | MRKAILAAAI VAAGVTTSAY ADVTVGVSWS NFQEERWKTD EAAIKAALEA AGATYVSADA QSSSAKQLSD VESLIAQGVD ALIILAQDSQ AIGPAVQAAA DEGIPVVGYD RLIEDPRAFY LTFDNVEVGR MQARAVLEQA PEGNYVMIKG SPTDPNADFL RGGQQEILQD AIDAGKITIV GEAYTDGWLP ANAQRNMEQI LTAQDNQVDA VVASNDGTAG GVVAALTAQG MEGIPVSGQD GDHAALNRVA KGTQTVSVWK DARDLGRAAG EIAVAMANGT AMADIEGATS WTSPGGTELT ARFLAPVPVT ADNLTAVVDA QWITQETLCQ GVTDGPAPCN
|
| |