Gene Dshi_2000 details


Gene Information

Locus tag: Dshi_2000
Symbol:
ID: 5712995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2119423
End bp: 2120445
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 68%
IMG OID: 641267924
Product: D-xylose-binding periplasmic protein xylF
Protein accession: YP_001533340
Protein GI: 159044546
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4213] ABC-type xylose transport system, periplasmic component
TIGRFAM ID: [TIGR02634] D-xylose ABC transporter, substrate-binding protein
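
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2119423
END_BP = 2120445
GENE_LENGTH_BP = 1023
PROTEIN_LENGTH_AA = 340


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2120445 - 2119423 + 1 gives 1023 bp, and 1023 / 3 - 1 gives 340 aa, matching the listed values.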


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.121993
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAG CAATTCTCGC GGCCGCCATC GTCGCGGCCG GTGTCACCAC ATCGGCCTAT 
GCCGATGTCA CGGTCGGTGT CAGCTGGTCG AATTTTCAGG AAGAGCGTTG GAAGACCGAC
GAGGCCGCCA TCAAGGCCGC GCTCGAAGCC GCCGGCGCCA CTTATGTTTC GGCGGACGCG
CAGTCGTCCT CGGCCAAGCA GCTGTCGGAT GTGGAGAGCC TGATTGCGCA GGGTGTCGAT
GCCCTGATTA TTCTGGCCCA AGACAGCCAG GCCATCGGCC CAGCCGTGCA GGCCGCGGCC
GACGAGGGGA TCCCGGTGGT TGGCTATGAC CGCCTGATCG AGGATCCGCG GGCCTTCTAC
CTGACCTTCG ACAACGTGGA AGTGGGCCGG ATGCAGGCCC GCGCCGTGCT GGAGCAGGCC
CCCGAGGGCA ATTACGTGAT GATCAAGGGC TCGCCCACGG ACCCGAACGC GGACTTCCTG
CGCGGCGGGC AGCAGGAGAT CCTGCAGGAT GCCATTGACG CAGGCAAGAT CACCATCGTG
GGCGAGGCCT ATACCGATGG CTGGCTGCCG GCGAACGCCC AGCGGAACAT GGAGCAGATC
CTGACCGCCC AGGACAACCA GGTGGACGCG GTCGTGGCCT CCAACGACGG GACCGCGGGT
GGCGTGGTCG CGGCCCTGAC CGCCCAGGGC ATGGAAGGGA TCCCGGTCTC GGGCCAGGAC
GGTGATCATG CCGCGCTGAA CCGGGTGGCC AAGGGCACCC AGACCGTGTC CGTGTGGAAG
GACGCGCGGG ATCTGGGCCG GGCCGCGGGT GAGATCGCCG TGGCCATGGC GAACGGCACC
GCGATGGCGG ATATCGAGGG TGCGACCTCC TGGACCTCCC CCGGGGGGAC GGAGTTGACC
GCCCGGTTCC TGGCGCCGGT GCCGGTGACC GCCGACAACC TTACCGCGGT GGTCGATGCC
CAGTGGATCA CGCAAGAGAC CCTGTGCCAG GGCGTGACCG ACGGTCCGGC GCCCTGCAAC
TGA
 
Protein sequence
MRKAILAAAI VAAGVTTSAY ADVTVGVSWS NFQEERWKTD EAAIKAALEA AGATYVSADA 
QSSSAKQLSD VESLIAQGVD ALIILAQDSQ AIGPAVQAAA DEGIPVVGYD RLIEDPRAFY
LTFDNVEVGR MQARAVLEQA PEGNYVMIKG SPTDPNADFL RGGQQEILQD AIDAGKITIV
GEAYTDGWLP ANAQRNMEQI LTAQDNQVDA VVASNDGTAG GVVAALTAQG MEGIPVSGQD
GDHAALNRVA KGTQTVSVWK DARDLGRAAG EIAVAMANGT AMADIEGATS WTSPGGTELT
ARFLAPVPVT ADNLTAVVDA QWITQETLCQ GVTDGPAPCN
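
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, as a small example.
first_line = "ATGCGTAAAG CAATTCTCGC GGCCGCCATC GTCGCGGCCG GTGTCACCAC ATCGGCCTAT"
dna = clean_sequence(first_line)

print(len(dna))                     # six blocks of ten bases
print(dna.startswith("ATG"))        # the CDS begins with a start codon
print(round(gc_fraction(dna), 2))   # GC fraction of this line only
```

Applying the same two helpers to the full gene sequence would allow the listed gene length and GC content to be checked against the sequence itself.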