Gene Dshi_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2001 
Symbol 
ID5712996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2120658 
End bp2121902 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content71% 
IMG OID641267925 
Productputative xylose repressor 
Protein accessionYP_001533341 
Protein GI159044547 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.244297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.176601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAGG ATGCCGCGAC GCGCGACACT CCCGGATGCG GGCCCATGCT GCCCGATTCG 
GGCCGCAATG CAAAACCCCT GCGTCAGGCG GTATTCGAGC ATGTGCGCGC CGCCGGGCAC
GCGCCGCGAA TGGACATCGC CCGCGCCCTC GGCATCTCGC CCGGTTCGGT CACCACGCTG
ACCTCGGACC TGATCGAGGC GGGGTTTCTC ACCGAAATCG CCGCCCCCGC CCGCGAGACC
GGGCGCGGTC GCCCGCCCGT GGCCCTCGCC GTGGTGCCCG CGGCCCGCTA CGTGCTTGGC
CTGCGCCTGT CGGACGAGAT GCACACGGTC AGCCTGTCGG ATTTTTCCGG CACCGAACTG
GCCACCGCCC ACCGCGCGAG CCAGCCGGGG CGCTATGCGG TCGAGGCGCT GCTGACCGAG
ATGGCCACCC TGATCGACGA GGTGTTGGCG GCAGCCGCCC TGCCCCGCGA CCGGGTGGCG
GCGCTCGGCG TCGGTCTGCC GGGGGCCGTC CATCACGAAA CCGGCCGCGT CGCCTGGTCG
CCGATCCTCG CCGGGCAGGA TCATGCCCTC CAGGCGATCA TCGAAGACCG CTTCGGCCTG
CCCGCGCATC TGGAGAATGA CGCCAATGTC CTGACGCTGG CCGAGCTGTG GTTCGGTGCG
GGCCGCGCGA TGCAGGACTT CGCCGTGGTC ACGATCGAAC AAGGGGTCGG CATGGGGCTG
GTGCTGAACA ACCGGTTGTT TCGCGGCGCA CAGGGGCTCG GGCTGGAGCT GGGGCACACC
AAGGTGCAGC TCGACGGGGC GCTCTGCCGC TGTGGGCAGC GCGGCTGCCT GGAGGCGTAT
CTGGCCGACT ACGCGCTGGT GCGCGAGGCC TCCACCGCGC TCGACCGCGA CCCCCGCTCG
GCCCAGACCG CCGCCGCCAT GCTGGAGAGC CTGTTCGATC AGGCCAAGGC CGGCAACGGC
GCGGCCAAGG CGATCTTTCA GCGCGCCGGG CGCTTCCTGT CGCTGGGACT GGCCAATGTG
GTGCAGCTTT TCGATCCGGA ACTCATCATT CTGAGCGGCG CGCGGATGCG CTACGACTAC
CTTTATGCCG AAGAGGTGCT CGCCGAGATG CAACGCATGA CCCTGCACCC CGCCACCCCG
CGCAGCCGGG TCGAGATCCA CGCCTGGGGC GACCAGGTCT GGGCGCGCGG GGCGACGGCG
CTGGCGCTGT CGGCGGTCAC GGACGCGCTC ATGGGGGAGA GATGA
 
Protein sequence
MPEDAATRDT PGCGPMLPDS GRNAKPLRQA VFEHVRAAGH APRMDIARAL GISPGSVTTL 
TSDLIEAGFL TEIAAPARET GRGRPPVALA VVPAARYVLG LRLSDEMHTV SLSDFSGTEL
ATAHRASQPG RYAVEALLTE MATLIDEVLA AAALPRDRVA ALGVGLPGAV HHETGRVAWS
PILAGQDHAL QAIIEDRFGL PAHLENDANV LTLAELWFGA GRAMQDFAVV TIEQGVGMGL
VLNNRLFRGA QGLGLELGHT KVQLDGALCR CGQRGCLEAY LADYALVREA STALDRDPRS
AQTAAAMLES LFDQAKAGNG AAKAIFQRAG RFLSLGLANV VQLFDPELII LSGARMRYDY
LYAEEVLAEM QRMTLHPATP RSRVEIHAWG DQVWARGATA LALSAVTDAL MGER