Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2001 |
Symbol | |
ID | 5712996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2120658 |
End bp | 2121902 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641267925 |
Product | putative xylose repressor |
Protein accession | YP_001533341 |
Protein GI | 159044547 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.244297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.176601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGAGG ATGCCGCGAC GCGCGACACT CCCGGATGCG GGCCCATGCT GCCCGATTCG GGCCGCAATG CAAAACCCCT GCGTCAGGCG GTATTCGAGC ATGTGCGCGC CGCCGGGCAC GCGCCGCGAA TGGACATCGC CCGCGCCCTC GGCATCTCGC CCGGTTCGGT CACCACGCTG ACCTCGGACC TGATCGAGGC GGGGTTTCTC ACCGAAATCG CCGCCCCCGC CCGCGAGACC GGGCGCGGTC GCCCGCCCGT GGCCCTCGCC GTGGTGCCCG CGGCCCGCTA CGTGCTTGGC CTGCGCCTGT CGGACGAGAT GCACACGGTC AGCCTGTCGG ATTTTTCCGG CACCGAACTG GCCACCGCCC ACCGCGCGAG CCAGCCGGGG CGCTATGCGG TCGAGGCGCT GCTGACCGAG ATGGCCACCC TGATCGACGA GGTGTTGGCG GCAGCCGCCC TGCCCCGCGA CCGGGTGGCG GCGCTCGGCG TCGGTCTGCC GGGGGCCGTC CATCACGAAA CCGGCCGCGT CGCCTGGTCG CCGATCCTCG CCGGGCAGGA TCATGCCCTC CAGGCGATCA TCGAAGACCG CTTCGGCCTG CCCGCGCATC TGGAGAATGA CGCCAATGTC CTGACGCTGG CCGAGCTGTG GTTCGGTGCG GGCCGCGCGA TGCAGGACTT CGCCGTGGTC ACGATCGAAC AAGGGGTCGG CATGGGGCTG GTGCTGAACA ACCGGTTGTT TCGCGGCGCA CAGGGGCTCG GGCTGGAGCT GGGGCACACC AAGGTGCAGC TCGACGGGGC GCTCTGCCGC TGTGGGCAGC GCGGCTGCCT GGAGGCGTAT CTGGCCGACT ACGCGCTGGT GCGCGAGGCC TCCACCGCGC TCGACCGCGA CCCCCGCTCG GCCCAGACCG CCGCCGCCAT GCTGGAGAGC CTGTTCGATC AGGCCAAGGC CGGCAACGGC GCGGCCAAGG CGATCTTTCA GCGCGCCGGG CGCTTCCTGT CGCTGGGACT GGCCAATGTG GTGCAGCTTT TCGATCCGGA ACTCATCATT CTGAGCGGCG CGCGGATGCG CTACGACTAC CTTTATGCCG AAGAGGTGCT CGCCGAGATG CAACGCATGA CCCTGCACCC CGCCACCCCG CGCAGCCGGG TCGAGATCCA CGCCTGGGGC GACCAGGTCT GGGCGCGCGG GGCGACGGCG CTGGCGCTGT CGGCGGTCAC GGACGCGCTC ATGGGGGAGA GATGA
|
Protein sequence | MPEDAATRDT PGCGPMLPDS GRNAKPLRQA VFEHVRAAGH APRMDIARAL GISPGSVTTL TSDLIEAGFL TEIAAPARET GRGRPPVALA VVPAARYVLG LRLSDEMHTV SLSDFSGTEL ATAHRASQPG RYAVEALLTE MATLIDEVLA AAALPRDRVA ALGVGLPGAV HHETGRVAWS PILAGQDHAL QAIIEDRFGL PAHLENDANV LTLAELWFGA GRAMQDFAVV TIEQGVGMGL VLNNRLFRGA QGLGLELGHT KVQLDGALCR CGQRGCLEAY LADYALVREA STALDRDPRS AQTAAAMLES LFDQAKAGNG AAKAIFQRAG RFLSLGLANV VQLFDPELII LSGARMRYDY LYAEEVLAEM QRMTLHPATP RSRVEIHAWG DQVWARGATA LALSAVTDAL MGER
|
| |