Gene Information |
Locus tag | Dgeo_0591 |
Symbol | |
ID | 4058602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 629514 |
End bp | 630779 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641229605 |
Product | extracellular solute-binding protein |
Protein accession | YP_604062 |
Protein GI | 94984698 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0610101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG CTCTCGCTCT CGCCAGTCTG ACCGCCGCCT TTGCACTCAG CGCTGCCCAC GCGCAGGGCG TGACCCTCAC CTTTGCCTGT GACAGCGTGG GGCAGGGGTA CGACGAGTGC AAGAAGGGCG CGGACGCCTG GGCCAAGCAG ACGGGCAACA CCGTGAAGCT GGTGCAGGTG CCCAAGGAGA CGGACCAGCG CCTGGCACTC TACCAGCAGC AACTGGGAGC CAAGTCCGGT GACGTGGACG TTTATATGAT TGACGTGGTG TGGCCTGGCC TGATCGGCCA GCACTTGGTC GACCTCAAGC AGTACATCCC GCAGAGTCAG ATTGCACAAC ACTTCCCGGC CATCATTCAG AACAACACCG TCAACGGCAA GCTGGTGGGG ATGCCCTTCT TCACGGACGC CGGAGTGCTG TACTACCGCA CCGACCTGCT GAAGAAGTAC GGCTACACCC GCCCGCCCAA GACTTGGAAC GAACTCGCTA CGATGGCGCA GAAGATCCAG GCGGGCGAGC GCAAGACCAA CCCCAAGTTC GTGGGCTTTG TCTTCCAGGG CAAAAACTAT GAGGGCCTGA CCTGCGACGC GCTGGAGTGG ATCAATTCCT TCGGCGGCGG GACCATCGTG GATCCCAGCG GCAAGATCAC GGTGAATAAT CCCAAGGCGG TGGCGGCCCT GCGCGCGATC CAAGCGATGA TTGGCCCGGT GGCCCCTAGC GCCGTCACCA CCTATGGTGA GGAAGAAGCG CGCAACGTGT GGCAGGCTGG CAACTCGGCG TTTATGCGCA ACTGGCCCTA CGCCTACGCG CTGGCAGAAG CGCCCGACAG CCCGATCAAG GGCAAAGTTG GGGTAGCTGC GCTGCCTGCT GGCCCTGGCG GTAAGCCCGC TGCGACCCTG GGCGGCTGGC AGCTTGCCGT CAACGCCTAC AGCAAGCACC CCAAGGAAGC CGCCAGCTTG GTGCAGTACC TGACCAGCGC CCAGGAGCAG AAGCGCCGCG CCATCCAGGC AAGCTATAAC CCCACCATCG CTTCGCTCTA CAAGGATCCG CAGGTCCTCA AGGCCGTGCC CTTCTTTGGT AGCCTGTACG AGGTATTCAC AAACGCCGTG GCGCGCCCCG CCACCGTGAC GGGCGGCAAG TACAACGAGG TGAGCAACGC CTTCAGCACG TCCGTGTACA ACGTGTTGAC GGGCAAGAGT GCGCCCGACG CGGCCCTCAA GTCGCTCGAA AGCCAGCTCG CGCGCATCAA GGGCCGCGGC TGGTAA
|
Protein sequence | MKKALALASL TAAFALSAAH AQGVTLTFAC DSVGQGYDEC KKGADAWAKQ TGNTVKLVQV PKETDQRLAL YQQQLGAKSG DVDVYMIDVV WPGLIGQHLV DLKQYIPQSQ IAQHFPAIIQ NNTVNGKLVG MPFFTDAGVL YYRTDLLKKY GYTRPPKTWN ELATMAQKIQ AGERKTNPKF VGFVFQGKNY EGLTCDALEW INSFGGGTIV DPSGKITVNN PKAVAALRAI QAMIGPVAPS AVTTYGEEEA RNVWQAGNSA FMRNWPYAYA LAEAPDSPIK GKVGVAALPA GPGGKPAATL GGWQLAVNAY SKHPKEAASL VQYLTSAQEQ KRRAIQASYN PTIASLYKDP QVLKAVPFFG SLYEVFTNAV ARPATVTGGK YNEVSNAFST SVYNVLTGKS APDAALKSLE SQLARIKGRG W
|
| |
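The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence above ends in TAA). The function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above.
gene_length = span_length(629514, 630779)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the coordinates give 1266 bp and 421 aa, which agree with the Gene Length and Protein Length fields. Passing the full gene sequence to gc_percent gives a value to compare with the reported GC content.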