Gene Dgeo_0591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0591 
Symbol 
ID4058602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp629514 
End bp630779 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID641229605 
Productextracellular solute-binding protein 
Protein accessionYP_604062 
Protein GI94984698 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0610101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CTCTCGCTCT CGCCAGTCTG ACCGCCGCCT TTGCACTCAG CGCTGCCCAC 
GCGCAGGGCG TGACCCTCAC CTTTGCCTGT GACAGCGTGG GGCAGGGGTA CGACGAGTGC
AAGAAGGGCG CGGACGCCTG GGCCAAGCAG ACGGGCAACA CCGTGAAGCT GGTGCAGGTG
CCCAAGGAGA CGGACCAGCG CCTGGCACTC TACCAGCAGC AACTGGGAGC CAAGTCCGGT
GACGTGGACG TTTATATGAT TGACGTGGTG TGGCCTGGCC TGATCGGCCA GCACTTGGTC
GACCTCAAGC AGTACATCCC GCAGAGTCAG ATTGCACAAC ACTTCCCGGC CATCATTCAG
AACAACACCG TCAACGGCAA GCTGGTGGGG ATGCCCTTCT TCACGGACGC CGGAGTGCTG
TACTACCGCA CCGACCTGCT GAAGAAGTAC GGCTACACCC GCCCGCCCAA GACTTGGAAC
GAACTCGCTA CGATGGCGCA GAAGATCCAG GCGGGCGAGC GCAAGACCAA CCCCAAGTTC
GTGGGCTTTG TCTTCCAGGG CAAAAACTAT GAGGGCCTGA CCTGCGACGC GCTGGAGTGG
ATCAATTCCT TCGGCGGCGG GACCATCGTG GATCCCAGCG GCAAGATCAC GGTGAATAAT
CCCAAGGCGG TGGCGGCCCT GCGCGCGATC CAAGCGATGA TTGGCCCGGT GGCCCCTAGC
GCCGTCACCA CCTATGGTGA GGAAGAAGCG CGCAACGTGT GGCAGGCTGG CAACTCGGCG
TTTATGCGCA ACTGGCCCTA CGCCTACGCG CTGGCAGAAG CGCCCGACAG CCCGATCAAG
GGCAAAGTTG GGGTAGCTGC GCTGCCTGCT GGCCCTGGCG GTAAGCCCGC TGCGACCCTG
GGCGGCTGGC AGCTTGCCGT CAACGCCTAC AGCAAGCACC CCAAGGAAGC CGCCAGCTTG
GTGCAGTACC TGACCAGCGC CCAGGAGCAG AAGCGCCGCG CCATCCAGGC AAGCTATAAC
CCCACCATCG CTTCGCTCTA CAAGGATCCG CAGGTCCTCA AGGCCGTGCC CTTCTTTGGT
AGCCTGTACG AGGTATTCAC AAACGCCGTG GCGCGCCCCG CCACCGTGAC GGGCGGCAAG
TACAACGAGG TGAGCAACGC CTTCAGCACG TCCGTGTACA ACGTGTTGAC GGGCAAGAGT
GCGCCCGACG CGGCCCTCAA GTCGCTCGAA AGCCAGCTCG CGCGCATCAA GGGCCGCGGC
TGGTAA
 
Protein sequence
MKKALALASL TAAFALSAAH AQGVTLTFAC DSVGQGYDEC KKGADAWAKQ TGNTVKLVQV 
PKETDQRLAL YQQQLGAKSG DVDVYMIDVV WPGLIGQHLV DLKQYIPQSQ IAQHFPAIIQ
NNTVNGKLVG MPFFTDAGVL YYRTDLLKKY GYTRPPKTWN ELATMAQKIQ AGERKTNPKF
VGFVFQGKNY EGLTCDALEW INSFGGGTIV DPSGKITVNN PKAVAALRAI QAMIGPVAPS
AVTTYGEEEA RNVWQAGNSA FMRNWPYAYA LAEAPDSPIK GKVGVAALPA GPGGKPAATL
GGWQLAVNAY SKHPKEAASL VQYLTSAQEQ KRRAIQASYN PTIASLYKDP QVLKAVPFFG
SLYEVFTNAV ARPATVTGGK YNEVSNAFST SVYNVLTGKS APDAALKSLE SQLARIKGRG
W