Gene Dgeo_0687 details


Gene Information

Locus tag: Dgeo_0687
Symbol:
ID: 4058269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 744989
End bp: 746212
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 61%
IMG OID: 641229706
Product: extracellular solute-binding protein
Protein accession: YP_604158
Protein GI: 94984794
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
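
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 744989
end_bp = 746212
gene_length_bp = 1224
protein_length_aa = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```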


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.376534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.15795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGG TCCTGATCAC GGCGGCCCTG CTGACCCTCG GCAGCGCCAG CGCGCAAAAA 
ACGCAGATGG AGTTTTGGAC GATCGCTCTC GCGCCCCTGT TCAACGATGA GATGAACCGG
CTGGTGGCGC AGTTCGAGAA GGAAAACCCG AACGTGGAAC TGAAGTGGGT GGATGTACCC
GCCTCGGCCA TCGAGCAGAA GCTGCTGGCC GCCATCGCCT CGGGCCGTCC GCCCGCCGCC
GTCAACCTGT CGTCCGACAT GGTGGTGAAG ATGGTGGACC AGGGAGCGCT GGAACCGCTG
ACGCTGACTG ACGCGCAGAA GAAGGTGTAC TTCCCGTCGC CCCTGAATAC CTTTACCTTC
GACGGCAAGG TGATGGGCGT GCCCTGGTAC TGGTCACCCA AGGTGGTGGC CTACAACACC
GAGATTTTCC GCAAGGCGGG CCTGGATCCC AATAACCCGC CGCGCACCAT TCAGACGTTG
ATCGCCGCGG CCAAGCAAAT CAAGGACAAG ACCGGCCTCT ACGGCTTCAT GCCGAACATC
AACAACCTGA ACATGCTGTA CCTGTTCCAG GAGGCAGGCC TGCCCGTCCT CAAGGGGGGC
CGTGCGGTCT TCAATAGCCC CGAACACGTC AAGCTGCTCC AGACCTATGT GGACCTCTAC
AAGCAGGGCT ATATCCCGGA AGACACCATG CGCCGGGGCT TCACCGCGGC AACTGAGCTG
TACTCGGCGG GCAAGCTGGG CATGCTGATC ACCGGCCCAC AGTTCATCCT GCGCGTGGCG
AACGACAACC GGGACATCTA CAACGTGACC AAGGTTGCGC CGTACCCGAT CAATCTGGCG
GGAAACGTGA TCCACACCGG GCTCATGGGC TTCGTGGTGC CCAAGGGCGT AAAGGACAAG
GCGCTCGCGC AGAAGCTGGC CCTGTTCCTC ACGAACGACG TGAACCAGCT CCAGTTTAGC
CGAGTCACCA AGACGACTTT CCCCAGCACC GTAAAGGCCA GCACCGACAA GTTCTTCAAG
CAGGGCGGCC AGAATGCCAT CGACCAGGGG AGGCTGGTCG CCAGCACAGA GTTGAAGAAG
GCCAAGGACC TCACGCTGGT CTACCCCGAC GCCAGCCGGC TGAACAAGGT CTTCAAGGAC
AACATCGAGG CTGCGATGGC CGGGCAGAAG AGCGCCAAGC AGGCGCTGGA CGACATTGTG
AAGGCGTGGA ACGCGAGCTT ATAA
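
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal Python sketch under that assumption. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed GC value describes that row and not the 61% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGAAGAAGG TCCTGATCAC GGCGGCCCTG CTGACCCTCG GCAGCGCCAG CGCGCAAAAA"

seq = clean_sequence(first_row)
print(len(seq), "bases in the sample row")
print(f"GC fraction of the sample row: {gc_fraction(seq):.3f}")
```

Pasting the full listing into `clean_sequence` in place of `first_row` would allow the result to be compared with the Gene Length and GC content fields.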
 
Protein sequence
MKKVLITAAL LTLGSASAQK TQMEFWTIAL APLFNDEMNR LVAQFEKENP NVELKWVDVP 
ASAIEQKLLA AIASGRPPAA VNLSSDMVVK MVDQGALEPL TLTDAQKKVY FPSPLNTFTF
DGKVMGVPWY WSPKVVAYNT EIFRKAGLDP NNPPRTIQTL IAAAKQIKDK TGLYGFMPNI
NNLNMLYLFQ EAGLPVLKGG RAVFNSPEHV KLLQTYVDLY KQGYIPEDTM RRGFTAATEL
YSAGKLGMLI TGPQFILRVA NDNRDIYNVT KVAPYPINLA GNVIHTGLMG FVVPKGVKDK
ALAQKLALFL TNDVNQLQFS RVTKTTFPST VKASTDKFFK QGGQNAIDQG RLVASTELKK
AKDLTLVYPD ASRLNKVFKD NIEAAMAGQK SAKQALDDIV KAWNASL