Gene Dgeo_2870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2870 
Symbol 
ID4074099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp151794 
End bp153137 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content61% 
IMG OID641228610 
Productextracellular solute-binding protein 
Protein accessionYP_594373 
Protein GI94972333 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAC TCTCCCTGCT CAGCCTTGCT CTTTGTGCGA CTTCACTGGG GGTCAGTGCC 
CAGGCGGCCA CAACCATCAC CATTGCGACG GTAAATAACC CCGACATGGT GACCATGCAA
AAACTCACCC CAGAATTCAC AAAGAAGTAC CCTGACATCA ATGTGAAGTG GGTTGTTCTC
CCAGAGAATG AGCTGCGCCA GAAGGTAACG CTGGATGTCG CCAGCAACGC CGGGGGTTTC
GATGTGGCCA CCGTTGGGAC GTATGAGGTG CCGATCTGGG CCAAGAACGG CTGGTTGGAT
CCGCTCAACC CGATGTTCAA CAAGGACACC GCCATTGCCA AAAGCTACGA CCTGACCGAT
GTGCTGGAGC CGGTGCGCAA GGGCCTGTCC TACAATGGCC AGCTCTACGC GCTGCCCTTC
TACGCCGAGT CGAGCATGAC CTACTACAAC AAGGATCTGT TCAAGAAGGC GGGGCTGACC
ATGCCCGCAC AGCCCACCTG GCGGCAGATC GAGCAGTTTG CGGCCAAGAT TCATAACCCC
GCGCAGGGCG TGTACGGGAT CTGCCTGCGT GGCCTGCCGG GCTGGGGCGA GAACATGGCG
CTCTTCACCA CGATCGTGAA CACGTTCGGC GGGCGCTGGT ACGACCAGAA CTGGAATGCT
CAGCTCAACA CGCCTGCCTG GAAGAACGCG ATGACCTTTT ACGTCAATCT GCTGAAGAAG
TATGGTCCTC CCGGTGCCAC CAGCAACGGC TTTACCGAAA ACCTCACTCT GATGAGTCAG
GGCAAATGTG GGATGTGGGT GGACGCGACG GTGGCGGCAG GCTTCCTCAA AGACCCTGCC
TCCAGCAAGA TCGTGAACTC GGTGGGCTTT GCCAACGCCC CGGTCGGCCC CGGCACGCCG
CGCGGCAGCA ACTGGTTGTG GTCGTGGAGC CTGGCGATTC CCAAGAGCAC CAAAAAAGAG
GACGCGGCCT TTAAGTTCAT CACCTGGGCG ACCTCCAAGG ACTACATCGC GCTTGTCGCC
AAAGAGAAAG GCACCTGGGC AGCTGTGCCC CCCGGCACCC GCGCAAGCAC CTACGCCAAC
CCCAATTACA AGAAGGCCGC CGGGGACTTT GCCAGCCTGG TGCTGAACTC GATCCGGCGC
GCCGATCCCA CCCGCCCCAC CAAAGACCCA GTGCCCTACA CCGGTATTCA GTTCGTGGGG
ATCCCGCAGT TCCAGGCGCT CGGCACCCAG GTCGGGCAGT ACCTGGCGGG CGCGCTGAGC
GGCCAGACCA GCATTGATCA GGCGCTCAAG CAGGCGCAGG ACGCTGCCGC GCGGGTGGCC
AAGGAAGGCG GTTACCAGAA GTAA
 
Protein sequence
MKRLSLLSLA LCATSLGVSA QAATTITIAT VNNPDMVTMQ KLTPEFTKKY PDINVKWVVL 
PENELRQKVT LDVASNAGGF DVATVGTYEV PIWAKNGWLD PLNPMFNKDT AIAKSYDLTD
VLEPVRKGLS YNGQLYALPF YAESSMTYYN KDLFKKAGLT MPAQPTWRQI EQFAAKIHNP
AQGVYGICLR GLPGWGENMA LFTTIVNTFG GRWYDQNWNA QLNTPAWKNA MTFYVNLLKK
YGPPGATSNG FTENLTLMSQ GKCGMWVDAT VAAGFLKDPA SSKIVNSVGF ANAPVGPGTP
RGSNWLWSWS LAIPKSTKKE DAAFKFITWA TSKDYIALVA KEKGTWAAVP PGTRASTYAN
PNYKKAAGDF ASLVLNSIRR ADPTRPTKDP VPYTGIQFVG IPQFQALGTQ VGQYLAGALS
GQTSIDQALK QAQDAAARVA KEGGYQK