Gene Dgeo_1500 details


Gene Information

Locus tag: Dgeo_1500
Symbol:
ID: 4057386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1587019
End bp: 1588200
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 62%
IMG OID: 641230518
Product: extracellular solute-binding protein
Protein accession: YP_604964
Protein GI: 94985600
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG CACTGACTGT TCTGTCTCTC GCCCTGCTGG GGAATGCCAG CGCCGCCACC 
ATCACTGTCT GGACACATTT TGGCGGGCCC GAGCAGGCGT GGCTCAAGGA TCAGGCGCAA
GCCTTCGAAA AGAAGACTGG GAACCGAGTG CAGCTCGTCA ATGTGCCCTT CGAGCAGATT
CCCGACAAGT TCATTCAGAG CGCGCCCAAG GGCCAGGGGC CGGACCTGCT GGTCACGCAG
CCGCAGGACC GCATCGGGCA GTTTGCGGCA GCGGGCGTGA TCGAGCCGAT GGACAAGTAC
CTGGTCAGCC GCAGCGACCT CGACAAGACG GCGCTGGGCG CCATGACTTA CAAGGGCAAG
CTGTTCGGCC TGCCGATGTT CGCCGAGGCG GTGGGCGTGG TCTACAACAA GAAGCTGGTG
CCTAACCCGC CTACCACCTG GGCTGAATTC CTGAAGGTGG CGCAGGCCAA CACCGGGAGC
GGCAAGTTCG GCTACCTGGA AGACCTCAGC GAGGCCTACC AGAACTACGG CGTGATCAGT
GCGTACGGCG GCTACGTCTT CAAGAACAAT GGCGGCACCC TCAATGTCAA GGACGTGGGC
CTGAACAACG CTGGGGCAGT CAAGGCGAGC AGCTTCTTGA ACGACCTGCG TTACAAGTAC
AACCTAGTGC CCGAAGGGGT TACCAGCGAC GTGGCCAAGA GTGCCTTCCT GGACGGGCGT
CTCGCCATGT TCCTGACCGG GCCCTGGAAC ATGGGCGATA TCAAGAAGGC AGGCATCAGC
TACGGCATCA TGCCTTTCCC CACGCCTCCC GGCGCGAGCG GCAAGTGGAG CCCCTTCGTG
GGGGTGCAGG GCATCATGCT GAGCGCGTAC AGCAAGAACA AGGCCGCCGC GGCGCAGTTT
GCCAAGCAGC TTGTGACCAG CGACGCGCAA GTCGGCTTCA ACAAGGCGGG CGGGCGCATC
CCGGTCAGCC TGAGCGCGCG GACCAAGCTC AAGAATGATC CAGTGGTTGC GGGCTTCGGT
AAGACCATCA GCATGGGCAC CCCGATGCCC AACGTGCCCG AGATGAGCGC AGTGTGGGGC
CCCTGGACGA ACGCCATCGC CCAGAGCGTG CAGAAGCCGG GCGCCGACTA CAAGCAGATC
CTCGACAAGG CCGTCGCGGA AATCAACAGC AACATCAAGT AA
 
Protein sequence
MKKALTVLSL ALLGNASAAT ITVWTHFGGP EQAWLKDQAQ AFEKKTGNRV QLVNVPFEQI 
PDKFIQSAPK GQGPDLLVTQ PQDRIGQFAA AGVIEPMDKY LVSRSDLDKT ALGAMTYKGK
LFGLPMFAEA VGVVYNKKLV PNPPTTWAEF LKVAQANTGS GKFGYLEDLS EAYQNYGVIS
AYGGYVFKNN GGTLNVKDVG LNNAGAVKAS SFLNDLRYKY NLVPEGVTSD VAKSAFLDGR
LAMFLTGPWN MGDIKKAGIS YGIMPFPTPP GASGKWSPFV GVQGIMLSAY SKNKAAAAQF
AKQLVTSDAQ VGFNKAGGRI PVSLSARTKL KNDPVVAGFG KTISMGTPMP NVPEMSAVWG
PWTNAIAQSV QKPGADYKQI LDKAVAEINS NIK
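
The length fields in this record agree with one another. The coordinates 1587019 to 1588200 span 1182 bp, which is 394 codons: 393 amino acids plus the terminal TAA stop codon. The following is a minimal Python sketch of that check, together with a GC-fraction helper that could be applied to the gene sequence above. The function names are illustrative assumptions and do not come from the source database, and the sketch assumes an unspliced CDS whose stated length includes the stop codon.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Coordinates as listed in the record (inclusive on both ends).
start_bp, end_bp = 1587019, 1588200
gene_length = end_bp - start_bp + 1
print(gene_length)                           # 1182
print(expected_protein_length(gene_length))  # 393

# First line of the gene sequence from this record.
first_block = "ATGAAGAAAG CACTGACTGT TCTGTCTCTC GCCCTGCTGG GGAATGCCAG CGCCGCCACC"
print(len(clean_sequence(first_block)))      # 60
```

Passing the full gene sequence to `gc_content` should give a value to compare against the 62% listed in the record.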