Gene Dgeo_3106 details

Gene Information

Locus tag: Dgeo_3106
Symbol:
ID: 5687581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_009939
Strand:
Start bp: 196050
End bp: 197276
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 61%
IMG OID: 641262569
Product: ABC transporter, substrate binding periplasmic component
Protein accession: YP_001527843
Protein GI: 158421616
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
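
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 196050
end_bp = 197276
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1227 409 408
```

Under these assumptions the span matches the listed gene length, and 409 codons minus one stop codon gives the listed 408 residues.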


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCACC GTACCCTGTC CGTCGCCCTG ACCTTGTTCC TGGTCCTTGG CAGCAGTGCT 
GAGGCCGCTG ACCTGCGCTT CAGCACTTGG GCCGGCGGCG AGGGTCTGGC CCTCTTGCAG
CAACTTGCCA AGGAGTACAC TGCCAAGACG GGTACGAACG TCAAAGTCGA GGTCACGCCC
TTCGCGGACT ACAGCCGCAA GCTCTCCGTG CAGATTGCCT CGGGTGACGC CCCGGACATC
GGCTGGGTGG CTGAGCGGGA CGTGCCGACC TTCCTCGCCT CGAACAATCT CGCCAACCTC
AGCGCTTTAA GCAAGGACGC CTCGTTCAAT CTGAACGACT TCCCAACCTC CTCGTTGGCC
CTCTGGAAGC AGGGCGGCAA TCTATATGGC ATTCCCTTTT CAAATTCACC GCTGGTGCTC
TTTTACAACA AGGATCTCTT TAAGCAAGCT GGGGTTGCAG ACCCAATGAC CCAGTACGCC
AAGGGGCAGT GGAGCTACAA CGACTTCCAA AAGAGTGCGC TCGCCATCAA ACAGAAAACC
GGCAGCTACG GTGCACGCGT GATGCGCCTC GACCCCAAGG CGTGGGCGGG TGGCTTGCTG
GCCGTTCTGT GGTCCCAGGG GGGCGGGGTA TACGACAAAA ATATGAAGTG TAACCTCAAC
GCTCCCGGGA GCCTGCAAGC CTTCAGCCTC ATGCAGAACA TGATGTTCAA AGACCAGTCG
ATGCCGCGCC CCGGCGACCA GACCAGCTTC GACGGCGGAA GGCTAGGCAT GTACTTCGAC
AACATCAGCT ACGCCGGGCA ACTTAAGGAC GCCAAGTTCA AGTGGGGCAT CGCGCCGCTA
CCGAAGGGGA GCGCGGGCCG GATCACCCAG CTCGGGCAGG CTGGATACGC TGTCTTCAGT
AAGGGGCGGA ATCAGGCGGA GGCCGTCAAT TTTCTGAAGT TCATCGCCTC TAAGGAGAAT
ATGGCCCGCA CCGCCAAGTT CTTCCCGCCG CCCCGCCAGT CAGTTCTCAG GAGCAGCGCC
TACTTGAACG CCAACCCTGC AATTCCTGCC AGCGCCCTCA AGACCGCCCT TATCAGCCAG
CTCGGCAGCG CCCGTGTGCT GCAAACCGAC ACCCACTGGC TCAAGGCGAA CGACGCGATC
ACGGGCAGCC TCGACCAGGT ATTCCAGCCT GGCACCAACA CGAAAGCCAT CCTGGACCGT
ACCTGCCAGA CGGTGGACGG CCTGTAG
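
A GC content figure such as the one in the record can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips whitespace and reports the percentage of G and C bases. The demonstration uses only the first line of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Remove the spaces and line breaks used for 10-base grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a demonstration.
first_line = "ATGCGTCACC GTACCCTGTC CGTCGCCCTG ACCTTGTTCC TGGTCCTTGG CAGCAGTGCT"
print(f"{gc_content(first_line):.1f}%")
```

Passing the full gene sequence as a single string would give a value comparable to the GC content field.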
 
Protein sequence
MRHRTLSVAL TLFLVLGSSA EAADLRFSTW AGGEGLALLQ QLAKEYTAKT GTNVKVEVTP 
FADYSRKLSV QIASGDAPDI GWVAERDVPT FLASNNLANL SALSKDASFN LNDFPTSSLA
LWKQGGNLYG IPFSNSPLVL FYNKDLFKQA GVADPMTQYA KGQWSYNDFQ KSALAIKQKT
GSYGARVMRL DPKAWAGGLL AVLWSQGGGV YDKNMKCNLN APGSLQAFSL MQNMMFKDQS
MPRPGDQTSF DGGRLGMYFD NISYAGQLKD AKFKWGIAPL PKGSAGRITQ LGQAGYAVFS
KGRNQAEAVN FLKFIASKEN MARTAKFFPP PRQSVLRSSA YLNANPAIPA SALKTALISQ
LGSARVLQTD THWLKANDAI TGSLDQVFQP GTNTKAILDR TCQTVDGL
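
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons, which this sketch does not handle), and `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop 10-base grouping whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTCACC GTACCCTGTC CGTCGCCCTG"))  # MRHRTLSVAL
```

The output matches the first ten residues of the protein sequence. The last line of the gene sequence is in frame, so it can be checked the same way against the final residues.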