Gene Dgeo_2641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2641 
Symbol 
ID4073872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp401172 
End bp402671 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content66% 
IMG OID641228835 
Productmajor facilitator transporter 
Protein accessionYP_594148 
Protein GI94972108 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCTT CCAGTGCCGC CCTGTCGCAT GTTGACTCGT TCGCCGCGTC CCTCAAAGCA 
CAGCTCGCTT CCCATCGCGC GGACAGCCTC CGTCGGCCCA CCGAGATTCT CTTCGCGATT
CTGCAGGTGT GCGCACGGTT GCTTGTCACC TTGGCCGTAA CCCCGCAAAC CACCGCGCAG
CTGGGGTCTA CACTCCAAAG GTGTTCGCTG TGTCTCGGCC TTCCGGAGGT CAAACCCGTG
TCTGATGCGG CTCTGTCCTC TTCTACCTCG AGCCGTGCGG TGCTGCGCCT CCCCGAATTC
CGGGCTATGC TGCTGGCGAC CGTCTGCAGC ACCCTGGCCG GACGCGCCGT GGCGCTCACC
GTGGCCTATC AGCTCTATCA GCTCACCAAG AATCCGCTCA CCCTGGGCAT CTTGGGCCTG
GTGGAGGCCA TCCCAGCGCT GAGTCTTGCG CTGCTCGGGG GCGTGGTAGC CGACCGCAAC
GACCGTCGCC GTATTCTGCT GCTGACCACC AGCGTAGAAG TGATCTGCGC GCTGCTGTTT
TTCCTGTATG CGCCGCATGC CTCAACCCTG GGCTACGCCC CGATTCTGGC CCTGATCTTC
CTGCTGGGGA TTGCCCGCGG CTTTTCCGAC CCAGCACTTC CCGCCTTTGA GGCCCAGGTT
GTGCCACGCG AGCTCTTGCT GCGTGCCTCA GCCTGGCAGT CGAGCGCGTG GCAGGCGGCG
GCCATCCTGG GGCCGGCTCT GGGAGGTGTG CTGTATGCAG CCGTCAGCGC GCGCGGCACT
TACCTCGTTG CAGCCGTCCT GTATGGCTTG GCCCTGGCTT GCCTCGCCTA TGTCAGGCCC
AAGCCGCGTC CGGCATTCAC CCCTGGCGAG CCGGTGTGGC AGAGCGTGAA GGAGGGCTTG
GCCTTTGTGA TGCAGCGGCA GGTGCTGGTG GGCAGTATGG CCCTGGACCT GTTCAGCGTG
CTGTTCGGCG GCGCGGTGGC CTTGCTTCCA GTCTTTGCCT CCGACATCCT GCGGGTGGGA
CCGCAGGGTC TAGGGGTGCT GGTCGCCGCA CCCAGCATTG GGGCCCTGGC CGTGATGCTG
GCAGCAACTC ACCGCCCCCC AGGACGCGGC GCAGGACGCA CGCTGCTGCT TGCCGTGGCG
GGCTTTGGGA TATGTATGGT GGTGTTCGGG CTGTCACGCA ACTTCTTCCT CAGTGTGGCG
GTACTGGTTG CAGCAGGTGT GTTCGACGGC ATCAGCATGG TGGTGCGCCG AGCAACACTG
CGGCTCAAGG CCCCCGACCA CATGCGCGGG CGGGTCAGCG CGGTCAGCAG CATGTTTATC
GGAGCGAGCA ACGAGCTGGG CGCCTTTGAG AGTGGCCTGG CCGCGAGCTG GCTGGGCACC
GCGCGCAGCG TGTGGCTGGG CGGGCTGGTC ACCCTGCTGG TGGTGGGTGT GACGGCCTAC
CTCGCGCCAG AACTGCGGGC GATGGATCTC ACCGACATCG CCAAGGACCG GTCAGGCTGA
 
Protein sequence
MSASSAALSH VDSFAASLKA QLASHRADSL RRPTEILFAI LQVCARLLVT LAVTPQTTAQ 
LGSTLQRCSL CLGLPEVKPV SDAALSSSTS SRAVLRLPEF RAMLLATVCS TLAGRAVALT
VAYQLYQLTK NPLTLGILGL VEAIPALSLA LLGGVVADRN DRRRILLLTT SVEVICALLF
FLYAPHASTL GYAPILALIF LLGIARGFSD PALPAFEAQV VPRELLLRAS AWQSSAWQAA
AILGPALGGV LYAAVSARGT YLVAAVLYGL ALACLAYVRP KPRPAFTPGE PVWQSVKEGL
AFVMQRQVLV GSMALDLFSV LFGGAVALLP VFASDILRVG PQGLGVLVAA PSIGALAVML
AATHRPPGRG AGRTLLLAVA GFGICMVVFG LSRNFFLSVA VLVAAGVFDG ISMVVRRATL
RLKAPDHMRG RVSAVSSMFI GASNELGAFE SGLAASWLGT ARSVWLGGLV TLLVVGVTAY
LAPELRAMDL TDIAKDRSG