Gene Dgeo_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2304 
Symbol 
ID4059251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2422606 
End bp2423793 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content73% 
IMG OID641231352 
Productmajor facilitator transporter 
Protein accessionYP_605765 
Protein GI94986401 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTGGA CCCGGCCCCT GGTGATGCTG CGCTTGCTGG CCCTGCTGCT CACAAGCGAA 
CTGGTCCGCA CCGGCTTTTT CGTCTCTGCG CTGCCGGTGG CGGGGCCAGG GCTGGGGCTG
GGGACGGCGG TGATCGGCCT GATGGTCGGG GCCCACTACC TCGCGGACGC GCTGGCCAAG
GGGCCGATGG GCCTGGTGAC CGAGCGCTGG GGGCTGGGGC GGGTGCTGGC CTTGGGGTCG
GCGCTGGGGC TGACGGCAGT GTTGGGCGCG CGCCTGGCTC CGTCGCCCGC TTGGGGCGTG
CTGGGCTGTG CCCTCTGGGG CGTGGCCTAC GCGGCGCTGT GGCCGGGCGT GATGAACACC
TCGCAGGTGC TGGCGCGGCC CGGTCACATC GCGCGGGCAC TGACCCTCTC CAGCCTGAGT
GTGGCGCCCG CCATCCTGGG CGGCGTGCTG GGGGTGGGGC CGCTGATGCA GGCACACCCC
GGGGCAGCCT GGGCGCTGCT GGCCGGGGTG CAGGGCGCAG CGCTGCTGTT GGCGCTCAGT
CTTGTGAGGT TGCACCTGCC CGGCACGGGG GTGCCAAGCG GAAGCGTGTG GCAGGGTTGG
GCACGGGTGG CGGTGCTGCT GCCTGCTGCC TTCGCGCAGA CACTGGCACC GGGGCTCCTC
GTCACCCTGT TCTACCCGCT GCTCTCCAGG CTGGGGCTGG GGCTGGGCGA CCTGATCGGG
CCGGGGCTGC TGGCGCTGGC CGCCTTCGGG GTGTGCCTGT GGGGGGCGGG GAGGCTGGCC
GACCAGGCCC ACCCGCGCCA CGCCCTCACG CCGGGGCTGC TGCTGCTGGC CCTCACCTTC
GCCGCAGCGA CACTGCCGGG GTTGGAGGGG CGGCTGTGGT TCCTCGCGCC GCTGCTGGGG
CTGAGTTACG GAGCTTTCAG TGCCGGGTGG AACGGGCTGG TGGGCCGGGT GTTGCCCAGC
GGCCACCGGG CCGCCGCGTG GGGCACCGTG ATGGCGGTCG AGTCGCTGGG CTACGCCGTC
GGTCCGCTGC TGGGTGGCCT TGCCTGGGCA CAGGCGGGAC CGGCGGGCGT CTTCACGCTG
GGGGCGGCGG TGTTCCTGCT GACAGAAGGT TATTACCTGC TGCCGGGGCG CTCGCTGACG
CGCCTGGCAC CACAGGAGAA CAAGCCGTCC GACCAGCCCA CCGGCTAA
 
Protein sequence
MLWTRPLVML RLLALLLTSE LVRTGFFVSA LPVAGPGLGL GTAVIGLMVG AHYLADALAK 
GPMGLVTERW GLGRVLALGS ALGLTAVLGA RLAPSPAWGV LGCALWGVAY AALWPGVMNT
SQVLARPGHI ARALTLSSLS VAPAILGGVL GVGPLMQAHP GAAWALLAGV QGAALLLALS
LVRLHLPGTG VPSGSVWQGW ARVAVLLPAA FAQTLAPGLL VTLFYPLLSR LGLGLGDLIG
PGLLALAAFG VCLWGAGRLA DQAHPRHALT PGLLLLALTF AAATLPGLEG RLWFLAPLLG
LSYGAFSAGW NGLVGRVLPS GHRAAAWGTV MAVESLGYAV GPLLGGLAWA QAGPAGVFTL
GAAVFLLTEG YYLLPGRSLT RLAPQENKPS DQPTG