Gene Dgeo_2547 details


Gene Information

Locus tag: Dgeo_2547
Symbol:
ID: 4073778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008010
Strand:
Start bp: 502551
End bp: 503852
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 67%
IMG OID: 641228928
Product: arsenical pump membrane protein
Protein accession: YP_594055
Protein GI: 94972015
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID: [TIGR00935] arsenical pump membrane protein
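
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 502551
END_BP = 503852


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1302, matching "Gene Length"
    print(protein_length(length))  # 433, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.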


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.545416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCTCG CCGTCCTGAT CTTCCTGTTC ACCCTCGTCC TCGTCATCTG GCAGCCGAAG 
CTCAGGTGGC AACCAGGGGG CCTGGGCATC GGCTGGAGTG CGTCACTCGG CGCGGTCCTC
GCCCTGCTCA CCGGGGTCGT CCACCTCGCG GACATTCCGG TGGTGTGGAA CATCGTGTGG
AACGCGACCA TCACCTTCGT TGCCCTCATC ATCATCAGCC TGATCCTCGA CGAGGCCGGG
TTCTTCAAGT GGTCTGCCCT GCACGTGGCC CGCTGGGGCC GCGGGCACGG CCACCTGCTC
TTTGCTCTGG TGATCCTGTT GGGTGCCGCC GTGAGTGCCC TGTTCGCCAA CGACGGCACG
GCGCTGATCC TCACGCCCAT CGTGCTCGCC ATGCTCACCG CGCTGGGCTT CCGGCCCGCC
ACCACCCTCG CGTTCATCCT CGCCACGGGG TTCATCGCCG ACAGTGCCAG CCTGCCGCTG
GTCATCAGCA ACCTGGTGAA CATTGTCAGC GCCGACTACT TCAACCTGGA CTTCGGACAG
TACGCCAGGG TGATGGTGCC GGTGGACCTC GCGGCGATCC TCGCCAGTCT CGGCGTGCTG
TACGTGATGT TCCGCCGCGA TCTGCCTGCG CGTTACGACC CAGGCACGCT GGGAACGCCT
GCCCAGGCCA TCCGTGACCC CAACGTCTTC CGGGTGGGCT GGATCGTCCT GGTGGTCCTG
CTGGTCGGGT ACTTCGCCGC CGGGCCGCTC GGGGTGCCCG TCAGCCTGGT CGCGGCGCTG
GGGGCAGGCC TCCTGTGGCT CGTCGCCGCT CGTGGGCACG TCGTGAGCAC CCGGAACGTC
CTCAGGGGCG CGCCCTGGCA GATCGTCATT TTCTCGCTGG GCATGTACCT GGTCGTGTAC
GGTCTGCGAA ACGCCGGGCT GACCGACTTG CTGGCGGGCG TCCTTGACCG ACTGGCTCAG
GGCGGACTTT GGAGCGCCAC CCTCGGCACC GGCTTTCTGA CCGCCTTCCT CGCCAGTGTG
ATGAACAACA TGCCCAGCGT CCTGATCGGC GCGCTCGCCA TCGACGCCAG CCAGGCCACC
GGAGCCGTCA AGCAGGGCAT GGTCTACGCG AATGTCGTCG GCAACGACCT GGGGCCGAAG
ATCACGCCCA TCGGGAGCCT CGCCACGCTG CTGTGGCTGC ACGTGCTGGC CAGCAAGGGG
ATCCGCATCG GGTGGGGCCA GTATTTCCGG GTCGGGATCG TCCTCACGCT GCCGGTGCTG
CTGGTCACGC TCGCGGCGCT CGCGCTGCGC CTGGGAGGCT GA
 
Protein sequence
MLLAVLIFLF TLVLVIWQPK LRWQPGGLGI GWSASLGAVL ALLTGVVHLA DIPVVWNIVW 
NATITFVALI IISLILDEAG FFKWSALHVA RWGRGHGHLL FALVILLGAA VSALFANDGT
ALILTPIVLA MLTALGFRPA TTLAFILATG FIADSASLPL VISNLVNIVS ADYFNLDFGQ
YARVMVPVDL AAILASLGVL YVMFRRDLPA RYDPGTLGTP AQAIRDPNVF RVGWIVLVVL
LVGYFAAGPL GVPVSLVAAL GAGLLWLVAA RGHVVSTRNV LRGAPWQIVI FSLGMYLVVY
GLRNAGLTDL LAGVLDRLAQ GGLWSATLGT GFLTAFLASV MNNMPSVLIG ALAIDASQAT
GAVKQGMVYA NVVGNDLGPK ITPIGSLATL LWLHVLASKG IRIGWGQYFR VGIVLTLPVL
LVTLAALALR LGG
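
The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such blocks might be cleaned up, translated, and summarised is given below. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is built from the standard NCBI ordering, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence above.
    head = clean("ATGCTGCTCG CCGTCCTGAT")
    print(translate(head))  # MLLAVL, the first six residues of the protein
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed above, and `gc_content` could be compared against the GC content field of the record.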