Gene Dshi_1822 details

Gene Information

Locus tag: Dshi_1822
Symbol:
ID: 5712813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1900181
End bp: 1901206
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 641267745
Product: putative arsenic resistance protein
Protein accession: YP_001533165
Protein GI: 159044371
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
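
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the CDS ends in a single stop codon. A minimal sketch of that arithmetic follows. The helper names gene_length and protein_length are hypothetical, and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
# Hypothetical helpers; the numeric values are copied from the record above.
# Assumption: coordinates are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1900181, 1901206)
print(length_bp)                  # 1026
print(protein_length(length_bp))  # 341
```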


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.932603
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGGT TCGAGCGGTA TCTGTCGGTC TGGGTGGCGC TGGCCATCGG CGCGGGGCTG 
GTTTTGGGCA GCGTGGTGCC GGGGGTGTTC GCGGGGCTTG CGGCGCTGGA GGTGGCGTCG
GTCAACCTGC CCGTGGCGGT GCTGATCTGG GCGATGGTGT TCCCGATGAT GGTGGGCGTG
GATTTCGGCG CGCTCGGGGG CGTGGCGACC CGGCCCAAGG GGCTGGTGAT CACGCTGGTG
GTGAACTGGC TGGTCAAGCC GTTCACCATG GCCGCGTTGG GCGTGCTGTT TTTCAACCAC
GTCTTCGCGC CCTTCATCCC GCCGGAGGAT GCTCAGATGT ACCTGGCGGG CGTGATCCTG
CTGGGGGCCG CGCCCTGCAC GGCGATGGTG TTCGTGTGGT CGAACATGAC CAAGGGCGAT
CCGAATTACA CGCTGGTGCA GGTGAGCGTG AACGACGTGG TGCTGGTCTT TGCCTTCGCG
CCCATCGTGG CGCTCTTGCT GGGGGTGACG GATATCGTGG TGCCGTGGGA GACGCTGCTG
TTGTCCGTGG CGCTTTACGT GGTGATCCCG CTGGCCGTGG GCGTGGTGGT GCGCAAACGG
CTGCTGGACG GCGGAGGGGC GGAGGCGCTG GCCGCGTTCC AGGCGCGGGT AAAGCCCGCC
TCGGTAGCGG GGCTTTTGCT GACCGTGGTG CTCCTGTTCG GGTTCCAGGG GCGCGTGATC
CTGGAGACGC CTTTGGTGAT CGCGATGATC GCCGTGCCGC TGCTGATCCA GAGCTACGGG
GTGTTCTTCG TGGCCTATTA CGCCGCCAAG GCCTGGCGCG TGCCGCACGA GGTCGCCGCC
CCCTGCGCCC TGATCGGGAC GTCGAATTTC TTCGAGCTGG CCGTGGCCGT GGCGATCAGC
GTCTTCGGCC TCGGCTCCGG CGCGGCGCTG GCCACGGTGG TCGGCGTGCT GGTGGAGGTG
CCGGTGATGC TGTCGCTGGT GGCGTTTGCC AACCGGACGC GGGGGTGGTT TCCTAAGTCC
CCGTGA
 
Protein sequence
MGRFERYLSV WVALAIGAGL VLGSVVPGVF AGLAALEVAS VNLPVAVLIW AMVFPMMVGV 
DFGALGGVAT RPKGLVITLV VNWLVKPFTM AALGVLFFNH VFAPFIPPED AQMYLAGVIL
LGAAPCTAMV FVWSNMTKGD PNYTLVQVSV NDVVLVFAFA PIVALLLGVT DIVVPWETLL
LSVALYVVIP LAVGVVVRKR LLDGGGAEAL AAFQARVKPA SVAGLLLTVV LLFGFQGRVI
LETPLVIAMI AVPLLIQSYG VFFVAYYAAK AWRVPHEVAA PCALIGTSNF FELAVAVAIS
VFGLGSGAAL ATVVGVLVEV PVMLSLVAFA NRTRGWFPKS P
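
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that layout, compute a GC fraction, and translate codons. It uses only the first and last rows of the gene sequence as sample input. The function names (clean, gc_fraction, translate) are hypothetical. The codon table is the standard one, which as far as amino acid assignments go matches translation table 11; table 11 differs in its permitted start codons, which this sketch does not model.

```python
# Hypothetical helpers for the blocked sequence layout used on this page.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons enumerated in TCAG order, paired with the amino acid string above.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the page.
first_row = clean(
    "ATGGGGCGGT TCGAGCGGTA TCTGTCGGTC TGGGTGGCGC TGGCCATCGG CGCGGGGCTG"
)
last_row = clean("CCGTGA")

print(translate(first_row))  # MGRFERYLSVWVALAIGAGL
print(translate(last_row))   # P
print(round(gc_fraction(first_row), 3))
```

The first printed peptide corresponds to the first two blocks of the protein sequence, and the final row yields the terminal P followed by the TGA stop. Running clean over the full gene sequence before calling gc_fraction would give the whole-gene value to compare with the GC content field.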