Gene Hoch_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2834 
Symbol 
ID8545222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3885987 
End bp3887024 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID646387523 
Productarsenical-resistance protein 
Protein accessionYP_003267251 
Protein GI262196042 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.823873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.919185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCT TTGAACGCTT TCTCTCCCTC TGGGTCGCGC TGGCCATCGC CGCCGGCGTC 
GGCCTCGGGC TCGTCGCTCC CGGGCTCTTC GAGGTCGTCT CCCGCTTCGA GTGGGCGCGC
GTCAACCTCG TCGTCGCGGT GCTCATCTGG CTGATGATCT ACCCGATGAT GCTCAAGGTG
GAGCCGTCGT GCCTCAAGGA CGTCGGCAAG AAGCCCAAGG GGCTCGCGCT CACCCTGGTC
GTCAACTGGC TGATCAAGCC CTTCACAATG GCTGCCCTGG GCGTGCTCTT CTTCCAGCAC
GTCTTCGCAG GCCTCGTCCC CGCCGAGGAC GCCCAGCAGT ACATCGCCGG CATGATCTTG
CTCGGCGTCG CGCCGTGCAC CGCGATGGTC TTCGTCTGGA GCCACCTGAC CGACGGCGAC
GCCAACTACA CGCTGGTCCA GGTCTCGGTG AACGACATCA TCCTCGTCTT CGCCTTCGCC
CCCATCGCCG GGCTGCTGCT GGGCGTCACC GACCTCACGG TCCCCTGGGA GACGCTGCTC
GCCTCGGTGG TGATCTTCGT CGTCATCCCG CTGGGCGCAG GGATGCTGAC CCACAAGCAA
CTGATGAAGA CCGGCGGCGC CGAGGCCATC GAGCGGCTCT CGAGCAAGCT CAAGCCGACC
TCCATCGTCG GGCTGCTGCT GACCGTCGTG CTTCTCTTCG GCTTCCAGGC CGAGACCATC
GTCGACCAGC CCGGCCGCGT GGTGCTCATC GCCATCCCGC TGCTCATCCA GAGCTACGGC
ATCTTTGCCA TCGCCTACGG GCTCGCTCGC GTGCTCAAGC TGCCGTTCAA CGTGGCGGCG
CCGGCCGCGA TGATCGGCAC GTCCAACTTC TTTGAGCTGG CCGTCGCCGT CGCGATCAGC
CTCTTCGGCC TCGCCTCGGG CGCGGCGCTC GCCACCGTGG TGGGTGTGCT CATCGAGGTG
CCGGTGATGC TCTCGCTCGT CGCCTTCGCC AACCGCACCA AGGGCTGGTT CCCGGCCCCA
TCATCTGCTT CCGCCTGA
 
Protein sequence
MGIFERFLSL WVALAIAAGV GLGLVAPGLF EVVSRFEWAR VNLVVAVLIW LMIYPMMLKV 
EPSCLKDVGK KPKGLALTLV VNWLIKPFTM AALGVLFFQH VFAGLVPAED AQQYIAGMIL
LGVAPCTAMV FVWSHLTDGD ANYTLVQVSV NDIILVFAFA PIAGLLLGVT DLTVPWETLL
ASVVIFVVIP LGAGMLTHKQ LMKTGGAEAI ERLSSKLKPT SIVGLLLTVV LLFGFQAETI
VDQPGRVVLI AIPLLIQSYG IFAIAYGLAR VLKLPFNVAA PAAMIGTSNF FELAVAVAIS
LFGLASGAAL ATVVGVLIEV PVMLSLVAFA NRTKGWFPAP SSASA