Gene Htur_3445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3445 
Symbol 
ID8744065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3548911 
End bp3550071 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID646514026 
Productarsenical-resistance protein 
Protein accessionYP_003404980 
Protein GI284166701 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAACG CGACCCACGA CCACGGGCCG GACTGCAGCT GCGAGGCCTG TGGCGATCCG 
CGGTCGATGG ACTTCCTCGA CAAGTACCTG ACCGTCTGGA TCTTCGCCGC GATGGCCGTC
GGCGTCGGCC TCGGCTACGC GGCGCCGTCC GTGACCGAAC CGATTCGGGA CCTCCACCTC
GTGGAGATCG GGCTCGTCGC CATGATGTAC CCGCCGCTGG CGAAGGCGGA CTACGGGCGG
CTTCCGACGG TGTTTCGCAA CTGGCGCGTG CTCAGCCTGA GCCTCGTCCA GAACTGGCTC
ATCGGCCCGA CCCTGATGTT CGGGCTCGCG GTGTTCTTCT TCAGCGGACT CGTACCCGGC
CTCCCGGCCC GTCCCGAGTA CTTCCTGGGA CTCGTGTTCA TCGGGATGGC CCGGTGTATC
GCGATGGTGC TCGTCTGGAA CGAACTCGCG GAGGGATCGA CCGAGTACGT GACCGGACTG
GTCGCGTTCA ACAGCCTCTT CCAGATCGTT ACCTACGGCG TCTACGTCTG GTTTTTCGCC
CTGTTCTTGC CGCCGCTGCT GGGCATGGAG TCGCTCGCCG CCGAAATCAC GACGTTCAAC
GTGACGCCCG AACAGGTGTT CTGGGCGATC GTCGTCTTCC TCGGCATCCC CTTCGCCGGG
GGAATCCTCA CCCGATACGT CGGCACGCGA GCGAAGGGCG AGGCGTGGTA CGACGAGGAG
TTCGTCCCGA CGATCGACCC GCTCACGCTG GTCGCCCTAC TGTTTACCGT CGTCGTGATG
TTCGCCACGC AGGGCGAGAA CATCGTCGCC GCGCCCGCGG ACGTGTTGCT GATCGCCGTC
CCGCTGACGA TCTACTTCGT CGTCATGTTC CTCGTGAGCT TCGGCATGGG CCGAGGCGTC
GGCGCCGACT ACTCGACGAC GACGGCCATC GGCTTCACCG CGGCCTCGAA CAACTTCGAA
CTCGCGATCG CTGTCGCGGT CGCCGTCTTC GGCGTCGGCT CCGGCGTCGC CTTCACGACC
GTCGTCGGCC CGCTCATCGA GGTCCCCGTG TTGCTCGCGC TGGTCCACGT CGCGCTGTAC
TTCCAGCGGA AACTGGACTG GGGCGGCCGC GACGCCGGCG AACCGACCGT ATCGACTCGA
GAGACGCCCA CCGACGACTA A
 
Protein sequence
MRNATHDHGP DCSCEACGDP RSMDFLDKYL TVWIFAAMAV GVGLGYAAPS VTEPIRDLHL 
VEIGLVAMMY PPLAKADYGR LPTVFRNWRV LSLSLVQNWL IGPTLMFGLA VFFFSGLVPG
LPARPEYFLG LVFIGMARCI AMVLVWNELA EGSTEYVTGL VAFNSLFQIV TYGVYVWFFA
LFLPPLLGME SLAAEITTFN VTPEQVFWAI VVFLGIPFAG GILTRYVGTR AKGEAWYDEE
FVPTIDPLTL VALLFTVVVM FATQGENIVA APADVLLIAV PLTIYFVVMF LVSFGMGRGV
GADYSTTTAI GFTAASNNFE LAIAVAVAVF GVGSGVAFTT VVGPLIEVPV LLALVHVALY
FQRKLDWGGR DAGEPTVSTR ETPTDD