Gene Hlac_0661 details

Gene Information

Locus tag: Hlac_0661
Symbol:
ID: 7401796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 676742
End bp: 677920
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 643707727
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002565333
Protein GI: 222479096
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
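
The start, end, gene length, and protein length fields in this record agree with one another, and a short check makes the arithmetic explicit. This is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(676742, 677920)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Both values match the Gene Length and Protein Length fields of the record.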


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.626504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.149059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATA TCGACGTCGA GCCGGTGGAT CGCGTCGAGG AGCCGGAGAT CAACGAGACC 
GCTGACGCCG ACCCCGCAAG CGACGCTGAT CTCCCGACCG AAACGGACGC TGCCACGGAC
CTCCCGGCGG GCGTCGACGC CCCGGACTAC GTCCTCTACG GCGGGAAGGG CGGCGTCGGG
AAGACGACGA TGGCGGCCGC GACCGGACTC GCCTCGGCGG CGGGCGGGGT CAACACCTTG
GTGGTCTCCA CCGATCCGGC CCACTCCCTC TCCGATACCT ACGAGACGGA GATCCCGGCG
AAACCAGCGC GCATTCGCGA GGACATGCCG CTGTACGCCG CCGAGATCGA CCCCGACGAC
GCGATGGAGG AGGGGATGTT CGGCGCCGAC GGCGACCCCC TCGGCGGGAT GGGCGAGATG
GGGGACGCGA TGGGCGGAAT GATGGGCGGT GCGAGCGACC CGGACGGCCC CGCAGACGAC
GAGGCCGACG GCGGCCTCGG CTCCCTACTC GGCGGGACGA TGCCCGGCGC CGACGAGGCG
GCCGCGATGC GCCAACTGCT GGAGTACCTC GACGACCCGC GGTTCGACCG CGTGATCGTC
GACACCGCAC CGACGGGCCA CACCCTCCGG CTGCTCCAAC TCCCAGAGAT CATGGATTCG
ATGATCGGCC GGGTGATGAA ACTCCGCAAC CGATTCTCCG GGATGATGGA CGGGATCAAG
GGGATGTTTG GCGGCGGGGA CGACGACCCC GATCCCTCTG CCGACCTCGA CGAGCTCCGC
GAGCGGATCG AGCGCCTCCG GAGCGTGCTG CAGGATCCCG AAAAGACCGA CTTCCGCGTG
GTGACCATCC CCGAGGAGAT GAGCGTCACC GAGTCCGAAC GGCTCGTCGC GCGCCTCGAC
GAGTTCGGGA TTCCGGTGAA CACCCTCGTC GTCAACCGGG TGATGGAGGG CGTCGGCGAC
GTGACCGACG GGAGCGGGGC CGCGATCGAC CCCGAGTGGG TCGTCGAGCC GAACCCGGAC
TCCTGTGAGT TCTGTGCGCG CCGATGGGAG GTCCAGCAGG CGGCACTGCG TCGGGCCACG
GACCTGTTCC GCGGACGCGA CGTGAAGCGA GTCCCGCTGC TCGCGAAGGA AGTTCGCGGG
GAGGCCGCAC TGCGGGTCGT GGCTGCGTGC CTACGCTGA
 
Protein sequence
MDDIDVEPVD RVEEPEINET ADADPASDAD LPTETDAATD LPAGVDAPDY VLYGGKGGVG 
KTTMAAATGL ASAAGGVNTL VVSTDPAHSL SDTYETEIPA KPARIREDMP LYAAEIDPDD
AMEEGMFGAD GDPLGGMGEM GDAMGGMMGG ASDPDGPADD EADGGLGSLL GGTMPGADEA
AAMRQLLEYL DDPRFDRVIV DTAPTGHTLR LLQLPEIMDS MIGRVMKLRN RFSGMMDGIK
GMFGGGDDDP DPSADLDELR ERIERLRSVL QDPEKTDFRV VTIPEEMSVT ESERLVARLD
EFGIPVNTLV VNRVMEGVGD VTDGSGAAID PEWVVEPNPD SCEFCARRWE VQQAALRRAT
DLFRGRDVKR VPLLAKEVRG EAALRVVAAC LR
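
The GC content reported for this gene can be recomputed from the gene sequence listed above. The following is a minimal Python sketch, assuming the sequence is pasted as text in the same layout used here (blocks of 10 bases separated by spaces and line breaks). The helper name `gc_content` is hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
print(gc_content("ATGGACGATA TCGACGTCGA"))  # 50.0
```

Passing the complete gene sequence in place of the two-block example gives the figure to compare against the GC content field.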