Gene Hlac_2712 details

Gene Information

Locus tag: Hlac_2712
Symbol:
ID: 7401323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2703484
End bp: 2704509
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 643709787
Product: arsenite-activated ATPase ArsA
Protein accession: YP_002567353
Protein GI: 222481116
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
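
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2703484
end_bp = 2704509
reported_gene_length = 1026      # bp
reported_protein_length = 341    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1026 341
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is an exact multiple of three.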


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGGT TCGTCTTCTT CGGCGGGAAG GGTGGCGTCG GCAAGACCAC CGTCTCCTGT 
GCGTACGCCT CTCGCTGTGC GAACGACGGG GTGCGGACGC TGGTCGTCTC GACGGACCCC
GCACACTCGG TGTCGGACGT GTTCGACCAG TCGTTCGGCG ATGAGCCGGC GCCCGTCGAC
GGGATCGAGG GACTCGACGC GATGGAGATC GACCCCGAAG ACGAGATGCA GCGGCACCTC
CAGGAGATCC GCGAGGCGCT CTCCGAGCAG GTGTCGGCGG CGATGGTCTC GGAGATCAAC
CGCCAACTGG AGATGTCGCA CGGCACGCCG GGCGCGTACG AGGCTGCGCT CTTCGACGCG
TTCGTGAGCG TGATGCGCGA GGAGGGTGAG TCGTACGATC GGATCGTCTT CGACACCGCG
CCGACCGGGT CGACGCTGCG GCTCTTGGGG CTCCCCGAGT TCCTCGGCGA CTGGATCGAC
CGGCTGCTGT ACAAGCGCAA GCAGTCGATC GACCTGTTCG AGAAGGCCGC TATCGGCGAC
ATGGAACCCC GGCGGTTGAT GGACGGCGAC CCCGTCTTAG AGCGGCTCCA GCGCCGCAAG
GAGTTCTTCG AGTTCGCGGG CGACACCATG CGAGACGAGG CCGCCTTCTT CCTCGTGTTG
AACCCCGACC AGCTCTCGGT CAACGAGACA GGACGGGCGA TCGAGGGGTT CGCCGAGCGC
GACTTGCGCG TCCGTGGGCT CGTCGCGAAC AAGCTTACCC CGGAGCCCGA CGACGACGAG
GAGGGACGCG GAGCCACCTA CCTCCGCGAG AAGGTCGCGA CCGAGCGCGA CCGGCTCCGG
CAGGTCCGAG AGGAGTTCGA GCCCCCGCTC GTCGCCGAGA TTGAGTCGCG GACGCGGGAA
GTCCGCGGCG ACGTGCTCGC GGAGGTGGCG GCCGCGCTCG ACATCGAGAC GGCGAGCGAC
GTGAGCGGGG AGGACGACGA CCGCACCCGA AGTGACGACG GTGGCCCCGT CCGCGCCGAT
CGGTAA
 
Protein sequence
MERFVFFGGK GGVGKTTVSC AYASRCANDG VRTLVVSTDP AHSVSDVFDQ SFGDEPAPVD 
GIEGLDAMEI DPEDEMQRHL QEIREALSEQ VSAAMVSEIN RQLEMSHGTP GAYEAALFDA
FVSVMREEGE SYDRIVFDTA PTGSTLRLLG LPEFLGDWID RLLYKRKQSI DLFEKAAIGD
MEPRRLMDGD PVLERLQRRK EFFEFAGDTM RDEAAFFLVL NPDQLSVNET GRAIEGFAER
DLRVRGLVAN KLTPEPDDDE EGRGATYLRE KVATERDRLR QVREEFEPPL VAEIESRTRE
VRGDVLAEVA AALDIETASD VSGEDDDRTR SDDGGPVRAD R
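
The relationship between the gene sequence and the protein sequence can be illustrated by translating part of the gene. The sketch below is not part of the original record; it translates only the first 60-base row and the final 6 bases of the gene sequence shown above. It builds the codon table from the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The function and variable names are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()   # drop the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3     # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGAACGGT TCGTCTTCTT CGGCGGGAAG GGTGGCGTCG GCAAGACCAC CGTCTCCTGT"
last_row = "CGGTAA"

print(translate(first_row))  # MERFVFFGGKGGVGKTTVSC
print(translate(last_row))   # R*
```

The first row yields the first 20 residues of the protein sequence, and the last row yields the final arginine followed by the stop codon.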