Gene Hlac_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1224 
Symbol 
ID7399492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1233669 
End bp1235039 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content66% 
IMG OID643708289 
ProductAnion-transporting ATPase 
Protein accessionYP_002565887 
Protein GI222479650 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.916142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000394119 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCGAGCGT CGCCGCTCTC GACCGACCTG ATCCATCGTC CGATCGGCGA GTTCGCGGAA 
CGCCTGCGCT GGCTCCTCGC TCTTCTACAG CGACCGCGAG CCGTCGAGCG CGACGAGGAC
GGCGTTCGCG CGCGGCGTGG CTCGGTACAT CCGCCACTTC CCCTCCTTCC GGCGCGTAAC
GAGACCGGCG TCGGTGAGCT TCGAGAGGGC GTGGCTGATC GCACTGTCGC TCACGTCCAG
AAGCGGCGAG AGCTCGCAGA CGCACAATTC GTCGCCGTCG GCCGCATGAA GCATCCGAAC
GATCTTGTAC CGCGTCTCGT TCCCGAGCGC CGACAGCAAC TCGACGTCGG GACGGCGACC
GACGTTGATG GCGCGGGCCA CGCTGACGCC GACGGTGCGA CCGACTTCGA CGCGGTGGCG
GACGCCGACG CGGTCGTCGA CCAACTGACG CCGGGCGAGG AGACGCAGTA CCTTTTCTTC
ACCGGGAAAG GCGGTGTCGG GAAGAGCACG GTCGCCTCAA CGGCGGCGAC GAAGCTCGCC
GAAGCGGGCC ACGAAACGCT CGTCGTTACG ACCGATCCGG CCGCACACTT GGAGGACATC
TTCGGCGAGC CCGTGGGCCA CGAGCCGACT TCGGTCGGGC AGGACAACCT CGACGCGGCC
CGGATCGACC AGGAGAAGGC GCTCGCCGAG TACCGTGAGC AGGTCCTCGA CCACGTCACG
GAGATGTACG AGGAGAAGGA GAACACGCAG ATCGACGTCG ACGCTGCGAT CGCGAACGTT
GAAGAGGAAC TGGAGTCTCC CTGTGCCGAG GAGATGGCCG CCCTCGAGAA GTTCGTGAGC
TACTTCGACG AGGACGGCTA CGACGTGGTC GTCTTCGACA CGGCCCCGAC GGGGCACACC
CTTCGGCTGC TCGAACTCCC GTCCGACTGG AAGGGGTTCA TGGACCTCGG CTCGCTGACG
AAGGGTGCCG CGCCCGCGAA GGGCGACCAG TATGACGAGG TCATCGAGAC GATGAAAGAT
CCCAACCAAA GTACCTTCGC GTTCGTGATG TACCCCGAGT ACACCCCCAT GATGGAGGCG
TACCGGGCCG CCGCCGACCT CGAAGACCAA GTCGGCATCG AGACTTCGTT GGTCGTCGCC
AACTATCTCC TTCCCGAGGA GTACGGCAAC AACGCCTTCT TCGCGAATCG GCGCGCTCAG
CAGGCGAAGT ACCTCGACGA GATCCGCGAT CGGTTCGACG CGCCGCTCAT GTTGGCGCCA
CTCCGGCAAG ACGAGCCGAT CGGACTCGAC GAGCAGAGCG CATTCGGCGA GGAGATCACT
GGGCTGGCGG ACATCGCTGA GGCGGATGCG CCGGAGGTGA CTCCCTCATG A
 
Protein sequence
MRASPLSTDL IHRPIGEFAE RLRWLLALLQ RPRAVERDED GVRARRGSVH PPLPLLPARN 
ETGVGELREG VADRTVAHVQ KRRELADAQF VAVGRMKHPN DLVPRLVPER RQQLDVGTAT
DVDGAGHADA DGATDFDAVA DADAVVDQLT PGEETQYLFF TGKGGVGKST VASTAATKLA
EAGHETLVVT TDPAAHLEDI FGEPVGHEPT SVGQDNLDAA RIDQEKALAE YREQVLDHVT
EMYEEKENTQ IDVDAAIANV EEELESPCAE EMAALEKFVS YFDEDGYDVV VFDTAPTGHT
LRLLELPSDW KGFMDLGSLT KGAAPAKGDQ YDEVIETMKD PNQSTFAFVM YPEYTPMMEA
YRAAADLEDQ VGIETSLVVA NYLLPEEYGN NAFFANRRAQ QAKYLDEIRD RFDAPLMLAP
LRQDEPIGLD EQSAFGEEIT GLADIAEADA PEVTPS