Gene Hlac_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2202 
Symbol 
ID7401137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2186157 
End bp2187296 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID643709274 
Producthypothetical protein 
Protein accessionYP_002566849 
Protein GI222480612 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.161577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC GATCGCGGAC GACAGACGAC GACCGCTCAC AGGACACTTC CACCGACGGC 
TTCCGGCGAC GGGAGTTCGT GGCGCTCGGC GCGGGCGTGA GCGCGACGAT GCTCGCGGGC
TGCGCCGGAG ACGGGGGGGC CACCTCGTCC GACGGGTCCG ACGGGTCGGA CGGATCGGAG
ACGCTTACCG GCAACTTCAG ACTGCTCATT AGCGACGCGC CGGCCGACAT CGGCGACTTC
GACCAACTGA ACGTCACTCT CGACGAGGCT CGGATCTTCG AGGCGAATGA GGGAGGAGAC
GACGACGAAG AGGCGGACGA CGACGAAGAG GCGGACGACG AGGAGGAGAC CGATGAAGAG
GAGGAAGACG ACGCGGATGC GGACGACGAA GACGACGAGA GCAATCAGAC CGGCAACGAG
ACCGAAGAGG ATGACCCCAC GAACGGAACC GCGGACGAAG AAGACGATGC GGACGTGGAG
GACGAGGACG ACGCTGGCGA CGACGACGAG GAAGCGGACG ACGACGACGA GGAAGCGGAC
GACGACGACG AGTCCGACCG CGGCTTCACC GTCGTCGAAC TCGACGGTGC GACGGTCGAT
CTCACACAGG TGATCGAGGA CGACGCGATC GCCGTGTTCG ACGGCGAGAT CTCGGCGGGA
AGCTACGAGA AGATCGAGCT CTCCGTCACC GACATCGAGG GGATCGTCGA CGGCGAGGAG
GTCGACGTGA AGCTCCCGAG CGAGAAGCTC CAGATCACGA ACGACTTCGA GGTCACGCCC
GACGAGCCCG TCAGCTTCGT CTTCGACATC AACGTCGTCA AGCGTGGTCC GAACAACGGC
TACATCCTCC AGCCCGTGAT CTCCGGGAGC GGGGTTGCCG GTCGAGATAT CGATGTGAAC
GAAATCGACG AGGACGGTGA CGACGGAGAT GATGAAGATG GCGACGAGGG CGACGACGAC
AACGAAGACG ACGACGACAG CGAAGACGAC GACGACAGCG ACGACAGCGA CGACGACAGC
GACGACAGCG ACGACGACAG CGGCGAGAGC GACGGATCGA CCACCGGCGG AAGCGAGACC
GACGACGGGT CGAGCGGCAC CGAAAACGAA ACGGCCACTG GGAACGTAAG CGAGAGCTGA
 
Protein sequence
MTDRSRTTDD DRSQDTSTDG FRRREFVALG AGVSATMLAG CAGDGGATSS DGSDGSDGSE 
TLTGNFRLLI SDAPADIGDF DQLNVTLDEA RIFEANEGGD DDEEADDDEE ADDEEETDEE
EEDDADADDE DDESNQTGNE TEEDDPTNGT ADEEDDADVE DEDDAGDDDE EADDDDEEAD
DDDESDRGFT VVELDGATVD LTQVIEDDAI AVFDGEISAG SYEKIELSVT DIEGIVDGEE
VDVKLPSEKL QITNDFEVTP DEPVSFVFDI NVVKRGPNNG YILQPVISGS GVAGRDIDVN
EIDEDGDDGD DEDGDEGDDD NEDDDDSEDD DDSDDSDDDS DDSDDDSGES DGSTTGGSET
DDGSSGTENE TATGNVSES