Gene Hlac_3294 details

Gene Information

Locus tag: Hlac_3294
Symbol:
ID: 7402440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 34809
End bp: 35897
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 53%
IMG OID: 643709851
Product: hypothetical protein
Protein accession: YP_002567417
Protein GI: 222481181
COG category: [R] General function prediction only
COG ID: [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
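
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 34809
END_BP = 35897
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 35897 - 34809 + 1 gives 1089 bp, and 1089 / 3 - 1 gives 362 aa, matching the listed lengths.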


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGGAG GAAGCGGTGG ATATGAGCGA CCGTCTGATA GCGGCGGATC GGGAGGAGAA 
TCTAGTGAAT CGGATAGTGT ACCCCCACCC GAGTCAGACA CTGAAGAAGA GACAGACGAA
GATCAGTCTG ATGAGTCAAA TGAAACAAGT GACGGAACGA CTCCAGAGAG CCCACCAGTT
ACTGGTGGCG GGGGCGGTAG CGGCGCACCA GAATCTGGTT CTGGGGACGG TTCTAGCGGT
GCATCTGGAG AGGATGATTC AGAGCCGGAG CGAGACGAAC CATCACCAGA TGAGGGCACT
GAAGATGGAG AACCGGAGGA ACAAGAAGAC AACAGCGGGC ATAATGACGA AGAGTCAGAG
AAGCAGAACC CGGCAGATTC AGGTCCCGAA GACGACTCGG ATGATGCGGA CTCTAATAAC
GCTGAGCAGG AGGGCAGTCA GGAGAACCAT GAGGAGGATC AAGAAGACCA AGAACACGAG
AGTGACAATG AGGATGCCGA AGATGATGAC GAGGATCGGG AGGAAGACGA AGATGATGAC
GAGGACGATG ATGAAGAAGA CGAGTGCCTG ATTGCAGAAT CAGCTCTTCT CCATTCACCG
AACCCAGAAC CGTTAGAAGA TGTAGACGAG GGTGATGTCT GTTCAGTACG CCTTCGAGAG
GAAGCGATCT GTATTGTAGA TTCACTAGGC AGAACTATCG GTGCCATCGC TGAACCGTGG
GTTGGTACAC TGAAGGAGTG TATCGAGCAG GGCCGACAAT ATCGTGCTCG GGTTCTCAAC
ATCGACGGAG GGAAATGCGA AGTTCGAGTA ACCAACAAGT GCCTCGTTAA CCAGGACGTC
AATCTGACCG CGACCAATAC TGCAGTACGG GACCAACTTC ATCCGGAACT TTCCCTATCA
GTCGAAAAAA CGACCGAAGA AGTAGTTGTC CTCACGGATG ACGGAGCTAG AGTCGGTGAC
GTTCCTGACC CATGGGCTCG TCTTCTCAAC GAGTGTATCG ACCAAGGACG GTCATACCAG
GCAGAGGTTC GTGAGGTTAC ACCGGAGTAT TGCAGAGTCA ATATTCAGAC GGGTGCTAGT
GACGAATGA
 
Protein sequence
MGGGSGGYER PSDSGGSGGE SSESDSVPPP ESDTEEETDE DQSDESNETS DGTTPESPPV 
TGGGGGSGAP ESGSGDGSSG ASGEDDSEPE RDEPSPDEGT EDGEPEEQED NSGHNDEESE
KQNPADSGPE DDSDDADSNN AEQEGSQENH EEDQEDQEHE SDNEDAEDDD EDREEDEDDD
EDDDEEDECL IAESALLHSP NPEPLEDVDE GDVCSVRLRE EAICIVDSLG RTIGAIAEPW
VGTLKECIEQ GRQYRARVLN IDGGKCEVRV TNKCLVNQDV NLTATNTAVR DQLHPELSLS
VEKTTEEVVV LTDDGARVGD VPDPWARLLN ECIDQGRSYQ AEVREVTPEY CRVNIQTGAS
DE
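
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is the amino-acid assignment shared by NCBI translation tables 1 and 11 (the two tables differ only in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGGCGGAG GAAGCGGTGG ATATGAGCGA"))  # MGGGSGGYER
# Final nine bases of the gene sequence, including the TGA stop codon.
print(translate("GACGAATGA"))  # DE
```

Passing the full gene sequence to `translate` and `gc_content` allows the listed protein sequence and GC content fields to be checked in the same way.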