Gene Hlac_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2801 
Symbol 
ID7398864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp63219 
End bp64577 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content49% 
IMG OID643706629 
Producthypothetical protein 
Protein accessionYP_002564255 
Protein GI222475734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGGG AACGTATCGG GCGAAAAACG CTTAAAGAGC AACTACCCGC AATTCTCCAC 
GAACAGTTTC AACTAGATAA TTTGCCGCAA GATCACGAAC CGTCGTGGGA GTATATCACC
GCGAATACAC GGTACTCCGC CCAAGGACTG AACAACAAGT CCAAAGAGCT GTACGGACAG
ACCATTCTTG AATTTCTTCG AGAACAGGGA TTTGGCGTGC GTAGTACCGG AAAATGGCCG
ACAGACGATG AAGAAACCAT CCGTTCGCTG GAGTACTATA TTGAGAGCGC CGAAGAGCGG
AAAGAGTGGA GTGAGAACAC CATTGATTCC GTTGAGTCGG TGATGAACAA GGTGTACGAA
GCGATTCGAG ACGAAGGACT CGACATCGAG ATGCTGGATA TCGGCTACTA TGACTCTGAA
AAGAACCGTG TGGAGAATAT CCAACACGCT ATCACAATCA TCGAATACAT GGATCGTGAC
CTGGCCGACA GCACGATGGG AAACTACCCT CGATATTTTG AGGAATATTA CAACATCGTG
AAAAACAAGC ACCAGATCAA TATTAATCCA GTCGAAGAAG CACTCGATGA ATTCGAATGG
TACCGGAGTG ATAGTGACGC ACAGCCAGTC ACTGAAGCAC AACTAAACGA TCTGTGGAAC
GCGTTAGATG TCCTTGACGA GTGTCCTGTG GACGGTCACG ATTTAGAGCG GTGGCGGTTA
TGGATGAAAA TGCTGCTCAT CTTCTTGATC GCCGTTGGTC CCCGGTCAAA TGAAGTCGAA
CAACTTGATT TGCGGACACA ACTTCATTTT GGTGATGACC CGCATGTTCA CTTCGCCGTA
CGAAAGAACA TGCGACGAGA TGAGGGACCA GCAAAAGTCC CGATAATGAT GGGTGGTGAT
TTCCTTCGAG CGTATCGTGA GTACATTGAC GCGATCGGTG GGAATGGGAA GTTGGTTCCG
AGTGACCAGT CTGAATCTGG CTGCCGGACT CCAAGCACGC TGAATGAATG GCTGGGGCGA
CTATGTAAGA TTGCTGGCGT TCGACTTGAT GGCGGGGAGT TTCCGACGAT TCAGAACTTT
CGCCAGTTTT GGAAGACACT GTATAAGAGG GCAGTTGCAG AGAACCGAGA GCAGATCAAA
TTTGTCTCTG AAGAAGATGG CAAGAAGGAT TACGAGAGTG ATGAGCGTGA TTACATCGAC
GATGTAGTGA ACCGACAGCA TGTTCGTGGT CTTGGTCGGG AATATTTTGG TGACGTGCTG
GACCTCGGTG AATTACCTGA ATTAGTTCGG GAAGAGCTGG ACCAAGATCA GCATGGTGAG
CGACAGACCA AGTTCACCGA TCACGACTTT GGCACCTGA
 
Protein sequence
MTRERIGRKT LKEQLPAILH EQFQLDNLPQ DHEPSWEYIT ANTRYSAQGL NNKSKELYGQ 
TILEFLREQG FGVRSTGKWP TDDEETIRSL EYYIESAEER KEWSENTIDS VESVMNKVYE
AIRDEGLDIE MLDIGYYDSE KNRVENIQHA ITIIEYMDRD LADSTMGNYP RYFEEYYNIV
KNKHQININP VEEALDEFEW YRSDSDAQPV TEAQLNDLWN ALDVLDECPV DGHDLERWRL
WMKMLLIFLI AVGPRSNEVE QLDLRTQLHF GDDPHVHFAV RKNMRRDEGP AKVPIMMGGD
FLRAYREYID AIGGNGKLVP SDQSESGCRT PSTLNEWLGR LCKIAGVRLD GGEFPTIQNF
RQFWKTLYKR AVAENREQIK FVSEEDGKKD YESDERDYID DVVNRQHVRG LGREYFGDVL
DLGELPELVR EELDQDQHGE RQTKFTDHDF GT