Gene Hlac_2014 details


Gene Information

Locus tag: Hlac_2014
Symbol:
ID: 7402033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2007186
End bp: 2008265
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 65%
IMG OID: 643709085
Product: peptidase M29 aminopeptidase II
Protein accession: YP_002566662
Protein GI: 222480425
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
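
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2007186, 2008265)
    print(length)                  # 1080
    print(protein_length(length))  # 359
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.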


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCAC GCGTTCGCGA ACACGCGGAG ATCATCGCCG ACCACTCCAC CGACATCCAG 
TCGGGCGACG ACGTGGTCAT CCAGATGCCG AAGGAGGCCG AGGACCTCGC GGTCGCCCTC
CACGAGATCT GCGGGGATCG CGGTGCCAAC CCCGTCTACC TCAACTACTC GAAGCGCGCC
CAGCGCGCCT TCAAGCGCTC ATCGGACGAC TTCACCGAGC CGAGCCACCG ACGCGCGCTC
TACGAGGAGG CCGACGTGTT CATCATCGCG CGCGGCGGCT CGAACGCCAC CGAGGACGCC
GACATCGACC CGGAGACCAA CGCGGCCTAC AACCGGGCGA TGGAGGACGT CAAGCGGACG
CGGCTCTCGA AGACGTGGTG TCTCACGCAG TACCCGACCG CGAGCCACGC CCAGCTCGCC
GGAATGAGCA CCGAGGCGTA CGAGAACTTC GTGTGGGACG CCGTCTCGCT CGACTGGGAC
GAACAGCGCG AGTTCCAGTC GAACATGGTC GAGATCCTCG ATACCGCCGA CGAGGTCCGG
ATCACATCCG GTGAGGAGAC CGACCTGACG ATGGACCTGT CGGGTAACTC CACGCTTAAC
GACTACGGCG AGGCCAACCT TCCCGGCGGT GAAGTGTTCA CCGCGCCCGT GCGCGACGGC
GTCGACGGCG AGGTTCACTT CGATCTACCA CTCTATCGCT ACGGCCGCGA GATCGAGGGG
GTCCGGCTCC GGTTCGAGGA CGGAGAGGTC GTCTCCCACT CCGCCGAACG CAACGAGGAC
CTGCTGACGG GGATCCTCGA CACCGACGAG GGATCTCGGC ATCTCGGGGA ACTCGGCATC
GGGATGAACC GCCAGATCGA CCGGTTCACC TACAACATGC TGTTCGACGA GAAGATGGGC
GACACCGTCC ACATGGCGGT CGGTTCCGCG TACCCGGAGA CGGTCGGCGA AGGCAACGAG
GTCAACGAGT CCGCCGAGCA CGTCGACATG ATCGTCGACA TGAGCGAGGA CTCCGTCATC
GAAGTCGACG GCGAGGTTGT CCAGCGCAAC GGGACGTTCG TCTTCGAGGA CGGGTTCTAA
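
The GC content field can be recomputed from a sequence laid out in 10-base groups like the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the helper name `gc_percent` is hypothetical, and the database may round or compute the value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two 10-base groups of the gene sequence, used as a small input.
    print(gc_percent("ATGGACGCAC GCGTTCGCGA"))
```

Passing the full gene sequence as a single string (line breaks included) would give a value to compare with the GC content field.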
 
Protein sequence
MDARVREHAE IIADHSTDIQ SGDDVVIQMP KEAEDLAVAL HEICGDRGAN PVYLNYSKRA 
QRAFKRSSDD FTEPSHRRAL YEEADVFIIA RGGSNATEDA DIDPETNAAY NRAMEDVKRT
RLSKTWCLTQ YPTASHAQLA GMSTEAYENF VWDAVSLDWD EQREFQSNMV EILDTADEVR
ITSGEETDLT MDLSGNSTLN DYGEANLPGG EVFTAPVRDG VDGEVHFDLP LYRYGREIEG
VRLRFEDGEV VSHSAERNED LLTGILDTDE GSRHLGELGI GMNRQIDRFT YNMLFDEKMG
DTVHMAVGSA YPETVGEGNE VNESAEHVDM IVDMSEDSVI EVDGEVVQRN GTFVFEDGF
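
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation codons, and stops at the first stop codon; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table built in TCAG order; amino acids follow the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence.
    print(translate("ATGGACGCAC GCGTTCGCGA ACACGCGGAG"))  # MDARVREHAE
```

The output for this prefix matches the first ten residues of the protein sequence listed above.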