Gene Htur_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3053 
Symbol 
ID8743672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3133445 
End bp3135181 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content69% 
IMG OID646513638 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003404593 
Protein GI284166314 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTCTG ACATCCGAAT CCTCGGAATC GAAGGCACCG CCTGGGCGGC CAGCGCGGCA 
GTCTACGATT CCGCGACCGA CGACGTCTTC ATCGAGAGCG ACGCCTACCA GCCCGACAGC
GGCGGCATAC ACCCCCGCGA GGCCGCCGAA CACATGCACG ACGCGATTCC CCGCGTCGTC
GAGACCGCAC TCGAGCACGC CCGCGAGACC CACGACGGGC CCGCCGGCGA GGCGCCGGTC
GACGTCGACG AGCGAAGCTC GTCGGGCCAA CAGGCTGCGC CTGTTGATGC CATTGCGTTC
TCTCGAGGGC CGGGGCTCGG CCCCTGTCTG CGGATCGTCG GCACGGCCGC CCGGGCGCTC
TCGCAGGCGC TCGAGGTCCC GCTCGTCGGC GTCAACCACA TGGTCGCCCA CCTCGAGATC
GGGCGCCATA CGGCGGACTT CGACTCGCCA GTCTGTCTGA ACGCCAGCGG CGCCAACGCC
CACCTCCTGG CCTATCGCAA CGGCCGCTAC CGCGTGCTCG GGGAAACGAT GGACACCGGC
GTCGGCAACG CTATCGACAA GTTCACCCGC CACGTCGGCT GGTCCCACCC CGGCGGGCCG
AAGGTCGAGG CGGCCGCCGA GGACGGCGAG TACGTCGACC TCCCGTACGT CGTCAAGGGC
ATGGACTTCT CCTTTTCGGG GATCATGAGC GCCGCAAAGC AAGCTTACGA CGACGAGACG
CCGGTCGAGG ACATCTGTTT CTCGCTGCAG GAGAACATCT TCGGCATGCT GACCGAAGTG
GCCGAACGCG CGCTCTCGCT GACCGGCAGC GACGAACTCG TGCTGGGCGG CGGCGTCGGA
CAGAACGAGC GCTTACGCGA GATGCTCGCG GAGATGTGCG CCCAGCGCGG GGCCGAGTTC
CACGCGCCCG AACCCCGGTT CCTCCGGGAC AACGCCGGCA TGATCGCCGT GCTCGGCGCG
AAGATGTACG AGGCCGGCGA CACACTCGAG ATCGAGGACT CGCAGGTCGA TCCGAACTAT
CGGCCGGATC AGGTGCCGGT GACGTGGCGA CGCGACGAGC CCGAGCTCGC GGCCGGCCGC
GGGGCGGACG GCAGCGAGAC GCAGGTCCGC GGCGCCGAAG CGCTCGTCGA CCTCGAGCCC
GAAACCGGCC GTGTCACGAA ACACCGCGAG GTCAAGAGCT ACCGCCATCC CGAACTCGAC
GAGCGACTGC GCCGCGAGCG GACGACCCTC GAGGCCCGCC TGACCAGCCT CGCACGCCGC
GAGGGGGTGC CGACGCCGGT GCTCTCGGAC GTCGATCCGC GGGAGGCGCG CCTCGAACTC
GAGTACGTCG GCGAGACGGA TCTCCGCGAT GGGCTGACCG CCGAGTGCGT TCGCGACGTC
GGTCGACACC TCGCACGACT GCACTGGGCC GGGTTCGTCC ATGGCGATCC GACGACGCGA
AACGTCCGCG TCGGGCGTGC GGGACGCGAC GCCTCCCGAG ACGAGCGAAC GGACGAAGTC
CGTGAGCGAA CCGTCCTCAT CGACTTCGGC CTCGGCTACC ACACCGACCA CGTCGAGGAC
TACGCGATGG ACATCCACGT CTTCGACCAG AGCCTCGTCG GTACCGCCGA TGACCCCGAC
CCGCTCCGCG AGGCGCTTCG GGAGGGCTAC CGCGAGGTCG GCGAGGAGCG AGTGCTCGAG
CGCCTGCGGG ACGTCGAGGG ACGCGGCCGG TACGTTACCG ACGACGCTCC GGAATAG
 
Protein sequence
MNSDIRILGI EGTAWAASAA VYDSATDDVF IESDAYQPDS GGIHPREAAE HMHDAIPRVV 
ETALEHARET HDGPAGEAPV DVDERSSSGQ QAAPVDAIAF SRGPGLGPCL RIVGTAARAL
SQALEVPLVG VNHMVAHLEI GRHTADFDSP VCLNASGANA HLLAYRNGRY RVLGETMDTG
VGNAIDKFTR HVGWSHPGGP KVEAAAEDGE YVDLPYVVKG MDFSFSGIMS AAKQAYDDET
PVEDICFSLQ ENIFGMLTEV AERALSLTGS DELVLGGGVG QNERLREMLA EMCAQRGAEF
HAPEPRFLRD NAGMIAVLGA KMYEAGDTLE IEDSQVDPNY RPDQVPVTWR RDEPELAAGR
GADGSETQVR GAEALVDLEP ETGRVTKHRE VKSYRHPELD ERLRRERTTL EARLTSLARR
EGVPTPVLSD VDPREARLEL EYVGETDLRD GLTAECVRDV GRHLARLHWA GFVHGDPTTR
NVRVGRAGRD ASRDERTDEV RERTVLIDFG LGYHTDHVED YAMDIHVFDQ SLVGTADDPD
PLREALREGY REVGEERVLE RLRDVEGRGR YVTDDAPE