Gene Huta_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1954 
Symbol 
ID8384247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1978078 
End bp1979739 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content68% 
IMG OID644973023 
ProductO-sialoglycoprotein endopeptidase/protein kinase 
Protein accessionYP_003130855 
Protein GI257053022 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATTC TCGGCATCGA AGGCACGGCG TGGGCGGCGA GTGCCGCCGT CTACGAACGG 
ACGGACAGCG GTGAATCCGT CGTTATCGAG ACTGACGCCT ACGAACCCGA CAGCGGCGGG
ATTCACCCGC GGGAAGCCGC CGAGCACATG CGCGAGGCGA TCCCGCAGGT CGTCGAACGG
GCACTCGACA TCGCCCGCGA GCAGGCTGCC GACGCGGGCG AAGACCCCGA CGAATCGCCG
GTCGACGCCG TCGCTTTCTC ACGCGGTCCG GGACTGGGGC CCTGTCTGCG GATCGTCGCC
ACGGCCGCCA GGGCACTGGC ACAGCGGCTG GACGTCCCGC TGGTCGGCGT CAATCACATG
GTTGCGCATC TGGAGATCGG CCGTCATCGC TCGGGCTTTT CCGCGCCGGT GTGTCTGAAC
GCCTCCGGCG CGAACGCCCA CATTCTGGGG TATCGAAACG GGCGGTATCG CGTCCTCGGG
GAGACGATGG ACACCGGCGT CGGCAACGCC ATCGACAAGT TCACCCGCCA CCTCGGGTGG
TCCCATCCCG GCGGGCCGAA GGTCGAAAAG CGGGCAAAAG ACGGCGAGTA CATCGACCTG
CCCTACGTCG TCAAGGGGAT GGACTTCTCC TTTTCGGGAA TCATGAGCGC CGCCAAGCAA
GCGATTGACG ATGGGGAGGC AGTAGAGGAC GTCTGCTACT CGCTCCAGGA GAACATCTTC
GCGATGCTGA CGGAAGTCGC GGAGCGGGCC CTCTCCCTGA CCGACGCCGA CGAACTCGTC
CTCGGCGGGG GTGTCGGGCA GAACGAGCGC CTCAGAGAGA TGCTCGGCAA GATGTGCGAC
CAGCGCGGGG CTGATTTTTA CGCGCCCGAA CCCAGATTTC TCCGGGACAA CGCGGGGATG
ATCGCCGTCC TCGGCGCGAA GATGTACGAC GCGGGCGACA CGATTCCGAT CGAGGACTCA
CGCGTCCGGC CGGACTTCCG GCCCGACGAG GTTGACGTGA CCTGGCGATC CGACGAGGCC
GTCGGTTCGT GGGGCGGGTC GAGCGACGAC GGGACGGTCG GTGCCCGGGA CGGAGCGGGA
GCCGACGATG CCGTCCAGGG GGCCGAAGCG ACCGTCACCG TCGAGGACGG CCGGGTCAGG
AAGGAGCGCC AGCCACGGAC CTACCGCCAT CCGACGCTCG ACGAGCGCCT CCGGACCGAG
CGAACGCGCG AAGAAGCCCG ACTCACGAGC GAAGCGCGCC GCGTCGGCGT CCCGACGCCG
GTCGTCCACG ACGTCGACCC GCAGGAAGGC GTCCTGGTCT TCGAGCGCGT GGGCGAGCGG
GATCTCCGTG AGGCCCTGAC ACTCGATCGG GTCCGGGACG TCGGGCGACA CCTGGCGACG
ATCCACGACG CGGGGTTCGT CCACGGCGAT CCGACGACGC GAAATGTCAG AGTTTCAGAA
GATCGCACTC ACCTCATCGA CTTCGGCCTG GGCTACTACA CCGGCCACGC CGAGGATCAC
GCGATGGACC TCCACGTCTT CGCCCAGTCG CTGGCTGGAA CCGCTGACGA CCCCGAGGCA
CTGCGATCGG CCGCCGAGGA CGCCTATCGC GAGACGGCAG ACGAAGGCGG GGCGGTGCTG
GATCGTCTCC GCGAGATCGA GGGACGCGGC CGGTATCAGT GA
 
Protein sequence
MRILGIEGTA WAASAAVYER TDSGESVVIE TDAYEPDSGG IHPREAAEHM REAIPQVVER 
ALDIAREQAA DAGEDPDESP VDAVAFSRGP GLGPCLRIVA TAARALAQRL DVPLVGVNHM
VAHLEIGRHR SGFSAPVCLN ASGANAHILG YRNGRYRVLG ETMDTGVGNA IDKFTRHLGW
SHPGGPKVEK RAKDGEYIDL PYVVKGMDFS FSGIMSAAKQ AIDDGEAVED VCYSLQENIF
AMLTEVAERA LSLTDADELV LGGGVGQNER LREMLGKMCD QRGADFYAPE PRFLRDNAGM
IAVLGAKMYD AGDTIPIEDS RVRPDFRPDE VDVTWRSDEA VGSWGGSSDD GTVGARDGAG
ADDAVQGAEA TVTVEDGRVR KERQPRTYRH PTLDERLRTE RTREEARLTS EARRVGVPTP
VVHDVDPQEG VLVFERVGER DLREALTLDR VRDVGRHLAT IHDAGFVHGD PTTRNVRVSE
DRTHLIDFGL GYYTGHAEDH AMDLHVFAQS LAGTADDPEA LRSAAEDAYR ETADEGGAVL
DRLREIEGRG RYQ