Gene HY04AAS1_0691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_0691 
Symbol 
ID6743494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp624089 
End bp625120 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content38% 
IMG OID642750489 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_002121356 
Protein GI195953066 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000597905 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACACG ACGAAGAAAA GCTATGGCTT GGTATTGAGA CCTCTTGTGA CGATACTGCT 
TTAGCTTTGT ATAGTAGTAA AAGAGGTCTT ATAGATAATC TCCTTAGCTC CCAAGTAAAT
GCTCACAAGA TATACAACGG CATAGTCCCA GAGCTTTGCT CCAGAGAGCA TACAAAAAAT
CTTTATATAC TGTTTTATGA GCTTTTAGAA AAACATAAAA TAAAACCCTC TGATATAGAT
TTTTTGGCAG TCACGATAGC CCCCGGGCTT ATATTATCTC TTTTGGTTGG AGCTTCTTTT
GCCAGTGGTT TATCTTATGC TTTGGATATA CCTATAGTAC CAGTCCATCA TATAGAAGCT
CACATATACT CGGTGTTTTT AGAATACAAC GTAGAATATC CATTTTTGGC CCTTGTGGTG
TCCGGAGGTC ACACTGAGAT TTACCTCGTA AAAGGCTTTG AACATTATGA GCTAATAGGG
AAAACCCTTG ACGATGCAGC CGGCGAGGCC TTTGATAAAG GAGCTGTGCT TCTTGGTCTT
CAATACCCTG GCGGTCCTGC CATAGAAAAG TTTTTATCTT CATATGAAAA TCCTGAAACG
ATAGACTTTC CAATACCTAT AAAAGATGAT AGGATAGCGT TTTCTTTTAG TGGTTTAAAA
ACGTTTTTAA GAGAAAACAA AGACAAATAC CCCAAAGATG CCCTTGTTTT TTCCTACCAA
GAGGCTATAG TAAATCACAT TATAAGAACC TTGCAAAAAG CTATAAAAAA AACCGCCGTC
AACCGTCTTG TGGTGGTGGG TGGTGTAGCT GCCAACAAAC GCCTAAGAGA AAAGCTAAAC
GCCCTTGATA TAGAGTGTTA TATCCCCTCC ATAAAATACT GCACAGACAA CGCAGCTATG
GTAAGCTTGG TAGGCAACAT GAGATTTTTA AAAGGAAAAT ATTATAAAAA ATCGGATTTA
CATAAATTAA ATCCAGATCC CTCTTTGAGA TTGGAGGATT TTGTAAGGAG TATTTTGTAT
AAGTTCAGAT AG
 
Protein sequence
MKHDEEKLWL GIETSCDDTA LALYSSKRGL IDNLLSSQVN AHKIYNGIVP ELCSREHTKN 
LYILFYELLE KHKIKPSDID FLAVTIAPGL ILSLLVGASF ASGLSYALDI PIVPVHHIEA
HIYSVFLEYN VEYPFLALVV SGGHTEIYLV KGFEHYELIG KTLDDAAGEA FDKGAVLLGL
QYPGGPAIEK FLSSYENPET IDFPIPIKDD RIAFSFSGLK TFLRENKDKY PKDALVFSYQ
EAIVNHIIRT LQKAIKKTAV NRLVVVGGVA ANKRLREKLN ALDIECYIPS IKYCTDNAAM
VSLVGNMRFL KGKYYKKSDL HKLNPDPSLR LEDFVRSILY KFR