Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0691 |
Symbol | |
ID | 6743494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 624089 |
End bp | 625120 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642750489 |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_002121356 |
Protein GI | 195953066 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00000597905 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACG ACGAAGAAAA GCTATGGCTT GGTATTGAGA CCTCTTGTGA CGATACTGCT TTAGCTTTGT ATAGTAGTAA AAGAGGTCTT ATAGATAATC TCCTTAGCTC CCAAGTAAAT GCTCACAAGA TATACAACGG CATAGTCCCA GAGCTTTGCT CCAGAGAGCA TACAAAAAAT CTTTATATAC TGTTTTATGA GCTTTTAGAA AAACATAAAA TAAAACCCTC TGATATAGAT TTTTTGGCAG TCACGATAGC CCCCGGGCTT ATATTATCTC TTTTGGTTGG AGCTTCTTTT GCCAGTGGTT TATCTTATGC TTTGGATATA CCTATAGTAC CAGTCCATCA TATAGAAGCT CACATATACT CGGTGTTTTT AGAATACAAC GTAGAATATC CATTTTTGGC CCTTGTGGTG TCCGGAGGTC ACACTGAGAT TTACCTCGTA AAAGGCTTTG AACATTATGA GCTAATAGGG AAAACCCTTG ACGATGCAGC CGGCGAGGCC TTTGATAAAG GAGCTGTGCT TCTTGGTCTT CAATACCCTG GCGGTCCTGC CATAGAAAAG TTTTTATCTT CATATGAAAA TCCTGAAACG ATAGACTTTC CAATACCTAT AAAAGATGAT AGGATAGCGT TTTCTTTTAG TGGTTTAAAA ACGTTTTTAA GAGAAAACAA AGACAAATAC CCCAAAGATG CCCTTGTTTT TTCCTACCAA GAGGCTATAG TAAATCACAT TATAAGAACC TTGCAAAAAG CTATAAAAAA AACCGCCGTC AACCGTCTTG TGGTGGTGGG TGGTGTAGCT GCCAACAAAC GCCTAAGAGA AAAGCTAAAC GCCCTTGATA TAGAGTGTTA TATCCCCTCC ATAAAATACT GCACAGACAA CGCAGCTATG GTAAGCTTGG TAGGCAACAT GAGATTTTTA AAAGGAAAAT ATTATAAAAA ATCGGATTTA CATAAATTAA ATCCAGATCC CTCTTTGAGA TTGGAGGATT TTGTAAGGAG TATTTTGTAT AAGTTCAGAT AG
|
Protein sequence | MKHDEEKLWL GIETSCDDTA LALYSSKRGL IDNLLSSQVN AHKIYNGIVP ELCSREHTKN LYILFYELLE KHKIKPSDID FLAVTIAPGL ILSLLVGASF ASGLSYALDI PIVPVHHIEA HIYSVFLEYN VEYPFLALVV SGGHTEIYLV KGFEHYELIG KTLDDAAGEA FDKGAVLLGL QYPGGPAIEK FLSSYENPET IDFPIPIKDD RIAFSFSGLK TFLRENKDKY PKDALVFSYQ EAIVNHIIRT LQKAIKKTAV NRLVVVGGVA ANKRLREKLN ALDIECYIPS IKYCTDNAAM VSLVGNMRFL KGKYYKKSDL HKLNPDPSLR LEDFVRSILY KFR
|
| |