Gene HY04AAS1_0929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_0929 
Symbol 
ID6743741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp876362 
End bp877783 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content40% 
IMG OID642750735 
Productprotease Do 
Protein accessionYP_002121594 
Protein GI195953304 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000572701 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA TTTTTGCAAT ATTATCTGTA TTTGCCCTTT TGTTTTTCAC AAACTCCTGT 
GTGAAGAAAA GTAGAGTAGA ACAAGCTACC TCGACTCAGC AACCGTCTAG CCAGTACCAA
CTCAAATTAA ATGTACCAGT ATTGGCACAG ATGCAAGATG AACTTGTGCA GATTGTAAAA
AGAGTCTCTC CTTCTGTGGT AACGATATTT TCTACCCAAG AGGTAAACGT ACCGCTTTTT
CCACAAATAC CTGGTTTTGA CCTTCCAACA CCTTCGATAC CACAAGAAAC AAAGGCTCTT
GGGTCCGGTG TTATATTTGA ATACAACAAG CAAAACGATA CGTTTTTTAT ACTTACAAAC
AACCATGTTA TAGCTCATAG CAAAAGCGTA GTGGTGAATT TTGGAAAAAA TGAACAGCAT
CAAGCTAAAG TATTAGGTGC AGATCCAAAA ACAGATTTGG CAGTATTAGA AGTTAGTGCA
AAGGGGATAA AAGATCCAGA TTCAAGAGTG GCAACGCTTG GTAATTCAGA TACACTCCAA
GTGGGCCAAA TCGTGCTAGC TATAGGTAAT CCTTATGGCC TAGATAGGAC TGTGACGATG
GGGGTTATAT CTGCTTTACA TAGGAGTATA GGTTTAACTC AATATGAAAA TTACATACAA
ACGGATGCTG CTATAAACCC AGGCAACAGC GGCGGGCCTC TTGTAAATAT ACAAGGCCAG
GTAATAGGTA TAAACTCTGC TATGGTAGAA GGCGGTCAAG GTCTTGGCTT TGCCATCCCT
ATAAATTTAG CAAAATGGGT ATCTTCTCAA ATTATAAAAC ATGGTTCTGT TACAAGGGGA
TGGATAGGTG TGATGATACA ACAAGTTACG CCAAGCTTAG CAAAAGCTCT AAAAGTTCAA
AACGGTGCTG TGGTAGTTCA AGTTATGCCA AATGGCCCTG CAGATAAAGC TGGTATAAAA
GTGGGTGATG TTATAGTAGG TATAGACAAC GAAAATATAA GCACTATACA ACAGCTGCAA
TTTAAAGTGA TGGAGACAAA ACCTGGTACT ACTCTCACAT TTCACATAAT AAGAAATGGA
AAGCCCATGG ACTTAAAAGT AACTATAGGG AAAATGCCTA CAAACCCAAC GTCTGTTAGT
GAAACTCAAA CTACCACAGA CCTTGGCATA TCTGTAGCAA ACCTAACTCC ACAACAGATG
CAAACTTACG GCGGTGGTGT TTATGTGGTA AGCGTAGGCC CAAACAGTCC AGCAGCTAGT
TCTCTTCAGC CTGGAGACGT GATACTAATG GTAAACAACC ATCCTGTGAA CTCTGTAAAT
GATTTTAAAT CGCTTGTATC TCAATATGTA AAATCTGGAT ATGTGTTGTT TTTGGTGGCA
AGGGATGGGC AAAGATTTTA CGTAAGTATA CAAACAAGGT GA
 
Protein sequence
MRKIFAILSV FALLFFTNSC VKKSRVEQAT STQQPSSQYQ LKLNVPVLAQ MQDELVQIVK 
RVSPSVVTIF STQEVNVPLF PQIPGFDLPT PSIPQETKAL GSGVIFEYNK QNDTFFILTN
NHVIAHSKSV VVNFGKNEQH QAKVLGADPK TDLAVLEVSA KGIKDPDSRV ATLGNSDTLQ
VGQIVLAIGN PYGLDRTVTM GVISALHRSI GLTQYENYIQ TDAAINPGNS GGPLVNIQGQ
VIGINSAMVE GGQGLGFAIP INLAKWVSSQ IIKHGSVTRG WIGVMIQQVT PSLAKALKVQ
NGAVVVQVMP NGPADKAGIK VGDVIVGIDN ENISTIQQLQ FKVMETKPGT TLTFHIIRNG
KPMDLKVTIG KMPTNPTSVS ETQTTTDLGI SVANLTPQQM QTYGGGVYVV SVGPNSPAAS
SLQPGDVILM VNNHPVNSVN DFKSLVSQYV KSGYVLFLVA RDGQRFYVSI QTR