Gene HY04AAS1_0673 details


Gene Information

Locus tag: HY04AAS1_0673
Symbol:
ID: 6743473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 602086
End bp: 603276
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 36%
IMG OID: 642750468
Product: peptidase U32
Protein accession: YP_002121338
Protein GI: 195953048
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
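
The coordinate and length fields in the table above are related by simple arithmetic. The following Python sketch checks that relationship. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 602086
end_bp = 603276
reported_gene_length = 1191    # bp
reported_protein_length = 396  # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1191 396
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```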


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC CAGAGATACT ATCTCCAGTA GGTCATTTTG AAGGATTGAT GTCCGCAATA 
AAAGCCGGTG CTGATGCTGT ATACATGGGA CTTGAAAAAC TAAATCAAAG AGCTGGAAAA
GGTGGTTTTT CAAAAGAAGA TATAAAAGAA ATTAGACTTA TAACCAAAGA CCATGGTATA
AGGCAATATA TAACGCTAAA TTCTATAGTG TTTGATGAAG ATTTACCTTA TTTGGAAGAT
GTACTTGATT TTTTAAAAGA AATAGAAGTT GATGCGGTTA TAGCTTGGGA TTTCTCGGTA
GTACTTGGAA GTATAAAAAG AGGTTTAGAA ACACATATCT CCACAATGGC ATCGGTATCA
AACCACATAT CTGGTAAATT CTATAAAGAA CTTGGCGTAA AACGAATAGT CCCGGCAAAG
GAGTTAGATT TAAACTCCAT TAAAAGCTTA AAACAAAATA CCGGATTAGA AGTAGAGGTT
TTCGTACATG GATCTATGTG TATGGCTGTA TCCGGTAGAT GTTTTTTGAG CCATGAGGTT
TTTCAAAAAT CCGGAAACAG AGGAGAGTGC TATCAAGTCT GTAGACACGA GTTTGACATA
AAGGTAATAT CGAAAAACTC CGGCACAGAG TATTACCTTG GTAGCGACTA TGTTATGTCA
GCCAAAGATC TTCTTACTAT AAACTTTGCG GATAAGCTTA TGTGGGCAGA CGCTTGGAAG
ATAGAAGGAA GAAACAAAAA CCCAGATTAT GTTTATATGA CCACAAAAGC TTACAGAGAA
GCCAGAGAAC GTATATTAAA CAACGAGTGG ACCCAAAAAG GCTATCAAGA CCTTATAGAT
ATGTTAGAAA GAGTATACCA TAGGGAATGG GACGGTGGTT TTTACTTTGG TGAGGCTTCC
TTTGGTATAA ACTCATCTAT AGCAAAAGAA GAAAAGATAT ACGTAGGGGA TGTTGTAAAA
TTTTATCCCA AGGCTTCTGT GGCAGAGGTA AAAGTAGTAG CACATCCTTT AAAAGTAGGC
GATACCATAC ATATAATAGG AGAAACCACA GGCCTTGTAA GACAGAAAGT AGAATCAATG
GAGATAGAAA ATCAACGCAT AGATCAAGCA GAAAAAGGCA CAGCTATAGG TTTAAAAGTG
AACGAAAAAG TAAGAGAAAA AGATAAAGTT TATATTGTAA AAGAAAAATA G
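
The gene sequence above is listed in blocks of ten bases, so it has to be joined before its length or GC content can be computed. A minimal Python sketch follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as demo input.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGAAAAAGC CAGAGATACT ATCTCCAGTA GGTCATTTTG AAGGATTGAT GTCCGCAATA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field of the record.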
 
Protein sequence
MKKPEILSPV GHFEGLMSAI KAGADAVYMG LEKLNQRAGK GGFSKEDIKE IRLITKDHGI 
RQYITLNSIV FDEDLPYLED VLDFLKEIEV DAVIAWDFSV VLGSIKRGLE THISTMASVS
NHISGKFYKE LGVKRIVPAK ELDLNSIKSL KQNTGLEVEV FVHGSMCMAV SGRCFLSHEV
FQKSGNRGEC YQVCRHEFDI KVISKNSGTE YYLGSDYVMS AKDLLTINFA DKLMWADAWK
IEGRNKNPDY VYMTTKAYRE ARERILNNEW TQKGYQDLID MLERVYHREW DGGFYFGEAS
FGINSSIAKE EKIYVGDVVK FYPKASVAEV KVVAHPLKVG DTIHIIGETT GLVRQKVESM
EIENQRIDQA EKGTAIGLKV NEKVREKDKV YIVKEK
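
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first 30 bases and the last 12 bases of the gene. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` and the use of "X" for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAAGC CAGAGATACT ATCTCCAGTA"))  # MKKPEILSPV
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("AAAGAAAAATAG"))  # KEK
```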