Gene HY04AAS1_1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1490 
Symbol 
ID6744320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1407051 
End bp1408964 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content31% 
IMG OID642751311 
Producttransglutaminase domain protein 
Protein accessionYP_002122152 
Protein GI195953862 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAA GTTTAAAAGT TTACAGCTTT CTTTTAATAC TGGTATATTT AAGCATAGCA 
TTCTCAATAC TATCTATAAG TTTAGTTAGT TTTAGCCCTA TATTTTTTAT AGGTATTGCT
TTGGTTGTTA TAGGCATTTT TCAGGATTTT AAAAACAAAT ATTTACCAAG AATCGTTGTA
AATACAATTG CAGTTTCATC TGTGATTTTG GTTTTTGCAG TAAGCTTCAA CATAACTGCA
CTTTTTGATT TTGCTAAAAA CATAATAATA ACGTTCTTAG GCATCAAATC CTTAGAGAAA
AAGCAACCAA GAGACATATA CCAAATACTT ATATTGGAAA CCATGGGTAT GGGTATAGTA
GGGGTTGCCA CCACAGATAT AAAGTTTTTA GCGATGCTAA TAATTTGGGT TTTCTTGAGC
ATTTTTATAT TCTTAACCAC AAATATTTTT AAAAGCCTAA AAGACGAAAT CTTAACAAAA
TATCATATAA AACTTATCAG CTACGCTGTA GGGTTTATAT CTATAAGCAC CGTAGTTATA
GGATTTTTCA TATTTCTTGC TATGCCAAGG ATACAATCGC CTCTTTTAAA CATAGGAATA
GGGGGTATTT CAAATACAGT AGGATTTTCT AATACACTTT CTCCATCAAA CGCCACCAAT
GTACTAGAAA ACACCTCCAC AGTTTTCAGA ATATTTAACA TAAAAGGAGA TATAAATTTA
AAGGATGCTT ACTTCATAGG AGAAACCTTA GATTATTTCA ACGGCATAAG TTGGACTCAC
AAACATAGAG CAAAGGGACC AAAGATTCTA AAAGGTAACC TTGTATCTTG CGATATCATG
ATAGAGCCAA GCTACGACAA TATTCTTTTT GGTATCTTGT TTCCATACAT GGTTAAAATC
TATAAAAACC CTATAAAAGT TTATATAACA AGCGATAATA CGATAAGAAC AAATAAACCC
ATCACAAATA GAACTGTTTA TAGGGTGTGG TCTTACGTAA CAGATTCCTA TAGACAAAAC
CTTGTAAATA TGAACAGATT TTTACAACTG CCTCAAAACA TAGACCCATC AATAGTAAAA
CTTGCACAAT TCCTAAAAGC CCAAAACAAA AATCCCATAG AGGCTGTAGA AAACTATTTT
AAGCAGGAAA ACTTCAAATA TTCTCTAAGT AACAAAGCTT CAAATAACTT TCTATACGAT
TTTTTATTTA AGTATAAGGC TGGAAACTGT GAAGCCTACG CTTCTTCAAC TGCTCTTTTG
CTTAGATTGA TGGGTGTACC TTCAAGAGTG ATAGTGGGTT TTCATGGGGC TATTTACAAC
AAAGATGGGC ACTACTTTTT TGTAACAAAC TCCTCAGCTC ATTCATGGGT AGAAGCATAC
TACAACGGTA AATGGCAAAC CGTGGATACA ACCCCAACAG ACTACACACA GACAATACCA
AAATTAAGTA AAGCAAGGAT GTTTCTTGAT TATATAAACT ACCTATGGGA TATAAATGTA
ATATACTATT CTACCGCAAG GCAAAAGTAT CTACTGGAAA GCACCGCTAA AAATATAAAA
GCTATAGCTA CTCATTATAC AAAATATTTA GTTTATGCAG TGGTTGTTTT GATGGTTTTT
TATATGGCTT TTAATAAAAT ATTACTAATG TTTAGCATAG ATGCCATGTA TAAAGATATA
TGCAAAAGGT TGAAACAGTC TAATATGAAA TATTGCACAC CAGAGAATGC ACCAAGCTTC
ATAAAAGAAA ACGGTTTTAA AGTCTTTTTT GATATTTATA TAAAAGCTAA GTATTCAAAA
TACGGCATAG ATAAAAAAGA GAAAAAAATG GCAAAAATAT ATTATAATAA TACTATAAAG
GCTATAAAAG AGTTTAACAG TTCTATTAAT AGACAACTTA GCAAAGATAC ATAA
 
Protein sequence
MAKSLKVYSF LLILVYLSIA FSILSISLVS FSPIFFIGIA LVVIGIFQDF KNKYLPRIVV 
NTIAVSSVIL VFAVSFNITA LFDFAKNIII TFLGIKSLEK KQPRDIYQIL ILETMGMGIV
GVATTDIKFL AMLIIWVFLS IFIFLTTNIF KSLKDEILTK YHIKLISYAV GFISISTVVI
GFFIFLAMPR IQSPLLNIGI GGISNTVGFS NTLSPSNATN VLENTSTVFR IFNIKGDINL
KDAYFIGETL DYFNGISWTH KHRAKGPKIL KGNLVSCDIM IEPSYDNILF GILFPYMVKI
YKNPIKVYIT SDNTIRTNKP ITNRTVYRVW SYVTDSYRQN LVNMNRFLQL PQNIDPSIVK
LAQFLKAQNK NPIEAVENYF KQENFKYSLS NKASNNFLYD FLFKYKAGNC EAYASSTALL
LRLMGVPSRV IVGFHGAIYN KDGHYFFVTN SSAHSWVEAY YNGKWQTVDT TPTDYTQTIP
KLSKARMFLD YINYLWDINV IYYSTARQKY LLESTAKNIK AIATHYTKYL VYAVVVLMVF
YMAFNKILLM FSIDAMYKDI CKRLKQSNMK YCTPENAPSF IKENGFKVFF DIYIKAKYSK
YGIDKKEKKM AKIYYNNTIK AIKEFNSSIN RQLSKDT