Gene Information |
Locus tag | HY04AAS1_0202 |
Symbol | |
ID | 6742991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 178954 |
End bp | 179805 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642749992 |
Product | protein of unknown function DUF114 |
Protein accession | YP_002120872 |
Protein GI | 195952582 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | |
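The coordinate, gene-length, and protein-length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length_bp(178954, 179805)
print(length)                     # 852
print(protein_length_aa(length))  # 283
```

Both values agree with the Gene Length (852 bp) and Protein Length (283 aa) rows of this record.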
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAG ATATAGGTGG GCTTTTTAAT CTATTTTGGT TTTTATTTTT ATTTTTCTTT GTAGTTTGGC CTTTTTTGAA AAGAAGCCTT TTAAATAGAG AAAGGGAGGC AATGATAAGG CTTATAGAAA AAAAATATAA CGCCAGAGTC ATAACGATGA TTCATAGACA AGAAGGATTG TCATTTTTTG GGTTTGCCTT TATGAAGTTT ATAAACATAG AAGACTCTGA ACAGGTGTTA AGAGCCATAA GGATGACTCC AGAAGATATG CCCATTGTTA TGATACTTCA TACACCCGGT GGTTTAGCGC TGGCAGCTTC TCAAATAGCT AGTGCACTGG CAAAACACAA GTCAAAAGTT ATAGTCATAA TACCTCACTA CGCCATGAGT GGTGGTACAC TAATAGCCTT ATCTGCCGAC GAGATAACGA TGGACCATAA CGCCGTGTTG GGACCCGTTG ACCCACAAAT AGGGCAAATG CCGGCTGCTT CTATACTAAA AGTGCTTGAT GTAAAAAAAC CAGAAGACAT AGATGATGAA ACTATGATAA TGGCCGATGT ATCCAAAAAA GCAATAGAGC AGATGAAAAG CTATGTATAC GAGCTTTTAA AGAAAAAAGG ACATCCTGAT GATGTTGCAA AAAAGATAGC AGAAGAGTTA TCCACAGGTA AATTTACACA TGATTATCCT CTTGATGTAG ACCAGCTAAA AGCCATGGGC TTAAATATAA ACACCGATGT ACCAGAAGAG GTTTATGAAC TCATGGAGCT TTACGACCAA CCCACGAATT CTCAAGTACC CTCTGTCCAG TACATACCTA TTCCTTATAA AACCACTCAA AACAAAAAAT AA
|
Protein sequence | MNQDIGGLFN LFWFLFLFFF VVWPFLKRSL LNREREAMIR LIEKKYNARV ITMIHRQEGL SFFGFAFMKF INIEDSEQVL RAIRMTPEDM PIVMILHTPG GLALAASQIA SALAKHKSKV IVIIPHYAMS GGTLIALSAD EITMDHNAVL GPVDPQIGQM PAASILKVLD VKKPEDIDDE TMIMADVSKK AIEQMKSYVY ELLKKKGHPD DVAKKIAEEL STGKFTHDYP LDVDQLKAMG LNINTDVPEE VYELMELYDQ PTNSQVPSVQ YIPIPYKTTQ NKK
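The sequences above are displayed in blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, that strips the spacing, computes GC content, and translates a coding sequence. The helper names are hypothetical. The codon table is written out for translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (the tables differ in which start codons are permitted). Because the displayed gene sequence begins with ATG and ends with TAA, it is assumed to be given in coding orientation already, so no reverse complement is applied despite the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAATCAAG ATATAGGTGG GCTTTTTAAT")
print(translate(prefix))  # MNQDIGGLFN
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence through `clean_sequence`, `gc_content`, and `translate` would allow the GC content and protein sequence fields to be compared against the record in the same way.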
|
| |