Gene Information |
Locus tag | Athe_0735 |
Symbol | |
ID | 7408429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 825692 |
End bp | 826918 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715107 |
Product | peptidase U32 |
Protein accession | YP_002572623 |
Protein GI | 222528741 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
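
The coordinate and length fields above agree with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(825692, 826918)
print(length, protein_length(length))  # prints "1227 408"
```

Both results match the Gene Length (1227 bp) and Protein Length (408 aa) fields.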
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0327554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA AAAAGCCTGA ACTTGTTGCA CCTGCGGGCG ATTTGGAAAA GCTCAAAACT GCAATTTTGT ACGGGGCAGA CAGTGTATAT ATTGGCGGCA GGGAATTTGG ACTTAGAAAA TATGCGGGCA ATTTTGATTT TGATGAGATG AAAGAAGGCA TTGATTTTGC ACACAAACAT GGCAAAAAGG TGTATCTTAC AGCTAACATA TTTGCAAGGA ATGAAGATAT TAAGAAAATA GATGAATTTT TTGACATTAT AAAAAGCTTT GAATTTGACG GAATAATTGT GTCCGACCCA GGGATATTCG TGAAAGCTAA AAAGCTTGGA ATACCAATTC ATATAAGCAC TCAGGCAAAT ACAACAAACT ATGAATCTGC TCGTTTTTGG CATCAGCTTG GAGCAAAGAG GATTGTACTT GCAAGGGAGT TGTCTTTGGA TGAAATCAGA GAAATAAGAG AAAATGTTCC AGATATACTT GAGCTTGAGG CTTTTGTTCA TGGAGCGGTC TGCATTTCAT ATTCAGGCAG GTGTTTTCTA AGTGCGTATA TGACATACAG AGACGCAAAC AGAGGCGAGT GTGCTCATCC GTGCAGGTAT AAATATTATG TCATGGAAGA AAAAAGACCA GGCCAGTATT TTGAGGTATT TGAAGATGTT GATGGCACGT ATATTTTTAA CTCGAAAGAC CTTTGCATGG TTGAGCACAT TGACAAGCTT GCTTTTGCAG GTATTGACGC TTTTAAGATT GAAGGAAGGA TGAAAAGTAG TTTCTACGTT GCGACAGTTG TAAGTGTGTA CAGAAAAGCA ATTGACAAGT TTATAAAAGA CCCTGAACAT TTTGAACCTG AAAAAGAGTG GCTTGAAGAA ATTGCAAAGT GTTCTCACAG AAGTTACACA ACGAACTTTT ATTTTGGAAA GCCAGATCAT GACGACTATA GATTTGAGTC AAGCAAATAT GTCAGGGAAT ATGAATTTGT GGGAATTGTC AAAGAGGTAT TGAATGACGG CTGGGCTGTG GTTGAGCAAA GAAATAGGTT TTTCAAAGGA GATACTGTTG AGGTTATACT TCCAAACGGA AAATATTTTA TACAAAAACT TAATGAAATA TACGACTTAG AAAACAATCC TTTAGATGTT GTTCCTCATG CCCAGCAGCT TACAAAAATA AAATTTGACA GACCTGTTGT AGAGTTTGCA ATGCTAAGAA AGAAGGTGGA AAATTAA
Protein sequence | MKIKKPELVA PAGDLEKLKT AILYGADSVY IGGREFGLRK YAGNFDFDEM KEGIDFAHKH GKKVYLTANI FARNEDIKKI DEFFDIIKSF EFDGIIVSDP GIFVKAKKLG IPIHISTQAN TTNYESARFW HQLGAKRIVL ARELSLDEIR EIRENVPDIL ELEAFVHGAV CISYSGRCFL SAYMTYRDAN RGECAHPCRY KYYVMEEKRP GQYFEVFEDV DGTYIFNSKD LCMVEHIDKL AFAGIDAFKI EGRMKSSFYV ATVVSVYRKA IDKFIKDPEH FEPEKEWLEE IAKCSHRSYT TNFYFGKPDH DDYRFESSKY VREYEFVGIV KEVLNDGWAV VEQRNRFFKG DTVEVILPNG KYFIQKLNEI YDLENNPLDV VPHAQQLTKI KFDRPVVEFA MLRKKVEN
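
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing statistics such as the GC content field. The following is a minimal sketch of that calculation; `gc_content` is a hypothetical helper, and the example runs on only the first three blocks, so its result is not expected to equal the 37% reported for the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAAGATAA AAAAGCCTGA ACTTGTTGCA"
print(f"{gc_content(prefix):.1%}")
```

Passing the full Gene sequence field to the same function gives the value for the whole gene.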