Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1798 |
Symbol | |
ID | 7408585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1870537 |
End bp | 1871730 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716175 |
Product | carboxyl-terminal protease |
Protein accession | YP_002573664 |
Protein GI | 222529782 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GACTTCAAAC TTTTGCAATA GTTCTCATAA CTGCAATTGT AACATATATT GCGACCACTT ATGTCTATTT TGGAAGTCCT ATATATACAA ACAAATTAGT AACTAATCCT AAATTATCTA AGGTTATATG GCTTTTGAAA AAATACTACT ATGAGCCTAA GGATATAAGT GACCAGAAAA TTGTAGACGG TGCAATAGAT GGGATTGCCG CAAGTGTTGG TGATCCGTAC ACTGAGTATT TTACTAAAAA AGAATATGAA GAGTTCATGA TACAAAGTAA AGGTACGTAT TTTGGGGTAG GAGTAACAAT AGAGCCTGGC GAACATTATA TCGAAGTTGT AACACCCTTT GAAGGTTCTC CGGCGTACAA GGCGGGGATA AAACCAGGGG ATAAGATTAT AAAAGTAAAT GGAATAAGTT TGACATCAAA AGATATAGAA AAGGCTGTAA GTTTGATGAG AGGGCCAAAA GGAACAAGCG TGACAGTTAC AATTTTGCGC GATGGCAGCT CAAAGCCTAT TGACCTTAAG ATTGTCAGAG ACGAGATAAA AATAAAGACT GTATCTACTT CCATTTTTGA AAACAACATA GGTTATATCA AAATCACTAA CTTTGATGAA AATACTCCTC AGGACTTTTA CAATAGCTAT GACAAACTCA AAAGCTCTGG CTGCCGTGGA CTTGTCATTG ACCTGAGATT TAACCCTGGT GGGCTTTTAG AGTCTGTTGT TGACATTGCA AGCAATTTTC TCAAGAAAGG ACAGCTTATA GTGTATCTCA AGGACAGATA CAATAACAAA GAGTATTTCA AATCATACAA AAATGGTGAC ACGGTAACAC CGCTTGTGGT GCTTACCAAT AAGTATTCAG CGTCAGCTTC AGAGATATTA GCTGGATGTT TAAAAGACCA AAAGAGGGCA AAAATTGTTG GTGAGAAGAC TTTTGGCAAA GGCGTTGTTC AGCAGGTATT TGACCTGGGA GATGGGTCTG CAATAAAAAT AACAGTAAGC CAGTATCTTT TGCCAAGTGG AGCATATATT CACAAAAGAG GAATAAAGCC AGATATTAAA GTAGTTCAAC CCAAAGAGTA TCAGGACAAA ATGAATGTTC CAATGGATAA AGATTTGCAG CTGAAAAAAG CTATTGAGAT ATTAAAGAGT GAAATTTCAA AGAGCAAGTT TTGA
|
Protein sequence | MKKRLQTFAI VLITAIVTYI ATTYVYFGSP IYTNKLVTNP KLSKVIWLLK KYYYEPKDIS DQKIVDGAID GIAASVGDPY TEYFTKKEYE EFMIQSKGTY FGVGVTIEPG EHYIEVVTPF EGSPAYKAGI KPGDKIIKVN GISLTSKDIE KAVSLMRGPK GTSVTVTILR DGSSKPIDLK IVRDEIKIKT VSTSIFENNI GYIKITNFDE NTPQDFYNSY DKLKSSGCRG LVIDLRFNPG GLLESVVDIA SNFLKKGQLI VYLKDRYNNK EYFKSYKNGD TVTPLVVLTN KYSASASEIL AGCLKDQKRA KIVGEKTFGK GVVQQVFDLG DGSAIKITVS QYLLPSGAYI HKRGIKPDIK VVQPKEYQDK MNVPMDKDLQ LKKAIEILKS EISKSKF
|
| |