Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_1318 |
Symbol | |
ID | 5297680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | + |
Start bp | 1326637 |
End bp | 1328115 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640769592 |
Product | peptidase C1A, papain |
Protein accession | YP_001306552 |
Protein GI | 150021198 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0624769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA GATTAATATC CATTTTTATA ATAGTTTTGT TTGGTTTATT GTTATTTGCT AATTCCGTAG TTGAGCAAGC CATTCAATAT GCAGAAAATG TAACACAAAA AATTAAACAA TATGGATTTT TATGGTATGC AAGTCCAAAT AAAGAATTTT TTGAAGAATT TGAAAAGTTC AATGTCACTT CTATTGATGA AATAATGAGA AAAATAAATG GAACAAGAGA GTTACCAGAA GAGGTTAAAA GCAAGATATT TAATTATTTG AATTTGTTGA GAAATAATAA TGGGAAAATA GCTCCTGATA GTATGACAAC AACCGATGAT TTAATGATGG CTTCATTTAT TTTGATAAAA CCTGAAAATA TTACTGATAA GACCTTTTAT AGATTAGAAC CAGTTAAAGA CCAATATATG CATGGAAGTT GTTGGGCATT TTCTACTGCT GCAATGATGG AAAGTGCTTA TGCTGTTCAA GTTTTAAATA AAGAAGAAGG TAATATTAAT AATTTGGTGG ATTTTTCTGA ACGATGGGCA GCGTATCATA ATATTGATTG GGATGTATAT GTTAAATCAA AATACGAATA TGTTCAAGAT AAAAATTCGT TAGAAGGTGG AAATGTTTAT TTCTCATCAT ACAATATGAT TAGATATGGT ATGGTAAAAG AAGAATCAGC ACCATATGAA GATGTATATC TTGTATCTGA TGAAGTTATT CCTCTACCAC CACAAGCATA TAGAGCACCA AGAATTAAGG CAAGTAAAAC AGTAATGATT CCGGATGCGA AGAGTTCAAA GACTTTAGGT TATAGCTATG ACGATTATAT CAACATGATA AAAACAGCTT TAAAGAAGTT TGGTTCTTTA TCAGTTGCAT ATACAGTTCC AAAAGACTTT GGTTCTTACA GTAAAGGTAT CTATGTTCCA ACAACTTCAG AAAACTCAGG AGGACATGCA GTTACACTTG TTGGATGGGT TGATGGAAAA GACTTAGATG ATGTTGTTCT TGCTGAAAGG GTTGATCCAT CAGCTTCGAC TATTTTGGAT GTGGAGTTAC CTGATGGCTC CTACACGTAC TATGATCCAA CGGTAGATGC AACATTTACA ACAAATTTGT TCTGGATAAT AAAAAATTCA TGGGGATATA GCTGGGGAGA TGGAGGTTAT TATGTAGTAC CAGCAATTTC TAAAGAAGCA TATGAGAATG GCAAAGTAGG TTGGTGGATG ATAGAAAATA GAAATATGTA TATATCTATT TTTGATTCTT TAGCAAAACA TGAAGGTGAT AGCCTTGATT GTAATAATGA TGGTGTGGTA AATATTGATG ATTTTAAATA TTTAGTTTCA AAGATTGGAA CAACAAATTC CGAAGAAATT TCAAAATTTG ATATCTCTTT CCCTGAGGAT GGAAAGGTTG ATGGGAACGA TGCTGCAACA TGGGTTTATT TATATAACAA ACTATATGGT AAAAAGTAA
|
Protein sequence | MSKRLISIFI IVLFGLLLFA NSVVEQAIQY AENVTQKIKQ YGFLWYASPN KEFFEEFEKF NVTSIDEIMR KINGTRELPE EVKSKIFNYL NLLRNNNGKI APDSMTTTDD LMMASFILIK PENITDKTFY RLEPVKDQYM HGSCWAFSTA AMMESAYAVQ VLNKEEGNIN NLVDFSERWA AYHNIDWDVY VKSKYEYVQD KNSLEGGNVY FSSYNMIRYG MVKEESAPYE DVYLVSDEVI PLPPQAYRAP RIKASKTVMI PDAKSSKTLG YSYDDYINMI KTALKKFGSL SVAYTVPKDF GSYSKGIYVP TTSENSGGHA VTLVGWVDGK DLDDVVLAER VDPSASTILD VELPDGSYTY YDPTVDATFT TNLFWIIKNS WGYSWGDGGY YVVPAISKEA YENGKVGWWM IENRNMYISI FDSLAKHEGD SLDCNNDGVV NIDDFKYLVS KIGTTNSEEI SKFDISFPED GKVDGNDAAT WVYLYNKLYG KK
|
| |