Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0319 |
Symbol | |
ID | 4602087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 284593 |
End bp | 285843 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773079 |
Product | peptidase M28 |
Protein accession | YP_919731 |
Protein GI | 119719236 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAAGCC TGGACTACGA GTACGCGTAT AGAATGGCAG TTCTCATCTC CGAGAGGCCC CGCTTCACGG GAACGGAGGG AGAGAGGACA GCCAGGGAGG CTATCAAGGA GGAGCTCGAG AAGCATGGGT ACAGCGTGAG CCTAGAGAAA TTCTCCACGA AGACATACGA GGTCGTCGAG TCGGAACTCG TGATAACAGA GCCGTATCTC GGAAGGGTAG AGGCGTCCGC GCTGGGATTC AGCGGGGAGA CCCCCGCTGA GGGCGTTGAG GGGGAACTCG TATACCTGGA GAACACGGAC CCGGTGCTCA TACCGGAGGA GGACGGCTGG ATAGGAATAG TCGTTCAGAG ACCCTCGAAG GAGGGCTGGC AGAGACTTGT GAAGAAGGCT GGTGGACTCG TAATAGCCGA GAGTACTCCT TACAGGGGGC TGAGCAGGGT CGCCGTTCCT TACGAGTGGA GGGAGAAGAT AGGCTCTCTT CCCAGCGTCT ACGTGAAGTA CCGGGACGCG GTGAGGATGC TGACAGCGAG GAGAGCTAGG CTAAAACTGA CACAAGTCTA CAGGGACGTG GACACGTACA ACATTATAGC CGAGGTCAAG GGGTACAAGT ACCCAGACGA AATAGTCTAC CTTACGGCGC ACTACGACAG CGTGATGGGG GTCCCTGGGG CGACGGATAA CGCCGGTGGC ACAGCGTTGC TCCTTGCACT AGCTAAAGCT CTCGCAGGCT TCAAGCCTAA GAGAACTGTG CGTTTCGCGT TCTTCGCGGC GGAGGAGCTC GGGCTACGCG GCTCCCTCTT TCACGTCGGC TCGTTGAACG AAGAGGAGAA GAAAAAGATC AAAGTGGTGG TCAACCTCGA CGTACACGGC GGAGCCCTTG GAAGCAGCGC CGCCGTCATC AGTGGACCTA AGAGCCTTAG GTACTTCGCC GAGATACACG CTAAGAAGCT AGGCGTGAAC CTGAGCATAT CCGAGGATAT CATGAGCAGC GATGGTACTT CCTTCGTAAA GCACGGCATC CCCGCCGTTA ACCTCTATAG ATCCAGCGGT AGCGGGGCAG ACATACACAC AGAGAGCGAC TCCCCGGAGC ACCTACACCC GTTGGCCTTC AAGGTGATAG GGCACTACGC GCTCCACCTG GTCACGGAAC TGCTGTCCGC CGAAGAAATA CCGTTTGAGC GCGAAATACC CGAAGAGATA AAGAAGAAGG CCGAGGAATA CTTCGCCAAG AGACTGGGGG TTGACGGCTA G
|
Protein sequence | MQSLDYEYAY RMAVLISERP RFTGTEGERT AREAIKEELE KHGYSVSLEK FSTKTYEVVE SELVITEPYL GRVEASALGF SGETPAEGVE GELVYLENTD PVLIPEEDGW IGIVVQRPSK EGWQRLVKKA GGLVIAESTP YRGLSRVAVP YEWREKIGSL PSVYVKYRDA VRMLTARRAR LKLTQVYRDV DTYNIIAEVK GYKYPDEIVY LTAHYDSVMG VPGATDNAGG TALLLALAKA LAGFKPKRTV RFAFFAAEEL GLRGSLFHVG SLNEEEKKKI KVVVNLDVHG GALGSSAAVI SGPKSLRYFA EIHAKKLGVN LSISEDIMSS DGTSFVKHGI PAVNLYRSSG SGADIHTESD SPEHLHPLAF KVIGHYALHL VTELLSAEEI PFEREIPEEI KKKAEEYFAK RLGVDG
|
| |