Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1497 |
Symbol | |
ID | 4601406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1447203 |
End bp | 1448516 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639774272 |
Product | peptidase U34, dipeptidase |
Protein accession | YP_920897 |
Protein GI | 119720402 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGCGATA CTCTGGTTGC CCTGAGGGGG GCGACGAGGG ACGGAGTAAC GATCTTCGCG AAGAACAGTG ACCGCGAGCC GAACGAGGCC CAGGTGCTCG AGTTCGTCCC GAGGATGCGG CACACGGAGG AAAGGGTGCG CGTTACGTAC GTGGAGGTTG AACAGGTAGA CGAGACCTAC GCCGTCCTCA TATCGAGGCC TTTCTGGATG TGGGGAGCGG AGATGGGCGT CAACGAGTTC GGGGTGGCGG TTGGCAACGA GGCTGTGTTT ACCCGCGGGG GCTACTCGAA GACGGGGCTG ACGGGCATGG ATCTCGTGAG GCTCGCGCTC GAGAGGTCTA GGACTGCGCG GGAAGCCTTG AAGTGGATAA CCTCCCTGCT CGAGGAGTAC GGGCAGGGCG GTAACTGTAG CTACTTCAGG AAGATGTTCT ACAACAACTC CTTCCTGGTG GCGGATCCCC GGGAGGCGTG GGTGCTGGAG ACTGTGGGAC GCGAGTGGGT CGCTGAGCGC GTAGAGAGCG TTAGGTCGAT TTCGAACGCG CTCACCATAG GCGAGAGGTG GGACTCCTCC TCGCCGGGGC TCGAGGAAGC CGCCAGGAGG CTGGGTTGCC GCTCCCCGGT GAACTTCAGG GAGTGCTTCT CCGACTTCCT CTACACGAGG GTCTCGAAGG GCAGGGAGAG GCACAGGTAC ACGCAGGGAG AGCTGGAGAA GGCTGTCGGG AAGATAGACT TTTTCTTCGT GGCGAGCCTC CTCAGGCGCC ACTCCAGGGA GCCGTACGAG CCGTCGAGGG GGTCCAACGC GGATATATGC ATGCATGCCG GGGGCCTCAC GAGGCCTTCC CAGACGGCGG CTTCCATGAT AGCGCTGCTA TACGAGGAGG CGCCGGTAGC GTTCGTCACG GGGACCTCGA CGCCGTGTAT AAGCGCGTAC AAGCCGGTCT TCCTCTCGGC GGGGCTCCCG GACCTCGGGC CTAAGCCTTC CCACGTCTTC GACGGGGGCG CGAGCTACTG GTGGAAGCAT GAGCTTCTCA GCAGGAAGCT CCTCTGCGGG TACTCCAGGT ACGCCGGCGT CGTGGCGCGG GAGATGGAGA GGGTGGAGAG GAAGTACTTC GAGAAAGCCA TGGAGGCGAG GACGGGCTAC CTCAAGGGGC TCGTAGGCGC CGAGGAGCTC AGGAGGATAA CCGCGGAGGC TTTCCGCGAA GCGGCGGAAG TCGAGGAGAG GCTAGCCGCG GAGGTGAAGG CTTCCAGGTG CCTTAACCCG CTCTACGCGC TCTACTGGCG AAGGATTAAC AGAGAAGCCT CCTTGACGGC CTAG
|
Protein sequence | MCDTLVALRG ATRDGVTIFA KNSDREPNEA QVLEFVPRMR HTEERVRVTY VEVEQVDETY AVLISRPFWM WGAEMGVNEF GVAVGNEAVF TRGGYSKTGL TGMDLVRLAL ERSRTAREAL KWITSLLEEY GQGGNCSYFR KMFYNNSFLV ADPREAWVLE TVGREWVAER VESVRSISNA LTIGERWDSS SPGLEEAARR LGCRSPVNFR ECFSDFLYTR VSKGRERHRY TQGELEKAVG KIDFFFVASL LRRHSREPYE PSRGSNADIC MHAGGLTRPS QTAASMIALL YEEAPVAFVT GTSTPCISAY KPVFLSAGLP DLGPKPSHVF DGGASYWWKH ELLSRKLLCG YSRYAGVVAR EMERVERKYF EKAMEARTGY LKGLVGAEEL RRITAEAFRE AAEVEERLAA EVKASRCLNP LYALYWRRIN REASLTA
|
| |