Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1350 |
Symbol | |
ID | 4600874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1301268 |
End bp | 1302689 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774125 |
Product | amidohydrolase |
Protein accession | YP_920750 |
Protein GI | 119720255 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0120056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTAAGG AGTACGCGCT GGAGTGGATA GACGGCTACA GGGAGAGGCT GGTCAAGGTC TCCGACGCGA TCTGGGAGTA CGCGGAGCTC GGCCTCAGGG AGTTCAAGTC CTCCAGGCTC CTCGCGGGCG AGCTCGAGAG GCACGGCTTC AGGGTCGAGA TGGGGGTCGC CGGGATGCCC ACCGCCTTCG TCGCGACGTG GGGTAGCGGG AGGCCCGTCA TCGGGATCCT CGGGGAGTAC GACGCGCTGC CGGGGCTCTC CCAGAAGGTT GTCCCGTGGA GGGAGCCCCT CGTCCCCGGG GCGCCGGGGC ACGGGTGCGG GCACAACATC CACGGGGCTT CCGGGATGGC GGCGGCGCTC GCCGTCAAGG CGGCGATGGA GAGGGAGGGG CTGGGGGGCA CTGTGAAGTT CTTCGGTTGC CCGGCGGAGG AGAACTTCAG CGGCAAGGTG TTCATGGTGC GCGACGGAGT GTTCGAGGGC GTTGACGCTG TTCTCAGCCA CCACCCCGGC GACATGAACG CGGCTACGCT TAAGAGTAGC CTTGCGGTGA ACTCCGCGAG GTTCCACTTC TACGGGAGGG CTTCCCACGC CGGCGCGTCG CCGGAGGAGG GGAGGAGCGC GCTCGACGCC GTCCAGCTGA TGAATATAGG CGTGGAGTTC ATGAGGGAGC ACTTGCCGCA GGACGCGAGG GTCCACTACG TCGTGGAGAG GGGTGGGGGC CAGCCGAACG TTGTCCCAGA GTACGCGAGG GCCTGGTACT ACGTCAGGGC GCCGGAGAGG GAGGAGGTCG AGAGGATATA CAGCTGGGTC GTGGACATAG CGAGGGGAGC CGCGCTGATG ACGCAGACGA GGGTCGAGGT GGAGTTCCTC GAGGGTGTCT ACAACCTCCT GCCGAACAGG GTTCTCGCGG AGCTCGTCGT GGGGAACATG CGCGAGGTTG GGCTACCGGA GTACAGCGAG GAGGACTTGA GGTTCGCCGA GGAGATAGCC AAGACGATAC CGAGGGAGGT GAAGGTGGGC CAGCTGAGGA AGTCCGGGAG GCCCGGCTGG GAGCGGCTCG TGGACAAGCT CATCGACGAC GAGGTCCCGG ACCCGTGGGG CGAGGGGACG GTGATGCACG GCTCGACGGA CGTAGCCGAC GTCAGCTGGC AGGCGCCAAC ACTGGAGTTC AGCACCGCCG CCTGGGTCCT CGGAACCCCC GCGCACTCCT GGCAGGCAGT CGCCCAGTCA GCCGCCGGGA TCGGGCACAA GGCGCTGATC TTCGCGTCGA AGGTGCTGGC CGCCTCGGCC CTCGACCTGC TCACGAAGCC CGAGATCCTG GAGAAGGCGA AGGAGGAGCA CAAGAGGAGG CTCGCCGGGC GCGTCTACAG ATCCCCGCTA CCACCGGGCC ACAAGCCCCC GCTCGACGCG TGGGAGAAGT AG
|
Protein sequence | MVKEYALEWI DGYRERLVKV SDAIWEYAEL GLREFKSSRL LAGELERHGF RVEMGVAGMP TAFVATWGSG RPVIGILGEY DALPGLSQKV VPWREPLVPG APGHGCGHNI HGASGMAAAL AVKAAMEREG LGGTVKFFGC PAEENFSGKV FMVRDGVFEG VDAVLSHHPG DMNAATLKSS LAVNSARFHF YGRASHAGAS PEEGRSALDA VQLMNIGVEF MREHLPQDAR VHYVVERGGG QPNVVPEYAR AWYYVRAPER EEVERIYSWV VDIARGAALM TQTRVEVEFL EGVYNLLPNR VLAELVVGNM REVGLPEYSE EDLRFAEEIA KTIPREVKVG QLRKSGRPGW ERLVDKLIDD EVPDPWGEGT VMHGSTDVAD VSWQAPTLEF STAAWVLGTP AHSWQAVAQS AAGIGHKALI FASKVLAASA LDLLTKPEIL EKAKEEHKRR LAGRVYRSPL PPGHKPPLDA WEK
|
| |