Gene Tpen_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1350 
Symbol 
ID4600874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1301268 
End bp1302689 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID639774125 
Productamidohydrolase 
Protein accessionYP_920750 
Protein GI119720255 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0120056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTAAGG AGTACGCGCT GGAGTGGATA GACGGCTACA GGGAGAGGCT GGTCAAGGTC 
TCCGACGCGA TCTGGGAGTA CGCGGAGCTC GGCCTCAGGG AGTTCAAGTC CTCCAGGCTC
CTCGCGGGCG AGCTCGAGAG GCACGGCTTC AGGGTCGAGA TGGGGGTCGC CGGGATGCCC
ACCGCCTTCG TCGCGACGTG GGGTAGCGGG AGGCCCGTCA TCGGGATCCT CGGGGAGTAC
GACGCGCTGC CGGGGCTCTC CCAGAAGGTT GTCCCGTGGA GGGAGCCCCT CGTCCCCGGG
GCGCCGGGGC ACGGGTGCGG GCACAACATC CACGGGGCTT CCGGGATGGC GGCGGCGCTC
GCCGTCAAGG CGGCGATGGA GAGGGAGGGG CTGGGGGGCA CTGTGAAGTT CTTCGGTTGC
CCGGCGGAGG AGAACTTCAG CGGCAAGGTG TTCATGGTGC GCGACGGAGT GTTCGAGGGC
GTTGACGCTG TTCTCAGCCA CCACCCCGGC GACATGAACG CGGCTACGCT TAAGAGTAGC
CTTGCGGTGA ACTCCGCGAG GTTCCACTTC TACGGGAGGG CTTCCCACGC CGGCGCGTCG
CCGGAGGAGG GGAGGAGCGC GCTCGACGCC GTCCAGCTGA TGAATATAGG CGTGGAGTTC
ATGAGGGAGC ACTTGCCGCA GGACGCGAGG GTCCACTACG TCGTGGAGAG GGGTGGGGGC
CAGCCGAACG TTGTCCCAGA GTACGCGAGG GCCTGGTACT ACGTCAGGGC GCCGGAGAGG
GAGGAGGTCG AGAGGATATA CAGCTGGGTC GTGGACATAG CGAGGGGAGC CGCGCTGATG
ACGCAGACGA GGGTCGAGGT GGAGTTCCTC GAGGGTGTCT ACAACCTCCT GCCGAACAGG
GTTCTCGCGG AGCTCGTCGT GGGGAACATG CGCGAGGTTG GGCTACCGGA GTACAGCGAG
GAGGACTTGA GGTTCGCCGA GGAGATAGCC AAGACGATAC CGAGGGAGGT GAAGGTGGGC
CAGCTGAGGA AGTCCGGGAG GCCCGGCTGG GAGCGGCTCG TGGACAAGCT CATCGACGAC
GAGGTCCCGG ACCCGTGGGG CGAGGGGACG GTGATGCACG GCTCGACGGA CGTAGCCGAC
GTCAGCTGGC AGGCGCCAAC ACTGGAGTTC AGCACCGCCG CCTGGGTCCT CGGAACCCCC
GCGCACTCCT GGCAGGCAGT CGCCCAGTCA GCCGCCGGGA TCGGGCACAA GGCGCTGATC
TTCGCGTCGA AGGTGCTGGC CGCCTCGGCC CTCGACCTGC TCACGAAGCC CGAGATCCTG
GAGAAGGCGA AGGAGGAGCA CAAGAGGAGG CTCGCCGGGC GCGTCTACAG ATCCCCGCTA
CCACCGGGCC ACAAGCCCCC GCTCGACGCG TGGGAGAAGT AG
 
Protein sequence
MVKEYALEWI DGYRERLVKV SDAIWEYAEL GLREFKSSRL LAGELERHGF RVEMGVAGMP 
TAFVATWGSG RPVIGILGEY DALPGLSQKV VPWREPLVPG APGHGCGHNI HGASGMAAAL
AVKAAMEREG LGGTVKFFGC PAEENFSGKV FMVRDGVFEG VDAVLSHHPG DMNAATLKSS
LAVNSARFHF YGRASHAGAS PEEGRSALDA VQLMNIGVEF MREHLPQDAR VHYVVERGGG
QPNVVPEYAR AWYYVRAPER EEVERIYSWV VDIARGAALM TQTRVEVEFL EGVYNLLPNR
VLAELVVGNM REVGLPEYSE EDLRFAEEIA KTIPREVKVG QLRKSGRPGW ERLVDKLIDD
EVPDPWGEGT VMHGSTDVAD VSWQAPTLEF STAAWVLGTP AHSWQAVAQS AAGIGHKALI
FASKVLAASA LDLLTKPEIL EKAKEEHKRR LAGRVYRSPL PPGHKPPLDA WEK