Gene Tpen_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1497 
Symbol 
ID4601406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1447203 
End bp1448516 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content63% 
IMG OID639774272 
Productpeptidase U34, dipeptidase 
Protein accessionYP_920897 
Protein GI119720402 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGCGATA CTCTGGTTGC CCTGAGGGGG GCGACGAGGG ACGGAGTAAC GATCTTCGCG 
AAGAACAGTG ACCGCGAGCC GAACGAGGCC CAGGTGCTCG AGTTCGTCCC GAGGATGCGG
CACACGGAGG AAAGGGTGCG CGTTACGTAC GTGGAGGTTG AACAGGTAGA CGAGACCTAC
GCCGTCCTCA TATCGAGGCC TTTCTGGATG TGGGGAGCGG AGATGGGCGT CAACGAGTTC
GGGGTGGCGG TTGGCAACGA GGCTGTGTTT ACCCGCGGGG GCTACTCGAA GACGGGGCTG
ACGGGCATGG ATCTCGTGAG GCTCGCGCTC GAGAGGTCTA GGACTGCGCG GGAAGCCTTG
AAGTGGATAA CCTCCCTGCT CGAGGAGTAC GGGCAGGGCG GTAACTGTAG CTACTTCAGG
AAGATGTTCT ACAACAACTC CTTCCTGGTG GCGGATCCCC GGGAGGCGTG GGTGCTGGAG
ACTGTGGGAC GCGAGTGGGT CGCTGAGCGC GTAGAGAGCG TTAGGTCGAT TTCGAACGCG
CTCACCATAG GCGAGAGGTG GGACTCCTCC TCGCCGGGGC TCGAGGAAGC CGCCAGGAGG
CTGGGTTGCC GCTCCCCGGT GAACTTCAGG GAGTGCTTCT CCGACTTCCT CTACACGAGG
GTCTCGAAGG GCAGGGAGAG GCACAGGTAC ACGCAGGGAG AGCTGGAGAA GGCTGTCGGG
AAGATAGACT TTTTCTTCGT GGCGAGCCTC CTCAGGCGCC ACTCCAGGGA GCCGTACGAG
CCGTCGAGGG GGTCCAACGC GGATATATGC ATGCATGCCG GGGGCCTCAC GAGGCCTTCC
CAGACGGCGG CTTCCATGAT AGCGCTGCTA TACGAGGAGG CGCCGGTAGC GTTCGTCACG
GGGACCTCGA CGCCGTGTAT AAGCGCGTAC AAGCCGGTCT TCCTCTCGGC GGGGCTCCCG
GACCTCGGGC CTAAGCCTTC CCACGTCTTC GACGGGGGCG CGAGCTACTG GTGGAAGCAT
GAGCTTCTCA GCAGGAAGCT CCTCTGCGGG TACTCCAGGT ACGCCGGCGT CGTGGCGCGG
GAGATGGAGA GGGTGGAGAG GAAGTACTTC GAGAAAGCCA TGGAGGCGAG GACGGGCTAC
CTCAAGGGGC TCGTAGGCGC CGAGGAGCTC AGGAGGATAA CCGCGGAGGC TTTCCGCGAA
GCGGCGGAAG TCGAGGAGAG GCTAGCCGCG GAGGTGAAGG CTTCCAGGTG CCTTAACCCG
CTCTACGCGC TCTACTGGCG AAGGATTAAC AGAGAAGCCT CCTTGACGGC CTAG
 
Protein sequence
MCDTLVALRG ATRDGVTIFA KNSDREPNEA QVLEFVPRMR HTEERVRVTY VEVEQVDETY 
AVLISRPFWM WGAEMGVNEF GVAVGNEAVF TRGGYSKTGL TGMDLVRLAL ERSRTAREAL
KWITSLLEEY GQGGNCSYFR KMFYNNSFLV ADPREAWVLE TVGREWVAER VESVRSISNA
LTIGERWDSS SPGLEEAARR LGCRSPVNFR ECFSDFLYTR VSKGRERHRY TQGELEKAVG
KIDFFFVASL LRRHSREPYE PSRGSNADIC MHAGGLTRPS QTAASMIALL YEEAPVAFVT
GTSTPCISAY KPVFLSAGLP DLGPKPSHVF DGGASYWWKH ELLSRKLLCG YSRYAGVVAR
EMERVERKYF EKAMEARTGY LKGLVGAEEL RRITAEAFRE AAEVEERLAA EVKASRCLNP
LYALYWRRIN REASLTA