Gene Tpen_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0738 
Symbol 
ID4601145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp685436 
End bp687424 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content68% 
IMG OID639773514 
Producthypothetical protein 
Protein accessionYP_920143 
Protein GI119719648 
COG category[S] Function unknown 
COG ID[COG2433] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000149054 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGTTCA GGCGGGTGCT AGGGCTGGAT ATACTGCCGG GTAGCTCTCC CCTCGGGAGG 
CAACACCTCT TCGCCGCCGT GCTCCTGGTG GACGGCAGGG TGGAGCAGAG GGTTCGGGAA
GCCTCCCTAG AGGACGTCGT GAGGCTCGCC ACCTCGGGCG GGGTCGAGGC GCTGGCGCTG
GACAACGTGT TCGAGCTGGC GCCCACCGTG GAGGGGCTCG CGGAGTTCTT GAGGCTGTTC
CCGGGGAGGC CGCCGCGCCT GATCCAGGTG ACAGTCGTCA ACGGCGAGGA GGTTAGCGTC
GAGACCCTCT GCGCCGTTAC GGGGCTGTGT AGGGGTAGGC TCGACCCCCT CGGGACGGCG
GAGGCGTGCG CCCTACTCGC GTACGCCGGC GTCGGTAGCG AGGTCCTCGT CTTTCACGAC
GAGACGGTGG TGCACGTTGG CCGCGGCAGG GTGCCCGGGC AGGGAGGGAT GAGCAGGGAG
AGGTTCAAGA GGGGCATCGA GGTCCTCGTG AAGCGGAAGG TTAGGGAGAT AGCGGAGGCG
CTACAGAGAA AGGGGCTCGA CTTCGACGTT TTCCCCAGGA AGAGCGGGGA GGGGCTCGTC
GGCGCGACGT TCATCGTGTA CGCGCCGCGC GAGGAACTCA ACGGCGTCGT TAAGAGCGAG
GAGGGGCACG ACCTCTTCGT CAGGGTGGAG CCTGCGAGGA GGGATAGGGT GGAGTTCAGG
CCCCTGGGCT CCAGGCTCCA CAGGGCGCTC TCCCAGGAGC GCCTCCTCAT AGTGGGCGTC
GACCCGGGGA TGGCCACGGG CTTCGCCGTG CTGGACTTCT CCGGGAGGGT GCTCGCGGTC
GACAGCAGGA GGCTCCTCGG CAGGGGGCAG CTCGTCAGGG AGCTCTACGG CTTCGGGAGG
CCGGCTATAG TCGCGACGGA CGTTAACCCT CCACCCGCCT ACGTGAAGAA GCTGGCGTCG
ACGCTCGGCG CCGTGCTGTA CGTCCCCAGC CGCTCGCTGA GCGTGGAGGA GAAGCGGAGG
CTCGCGCTGG AGGCCGCGGG GCAGCAGGGG GTGAGGCTTA GGACTTCCCA CGAGAGGGAC
TCCCTAGCCG CCGCCTACAA GGCTTTCCTG TCCTACAGGG AGCTCTTCGA GGAGGTGGAG
AGGGAGGCTG CTAGGTACGG GGTGCCGTTC TCCCTGGACG AGGCGAAGCT CCTCGCGGTA
AAGGGTAAGC CCGTGGCCCT CGCCGTCGAG GAGGCTCTGA GGAGGCAGGT GGGGGTGAAG
ATCCCGAGGA TCGAGCTCCA GAAGGAGGCG GAGGAGCCCC GGAGGGTCGA GGAGGAGCTG
GAGGAGGTAC GGCGCGCCCT CTCGGAGCTC CTCCGGGAGA ACGTCGAGCT TAGGCGGAGG
CTTGAGGAGG CGGAGGAGAG GGCTAGGAGG AGCGAGGAGG CGCTTAGGGG GCTGCTGAGG
GCTAGGGAGG TCGCGAGGGG CCTCGAGTCG GAGTACGCGA AGCTCCGGGC GAGGATAGAG
CTACTCCAGT CGGAGCTAGA CTCCTTCAGG AGGGAGCTAG CAGAAAAGGA GAGGGCGCTG
GAGTCCCTGG GCGACGCGTT GCTCTCCTAC CTCTCCGGGG AGGCGGTAGT CGCGGTGAGG
CTTTCCTACG TCCTCGAACG TGGCGCCGCG AGGGTTCCGG CGGTCTACGT TGACAAGCAG
TTGCCGTCCG ACTCCCTGAG GAGGGTTCTC GAGGAGCTGA AGCCGCCGGG CTCTCATCTC
ATCGCCTTCT TCGAGGGGGC GGCGCGCGGG GCCGCCGAGA GGCTCCCGCT CGGCGTGGTT
CCGGTCGCCC TCGGAGAGGT TAAGCCTTTA GCCGAGGTTG GACCCTTCGT CTTCGTGCCT
GCCGAGACAG CGCTCGGCGC CGCCACTGCT TCGCGCGACG CCGATAAGGA GAGGCTTAGG
CGCCTCCTGG AGGACTACAG GTTGCAGAGA AAGAGGGAGT TGGAGGGGCT TTCCGCCGGG
CGCCTCTAG
 
Protein sequence
MAFRRVLGLD ILPGSSPLGR QHLFAAVLLV DGRVEQRVRE ASLEDVVRLA TSGGVEALAL 
DNVFELAPTV EGLAEFLRLF PGRPPRLIQV TVVNGEEVSV ETLCAVTGLC RGRLDPLGTA
EACALLAYAG VGSEVLVFHD ETVVHVGRGR VPGQGGMSRE RFKRGIEVLV KRKVREIAEA
LQRKGLDFDV FPRKSGEGLV GATFIVYAPR EELNGVVKSE EGHDLFVRVE PARRDRVEFR
PLGSRLHRAL SQERLLIVGV DPGMATGFAV LDFSGRVLAV DSRRLLGRGQ LVRELYGFGR
PAIVATDVNP PPAYVKKLAS TLGAVLYVPS RSLSVEEKRR LALEAAGQQG VRLRTSHERD
SLAAAYKAFL SYRELFEEVE REAARYGVPF SLDEAKLLAV KGKPVALAVE EALRRQVGVK
IPRIELQKEA EEPRRVEEEL EEVRRALSEL LRENVELRRR LEEAEERARR SEEALRGLLR
AREVARGLES EYAKLRARIE LLQSELDSFR RELAEKERAL ESLGDALLSY LSGEAVVAVR
LSYVLERGAA RVPAVYVDKQ LPSDSLRRVL EELKPPGSHL IAFFEGAARG AAERLPLGVV
PVALGEVKPL AEVGPFVFVP AETALGAATA SRDADKERLR RLLEDYRLQR KRELEGLSAG
RL