Gene Tpen_0791 details


Gene Information

Locus tag: Tpen_0791
Symbol:
ID: 4601140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 743342
End bp: 745138
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 60%
IMG OID: 639773568
Product: glycoside hydrolase family protein
Protein accession: YP_920196
Protein GI: 119719701
COG category: [S] Function unknown
COG ID: [COG1543] Uncharacterized conserved protein
TIGRFAM ID:
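
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(743342, 745138)
print(length, protein_length_aa(length))  # prints: 1797 598
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.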


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACTCTCC ACTTTAGTGG TCCTCTCTTG ATGTACTGGC GTGAGCTTTA CCCCGACTTC 
CTCGCTAGGC TTAGAGAGAC TGTCTCGAGG AGCGAGTTCG AGGTTCTCGG CGGTACGTAC
TCTGAAAGCG TGCTGTCGCT GCTTCCGTGG GAGGACAGGG TCTTGCAGCT CAAGAAGGGG
CGCGAGCTGG TAGAGGAGAC CTTGGGGGTC TCTCCCCGGG GTCTCTGGAT CCCCGAGAGG
GTCTGGGACC CCACGCTTCC CCCCGCGATA AGCGAGGCTG GGTACAGCTA TGTGATAGTA
GACGACGAGG TCGGGTATAG GTCTGGTCTT TGGAAAGACG ACGTCCACAG GGCCGTCTTG
ACGGAGTACA GCGGGAGGAG GGTGGGAGTG CTCTTTATCG ACGGGCCCGT GAGGTACATA
CTGCCGTGGA AAGCACCCGG AGAGGTTCTA GGCTACATAA GATCCTTCGC CACGGAGGAC
GGGCGACTCT ACGTGCTGTG GGGCTCCGAC GCCGAGAAGT TCGGAGAGTG GTGGGACGCG
AGGGCCGCCG AGCAGTGGCT ACGCACGTTC TTCGGCATGC TTAAGGGGGA CTCGTCCGTG
GCGCTCCTAA CGCCGTCCGA GTACATCCTG AGGCACGGGT ACCAGGGGCT CGCGTACCTC
GCCCCGGGGA GCTACGACAA GATGATGGAG TGGAGTGGAG GCTACTTCCC CAACTTCCTG
AGGAAGTACA GGGAGACTAA CAACATGCAC AAGAAGATGC TGTACGTGAG GGGAAAGCTA
TCCCTTCTCA AGGCGTCGAG GGAGGCGTGG GAGGAGTACC TCAAGGCGCA GTGCAACGAC
GCTTACTGGC ACGGGCTGTT CGGAGGAGTC TACATACCGT TCCTCAGGCA GGCGGTGTTC
GAACACTTGG TAAGGGCGGA GCGCTTAGCC GAGGAGGAGT CCGGCTACTA CCTGGGGAAC
TCCTCCGTCG TTAGAAGCCT GGACTTCGAC TTCGATGGCG TGGACGAGGT GCTCATCGAG
GAGAAAGAGG TCAACGCCTA CGTAAAGCCG AGCGACGGAG GCTCTCTCTT CGAGCTAGAC
GTAAAGATGC CGGGGAAGGA GCACAACCTT CTCGCCACGA TGTCGAGGTA CAGGGAGCCG
TACCTAGAGG ATCAGAAATC GGTTGTCCCC GACTGGTACA GGCGCGTCGC CTTCAGGGAG
CACATCTGGC GCAAGGACGC CTCGAGCGCC GACTGGATTA ACAACACGCC GTTCGTAGAC
GTGAGCGACT TCGCCCTGGG CAACTACATC GTAGAGGCTG TAGAGGGCAA CAAGCTAGTG
CTCTCCTTTA CGGGTAGGGA CTGGAGCGAC AGGAGGAGGC CGGCGCGGAT ACACCTCGTG
AAGACCTACG AGGTGCTGGG CTCCCAGAGG ACTGTCAGGG TGCGCTACAG GTGGCGCAAC
ATGGAGAGAA GGTTCATAGA CCCCAAGCTC TCGGTGGAGG TCAGCCTGTT CCCCAGGCTG
AGCTACGAGG AGGACTCCGA CCCTACGTAC ACCGTGGACG GCTCGCAGAG GTTGAGCGTG
AGGGAGGGCT TCTCCTCTCC CTGGGCCAGG ACGGTGAGGG TAGAGTCGCC GGCCTTCCCG
ACGGTCACCG TTGAGAGCTC GAGGCACGCA GAGGTATGGG TATCCCCCAT CCTCAGCTGG
TACAGGACGG AGAAGGGGTT GCGGAGCGAG TACCAGGGGC TAGCCGTTTC CTTTAACTAC
GCGGTAGCGC TAAACCCAGG GGAAACCTTC GAAACGGAGG TGTCCCTGTC TTGGTGA
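
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be removed before any calculation. As a hedged illustration of how a GC percentage such as the one in the Gene Information section could be computed, here is a small Python sketch. The helper names are hypothetical, and only the first displayed line of the gene is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Strip the display whitespace and return one upper-case string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (sample input only).
first_line = "TTGACTCTCC ACTTTAGTGG TCCTCTCTTG ATGTACTGGC GTGAGCTTTA CCCCGACTTC"
seq = join_blocks(first_line)
print(len(seq), gc_percent(seq))
```

Passing all lines of the gene sequence to the same functions gives a value for the whole gene that can be compared with the GC content field of the record.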
 
Protein sequence
MTLHFSGPLL MYWRELYPDF LARLRETVSR SEFEVLGGTY SESVLSLLPW EDRVLQLKKG 
RELVEETLGV SPRGLWIPER VWDPTLPPAI SEAGYSYVIV DDEVGYRSGL WKDDVHRAVL
TEYSGRRVGV LFIDGPVRYI LPWKAPGEVL GYIRSFATED GRLYVLWGSD AEKFGEWWDA
RAAEQWLRTF FGMLKGDSSV ALLTPSEYIL RHGYQGLAYL APGSYDKMME WSGGYFPNFL
RKYRETNNMH KKMLYVRGKL SLLKASREAW EEYLKAQCND AYWHGLFGGV YIPFLRQAVF
EHLVRAERLA EEESGYYLGN SSVVRSLDFD FDGVDEVLIE EKEVNAYVKP SDGGSLFELD
VKMPGKEHNL LATMSRYREP YLEDQKSVVP DWYRRVAFRE HIWRKDASSA DWINNTPFVD
VSDFALGNYI VEAVEGNKLV LSFTGRDWSD RRRPARIHLV KTYEVLGSQR TVRVRYRWRN
MERRFIDPKL SVEVSLFPRL SYEEDSDPTY TVDGSQRLSV REGFSSPWAR TVRVESPAFP
TVTVESSRHA EVWVSPILSW YRTEKGLRSE YQGLAVSFNY AVALNPGETF ETEVSLSW
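
The protein sequence above corresponds to the gene sequence read with translation table 11. The Python sketch below shows one way to reproduce the first residues. It assumes the codon assignments and start codons of NCBI translation table 11, and that a start codon such as TTG in the first position is read as methionine. The function name and the choice to translate only the first three blocks are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon in the first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First three blocks of the gene sequence.
print(translate_cds("TTGACTCTCC ACTTTAGTGG TCCTCTCTTG"))  # prints: MTLHFSGPLL
```

The output matches the first block of the protein sequence, including the initial M encoded by TTG.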