Gene Tpen_0796 details


Gene Information

Locus tag: Tpen_0796
Symbol:
ID: 4601257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 750934
End bp: 752115
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 60%
IMG OID: 639773573
Product: hypothetical protein
Protein accession: YP_920201
Protein GI: 119719706
COG category: [S] Function unknown
COG ID: [COG1679] Uncharacterized conserved protein
TIGRFAM ID:
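
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 750934
end_bp = 752115

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1182 393
```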


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACCTCG ATAAGTTCGA GGAGAGGATG CTCGAAGGTG AGCTCGGAGA GGCTGTAGCC 
CTCGCTATGA GGATAGTTGT GAAGATTGCG GAGATCTTCT CCGCCGAAAG GCTCGTGAAG
ATTAAGCATG CACACGTCTC CGGGGTTTCC TACGAGAACA TAGGAGACGA GGGGTTAGAG
TTCTTGGAGG GGCTCGCCGC TAAGGGCGGG AGATTCTCCG TCCCTACAAC CGTGAATCCC
GGCGCTGTAG ACCTGGAACT GTGGAGGAAG ATGGGCGTAG ACGAGTCGTA TGTGGAGAAA
CAGCTCAGGA TAGTGGGCGC GTTTAGAAGG ATGGGGGCGA AGGTTACCTT GACGTGCACC
CCCTACCTCT ACGAGGACAT TTCCCCGGGT GACCACCTCG CGTGGTCTGA AAGCAACGCG
GTGCTCTTCG CGAACAGCGT TATCGGCGCC AGGACGAACA GGGATGGGGG ACCACTAGCA
CTGATGGAGG CTATAGCCGG GCGGGCACCC CTCTCGGGGT TGCACCTCGA CGAGAACAGG
AGACCGTCCC TCGTGGTGGA CTTCTCGGAG AGCTCGAGGT ACATCGCGGA AAACGGACTT
TTCTCCGTCG CCGGGCTCAT CGTGGGCAGG CTCGCCGGGA ACCGTGTCCC CCTGGTCCGC
GGGCTCGGCC TACAGAGAAA AGACGTAGAG GAATTAAAGC TCTTTCTGGC GGCAGTCGGA
GCCACCGGGG GGACCGGGAT GGTTCTCATA GATGGAGTTT CGCCCGAAGC CCCCGGGGAC
ATGCCAGGAG AGGTTGAGAA AATCGGCGTG GACGACGTCA AGGCGGAGTT AGAGAAGTAC
GGGGGCTCCG GGTGGGATGC AGTCGTGCTC GGGTGCCCAC ATCTGAGCTA CGAGGAGGTT
GCGTCTATCA TTGAATGGTT CGAGAGAAAA GGTAGGCCCA GGTCCCGGGT GTACCTCTAC
ACGAGCAGGG AGGTTGCCTC GAGGCTTCGA AGCGACCGCC TAGAAAAGCT GAATATACAC
TTGTTCGCCG ATACGTGCAT GGTGGTTTCC AACCTAGGGG CGTACGCCTC GCGGAGCGTC
GCGACGGATT CCGGGAAGGC TGCCTTCTAC CTAGCGTCGA AGGGCTACAG TGTCGCGCTC
CTGCCCAGGA GGAAGCTACT GGAGATGCTC GTCCAGGGGT GA
 
Protein sequence
MYLDKFEERM LEGELGEAVA LAMRIVVKIA EIFSAERLVK IKHAHVSGVS YENIGDEGLE 
FLEGLAAKGG RFSVPTTVNP GAVDLELWRK MGVDESYVEK QLRIVGAFRR MGAKVTLTCT
PYLYEDISPG DHLAWSESNA VLFANSVIGA RTNRDGGPLA LMEAIAGRAP LSGLHLDENR
RPSLVVDFSE SSRYIAENGL FSVAGLIVGR LAGNRVPLVR GLGLQRKDVE ELKLFLAAVG
ATGGTGMVLI DGVSPEAPGD MPGEVEKIGV DDVKAELEKY GGSGWDAVVL GCPHLSYEEV
ASIIEWFERK GRPRSRVYLY TSREVASRLR SDRLEKLNIH LFADTCMVVS NLGAYASRSV
ATDSGKAAFY LASKGYSVAL LPRRKLLEML VQG
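
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is illustrative only: the `translate` helper and its codon table are written for this example, and it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model).

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace.

    Hypothetical helper for illustration; trailing bases that do not
    form a full codon are dropped.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence above.
first_block = "ATGTACCTCG ATAAGTTCGA GGAGAGGATG"
last_codons = "CAGGGGTGA"

print(translate(first_block))  # MYLDKFEERM
print(translate(last_codons))  # QG*
```

The first ten residues match the start of the protein sequence, and the final codons give Q, G, and a stop, matching its last two residues.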