Gene Tpen_1661 details

Gene Information

Locus tag: Tpen_1661
Symbol:
ID: 4601243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1608039
End bp: 1609646
Gene length: 1608 bp
Protein length: 535 aa
Translation table: 11
GC content: 64%
IMG OID: 639774434
Product: hypothetical protein
Protein accession: YP_921059
Protein GI: 119720564
COG category: [S] Function unknown
COG ID: [COG4879] Uncharacterized protein conserved in archaea
TIGRFAM ID:
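
The start, end, and length fields above are related by simple arithmetic. The Python sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative rather than part of any database API.

```python
# Values copied from the record above.
START_BP = 1608039
END_BP = 1609646


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1608 535
```

Under these assumptions the computed values agree with the listed gene length (1608 bp) and protein length (535 aa).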


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGGAGG CCGAGGTTCG CCCGACAAAT AGTTTAACTA AGCCTTTAGG CTTTGCATCG 
GCAGGAGTCA CGGAGCACTG GGGGAGGTGC TTGGCGTACC TCCAGAAGAG AAGGCTCCAG
GTAGTCACCG ACGTCCTGAG CAGGTTGCTG GCGGGCGAGG CCGCTACGAG GGACGAAGCC
ATCGGCTTGC TGCGCAGAGC ATACCTAGAG GCGGGCCTGG AGCCGCTGAG GGGGCTTTCC
ACTCCTCGGA GCTTCTACAG GGAGATGTCG CTGGTGTACG CCGTGGGCAA GTACGGGCTT
GGGCTCGACG AGGAGCTCGA AGGATTGCGG GAAATATTCG AGGTGGAGGC TCTCTGCGAC
GCGGCGCTGG GGCTCGTGAG GAGCGGCGCC GGCGTCGAGG AGTCGGTTTT CAGGGTGTTC
GGGCGCCTAA AGGAGTCCGA CGTTAGGAGT TTCCTCAACT ACGTCTTGGC GGCGAGCCTG
CTCGGCTACG TGGGCTTCGA GGAGCTCTTG AAGGTGCTGG GAGGTCTAGA GGCTTACGGC
AACGTAGCGC GGGACTACAG GATCCTGGTG GTAGCGTACA GGCTCGCGGA CTTCGTGGGC
TCCAACAAGT TTTCCAGGAG GTCCGAGAAG GAGGCGGCTA AGCTGGACCT AGCGAGGAGC
CTCGGCGACG AGAAGGCTTT GCCGTCGGAC GGGCTCGTCT GGAGGATAGC TGTAAACGTC
TACGGCGTAG ACGAGGGTGT GGCTAACCGC GTCCTCAGGA TGAGCCCCAG CGACCTCGCA
AGGGTAGTTC TCGAGTCTAG CACGTGGTGG TACGGGTACG TGGTGTCTCC CGGAGAGCTG
GAGGAATACC TCGCCGGCCT CGAGCCGGAG TGGCTCCAGG CGTACCTCTA CCTCGAAAAG
AAGCTCAGGA AGGCTTTCCC GGCTTCCTCG AGGCTAGTCG CGGCGGCAGC CGTAGACCAG
GCGAAGGCGG AGGGGGCGAG CCCCGAGTCC CTGACGGCTA GGAGCGTAAA CATCGAGAAC
CCCGTCTCCA GCCTGCTGGA GTGGGGGTGC TCCGGCTGGA GGTTTACCTA CATGAACCTG
GCTCCCAAGG GGGAGTTCGA GCTGAGGCTT GAGGACAAGC ACGAGGTAAT AGTCTTCGAC
CGCGTCCGGG CGTACGAGGC CCTCGCCTTG GGGGTCAGGA GGCTCAGGGA GAGGATAGCG
GAGAAAGCTA GCGGGGACCT CGACGTGAAG GTGAGGCTCG GCGGGAGGCT CAGCGGTATG
TGGCTCAGGG TCCAGGCCAT GCTCCTCGCG GTTAAAGTCG TCGGGGAGGC CTACGTGTTG
ACCGCGCCGC CCGCAGCCCA GGCGAGGCTC CGGGAGGGCC TCCTGGAGGA GAGGAGGATC
CCGGTGGACG GCGGGGAGCT CGTCGCGAGG CTTGTTGAGC GAGGGTTTAA CCGGTACGTC
ACGCTGGAGC TAGGCGGGAG GAGGATCGCG ACGCTGAAGC TGGGCTCCGA CCCCGGGAAG
CTGGAGGAAA AGGTCGAGAA GATACTCTCG CATAACCTCC CGAAGGGCGT GAGCGAGGAG
AAAAGGCGGC TACTGGGAGA AGAGGTGAGG AGCCTGCTCA AGCGCTGA
 
Protein sequence
MVEAEVRPTN SLTKPLGFAS AGVTEHWGRC LAYLQKRRLQ VVTDVLSRLL AGEAATRDEA 
IGLLRRAYLE AGLEPLRGLS TPRSFYREMS LVYAVGKYGL GLDEELEGLR EIFEVEALCD
AALGLVRSGA GVEESVFRVF GRLKESDVRS FLNYVLAASL LGYVGFEELL KVLGGLEAYG
NVARDYRILV VAYRLADFVG SNKFSRRSEK EAAKLDLARS LGDEKALPSD GLVWRIAVNV
YGVDEGVANR VLRMSPSDLA RVVLESSTWW YGYVVSPGEL EEYLAGLEPE WLQAYLYLEK
KLRKAFPASS RLVAAAAVDQ AKAEGASPES LTARSVNIEN PVSSLLEWGC SGWRFTYMNL
APKGEFELRL EDKHEVIVFD RVRAYEALAL GVRRLRERIA EKASGDLDVK VRLGGRLSGM
WLRVQAMLLA VKVVGEAYVL TAPPAAQARL REGLLEERRI PVDGGELVAR LVERGFNRYV
TLELGGRRIA TLKLGSDPGK LEEKVEKILS HNLPKGVSEE KRRLLGEEVR SLLKR
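
The gene and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch, not the method used by the database. It assumes the standard codon assignments (which translation table 11 shares with table 1) and treats an initial ATG, GTG, or TTG as methionine; table 11 allows further start codons that are omitted here. The names `clean`, `translate_cds`, and `gc_percent` are hypothetical helpers.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; a recognised start codon is read as M, a final stop is dropped."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence above.
print(translate_cds("GTGGTGGAGG CCGAGGTTCG CCCGACAAAT"))  # MVEAEVRPTN
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow comparison with the listed protein sequence and GC content.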