Gene Tpen_1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1729 
Symbol 
ID4601754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1672406 
End bp1674325 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content61% 
IMG OID639774502 
Producthypothetical protein 
Protein accessionYP_921127 
Protein GI119720632 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.159419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAG CGGCTCTCGC GGTCCTCGTA CTGCTCGTAG CGGCTACGCT CGTACCCCCG 
CAGCGCCAGG CGAGCGTCGA GGCGAAGCCG CCCGTAGACT GGCTGGAGCT AGCGAGGGCG
GCGTGGGGCT ACTTCTCCCC CGGCTTCGGG CTGAGCCAGA GGGGGATCAA CTACGCGACG
CCCTCCTGGC ACTACGTGAC GGACTGGGAC GTCGGCAGCT ACCTCTCGGC GATAGTCGAC
GCGGCGTGGC TCGGGCTGAT ATCCAGGGAT GAGGCTATCA GCAGGGCCGA AAAAGTGCTC
GCCTTCCTCT CCACGAGGCC GCTACACCCC TCCGGCGTGC CGTACTCGGC GTACAGCTCG
GACACGGGCA TGCCGGCGGA GAATGCGGGG CCCTCGAACC CCAGCGACGC CGGGAGGTTG
CTGATAGCCC TCTACAGGCT GAAGAAGAGC TTCCCGGAGC TCGCGCCGAC CGTGGACTAC
GTCGTCGAGA GGAACGGCTT CTCGGCCTTC GCGGGCTCGG TGCCGGACAG CGGCTTCTAC
TCGTACTACT ACGCCTACGG CTTCCACCTC TGGGGCTTCA ACACCCCCCA GGTCATGAAG
GCCCTCTCGA TGCTCGGGAG GCTACCCTAC ACGAGGACTG TCGACGCCTA CGGGGTCCGG
CTACCCTACG TGGAGGTAAC GATGGAGCCG ATACTCCTCA CGATCTTCGA GCTAGACCCC
CCGCCCGAGT TCTACGAGTG GGCGTACAAG GTCTACAAGG CGCAGGAAAA CCGCTACCTC
GCCACGGGTA AGCCCACCGC GTTCACGGAG GGTCAGGTCA ACGCGCCCCC GTACTACATC
TACGAGTGGA TAGTCGACAT ATACACGGGC GAGACGTGGA CCGTGTGGAG CGGCTCCCTC
GGAAAGCTCA GCATGACGCC CGTAGTATAC GCCAAGGCGG CTCTAGGCAT GCACGCCATC
TGGAACACCA ACTACACAGC CTTCCTAGCG GAGTACGTGA TGAAGGCTAA AACGCCCAAC
TGCTTCTACG AGGGAGTCGA CGAAAACGGC AACGTCGTCT ACGCGATAAC CGACAAGACG
AACGCCATGA TAGTGAGCGC CGCGAGGTAC GCACTGCAGA GAGCAAGCAA GCCCTCGGTA
ACGGCGGGAG CGGTCCCAGC CCTCTACCCA GGCGAAAACG CGACGATAAC GCTCAACGTA
ACCCACCAGC TACCCCTACC CATCACCCTA AGCGCGGAAG CCCCACCCGG GATAACAGCT
GAGGTCGAGC CCAGCACCGG GAAAGCCAAC CTAACGGCGA GGCTAAAGGT ATCGGCGCGG
CAAGGGCTAG CCCCCGGCAA CTACACCGTC ACAGTAAAAG TCTCGACGAT AGCGCACAAC
GAGACGCTAA CCCTCACCGT GACAGTCAAG CCCCCCGGCT ACACCCTCAG AGTAAGAGTA
GTAGACGCCT GCGGAGACCC AGTACCCGGC GCAACGCTAC TACTAAACGG GCTCAAAGCA
GGAGAAACCG ACGCCAAGGG AGAAGCCGAG GTAAAACACG TGGAAGGAGA AGCAACGCTC
ACCGCGATCT ACGCAGGGCT AGAAGTAGCC GGGCCACTAA AAATCAGCGT AAACTCCGAC
ACCAACGCGA CGCTCAAGGC AAACCTCCGA AAAATAGCAG TCGCCTTCAC AACCCCCGAC
GGAAAACCCG CAACCGGGAT ACTCGTAGTA GCACTGCTAG GGCGAACAAC CCTCTCAACC
GCAAAGACGA ACTCCACGGG GCACGCACTA CTACCGAGAA TACCCCCCGC AAACATAACC
CTACAAGCCT ACACACCCGA CGGAAAGCTA CTACTAGGAG AATGGACCGT GAACGCCGCG
CAAGGCGAAG GAGTAGTAGA CCCAGAAATC CCCCCAACCA CCAGACACCT AGAGGCATAG
 
Protein sequence
MKKAALAVLV LLVAATLVPP QRQASVEAKP PVDWLELARA AWGYFSPGFG LSQRGINYAT 
PSWHYVTDWD VGSYLSAIVD AAWLGLISRD EAISRAEKVL AFLSTRPLHP SGVPYSAYSS
DTGMPAENAG PSNPSDAGRL LIALYRLKKS FPELAPTVDY VVERNGFSAF AGSVPDSGFY
SYYYAYGFHL WGFNTPQVMK ALSMLGRLPY TRTVDAYGVR LPYVEVTMEP ILLTIFELDP
PPEFYEWAYK VYKAQENRYL ATGKPTAFTE GQVNAPPYYI YEWIVDIYTG ETWTVWSGSL
GKLSMTPVVY AKAALGMHAI WNTNYTAFLA EYVMKAKTPN CFYEGVDENG NVVYAITDKT
NAMIVSAARY ALQRASKPSV TAGAVPALYP GENATITLNV THQLPLPITL SAEAPPGITA
EVEPSTGKAN LTARLKVSAR QGLAPGNYTV TVKVSTIAHN ETLTLTVTVK PPGYTLRVRV
VDACGDPVPG ATLLLNGLKA GETDAKGEAE VKHVEGEATL TAIYAGLEVA GPLKISVNSD
TNATLKANLR KIAVAFTTPD GKPATGILVV ALLGRTTLST AKTNSTGHAL LPRIPPANIT
LQAYTPDGKL LLGEWTVNAA QGEGVVDPEI PPTTRHLEA