Gene Tpen_1014 details


Gene Information

Locus tag: Tpen_1014
Symbol:
ID: 4600687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 958256
End bp: 961096
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 63%
IMG OID: 639773792
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_920417
Protein GI: 119719922
COG category: [R] General function prediction only
COG ID: [COG1201] Lhr-like helicases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGGCG AAGTTGTCAA GGGGAGGGAT CGGGGAGGCT CCCGTAGGTT GGAGGAGCTC 
GGGGACCTAC TACCCGGAAG GATCCTCGAG GCTCTCTCCA GGTTTGGGTA CAGGGAGCTG
AACTACGTGC AGGTGAAGTC GCTAGAGCTC TCCAGAGTGT ACCCCAACAT GCTCATCTCC
GCCCCCACGG GTGCCGGGAA AACGGAGGCG GCGGTTCTGC CCGTTCTCCG GGATCTCGTC
GAGAAGGGTG GGAAGCCCAT ATACGCGCTC TACGTCACGC CTCTACGTGC TCTCAACAGG
GACATATACG ACAGGATGGT GGAGCTTTTT CGCGCCCTGG GCTTCGAGGC GGAAGTGAGG
CACGGGGATA CGCCGCAGAG GGTTCGCAGG AGGATAGCCG AGTCTCCTCC GCACATGCTC
ATCACCACGC CGGAGACTAC GCAGTTCCTG CTGGTGGACG GGAGGTACAG GGAGCACTTG
AAGAATACCC GGTGGGTCGT AGTGGACGAG GTGCACGAGC TACTCGACGA CAAGAGGGGG
GCGCAGCTAT CGCTGGCGCT CGAGAGGCTT AGGCTTATCT CGCCGGGGCT CCGCGTCGTC
GGGCTCTCGG CTACGCTGAG GGACCCTGAG GCGGCGCTCA GGCTACTCTC AGGCGGGAGG
GTGGGGACAG TCGTCGAGTG GTTCGAGCGT AAAGCCTACG AGCTGGTAGT AGAGGACATC
GACGAGGAGG GAGATCTCGC CGGGAGGGTT AAGCGCGTCG CCGAGCTGTG CAGGGGCGGC
GGCGTGATAG TGTTCACCAA TACGCGCGAC ACAGCGGAGT TCATCGGGAG GGTTCTCGCC
CGCGACTACG GCTTGAACGT GAGGGTGCAC CACGGGAGCC TCTCCCGCCA GGAGAGGGAG
GAGGCTGAGC GGCTCTTCAA GAGCGGCAGG GTAGACGCGA TCGTCGCTAC GTCGAGCCTA
GAGCTCGGCG TGGACATAGG GTACGCGAGG CTCGTAGTGC AGTTCGGGTC GCCCAGGCAG
GCAATAAAGC TGGCGCAAAG GGTGGGCAGG GCTGGGCACA GCCTCTCGGA GGTCTCCCGC
GGGGTCATAG TCCCGCTGTA CCTGGAGGAC GCAGTCGAGT CCGCGGTCCT CGCGCGCAGA
GTGTACCTGA GGGCGCTCGA GAAGCAGGGG TTCCACGAGA AGCCTCTCGA CGTGCTCTCG
CACCAGGTGG CGGGCCTCGT GCTCGAGTAC CGGGACCTAG ACGTGTACAC GGTCCACGAG
GTCGTCACGC GCTCCGCCCC GTTCCGCGAC CTTAGCCTGG GCGAGCTGAA GCAACTCCTC
GAGTTCCTCG ACTCCCTAGG CGTCGTTAGG TTCGACGGAG AGCGCGTCAA GATGGGGAGG
AGGACTATCT CCTACTACTA CGGCTCCGCC TCCATGATCC CGGAGACCCT ATCCTTCGAC
GTCGTCGACA TGGCGTCTAG GAGGACTATA GGCGCGCTGG ACTACGCGTT CGCGTCCCTG
GTCGACAAGG GCAGGGTGAT AATACTCGGC GGGAAAGCCT GGACGGTGGA GGAGGTAGAC
GTGGATGCCA ACAAGGTCTA CGTAGTCGAG AACGTGGAGG AGTTCGGGGA GCCGCCCATC
TGGACCGGCA TGACCCTACC CGTGGACGCG AAGGTAGCGA GGGAGGTGGG CTCCCTCTAC
AGGAGGATCG GGGAAAGCCT CGGCAACCCC GAGGAGCTCG AGAAGCTGAG GAAGGAGTAC
GCGATACCCG AGAAAGCCTT CGAGAAGCTC GTAAGGATAA TCGAGGAGGA GAAGAGGCTA
CTGGGCGTAG TGCCCAGCGA CCGCAACGCC GTCGCGGAGG TCGCCAGCTA CAAGGGCAAG
ACAGCTATAA CCCTGCACTC CTACCTGGGC ACGAAGGGCA ACAACCTGCT CGCGCTCCTA
CTCGCACACG CTGTGCGGGG GTACACGGGG TCCTCAGCGA GGTACTTCGC AGACCCCTAC
AGGGTCCTAG TGGTCACCGA GTACGAGGTA CCGGACCAGA GGGTGCTGGA GAGCCTGAGG
AGCGGGCTCG AGTGGTCGCT CAGGAACCTA GAGGAGGTGG TCAGAGAGAG CAACTCGTAC
GCGCTAGCGT TTTCGCACGT AGCCTCCAAG ATGGGGGTAG TCGACGTGAA GAAGTCTAAG
CCGGAGCCGG GGCTGATGTC AAGCCTGAGG AAGAGGATGA GGGGAACACC CCTGGACAGG
GAGGCGTTGA GGACCTGCCT CTTCGAGTAC TTCGACCTGG AGGCTGTCGA GGAGTTCGTC
AGGGAGGTCT CGGAGGGCAG GAGACCCCTC GTGTTCAAGA GGCTCCCCGA GCTGAGCCCG
CTGGCCCAGC TTATATTCGA GAAGCCCGTC GTAAAGGCCG GCGCGCTGGC CTCCGAGCTC
CCCGTTTCCA GCATGGTCAG CGCGGTCGTG AAGAGGATTG AGAACACCCA CGTGCTACTC
TACTGCGTCC ACTGCGGCAA CTGGCACGCC ACGATGAGGG TGGTGGAGGC GAAGGCGTAC
CCCTCCTGCC CCAAGTGCGG CTCGAGGGCC TTAGCGGTGC TTAGGCCGTA CGAGGAGGAA
AAGCTCCCGG TGCTGGAGAA GTGGCGGAAG GGTGGGAAGC TCTCCCCCGA GGAGAAGAAG
TTCGTGGAGA AAGTCAGGCA GTCCGCCTCG CTGGTACTCT CCTACGGCTA CCCAGCGGTC
TTCGTGCTAG CCGGCCACGG GATAGGGCCT ACCACAGCCA AGAGCATCCT TTCGAAGGGG
ACGGACCCGG AGACCCTTGC GAGGAACATC CTCGTAGCCG AGGCTAACTA CACGAGGACG
AGGAAATACT GGGAAGAGTA G
 
Protein sequence
MLGEVVKGRD RGGSRRLEEL GDLLPGRILE ALSRFGYREL NYVQVKSLEL SRVYPNMLIS 
APTGAGKTEA AVLPVLRDLV EKGGKPIYAL YVTPLRALNR DIYDRMVELF RALGFEAEVR
HGDTPQRVRR RIAESPPHML ITTPETTQFL LVDGRYREHL KNTRWVVVDE VHELLDDKRG
AQLSLALERL RLISPGLRVV GLSATLRDPE AALRLLSGGR VGTVVEWFER KAYELVVEDI
DEEGDLAGRV KRVAELCRGG GVIVFTNTRD TAEFIGRVLA RDYGLNVRVH HGSLSRQERE
EAERLFKSGR VDAIVATSSL ELGVDIGYAR LVVQFGSPRQ AIKLAQRVGR AGHSLSEVSR
GVIVPLYLED AVESAVLARR VYLRALEKQG FHEKPLDVLS HQVAGLVLEY RDLDVYTVHE
VVTRSAPFRD LSLGELKQLL EFLDSLGVVR FDGERVKMGR RTISYYYGSA SMIPETLSFD
VVDMASRRTI GALDYAFASL VDKGRVIILG GKAWTVEEVD VDANKVYVVE NVEEFGEPPI
WTGMTLPVDA KVAREVGSLY RRIGESLGNP EELEKLRKEY AIPEKAFEKL VRIIEEEKRL
LGVVPSDRNA VAEVASYKGK TAITLHSYLG TKGNNLLALL LAHAVRGYTG SSARYFADPY
RVLVVTEYEV PDQRVLESLR SGLEWSLRNL EEVVRESNSY ALAFSHVASK MGVVDVKKSK
PEPGLMSSLR KRMRGTPLDR EALRTCLFEY FDLEAVEEFV REVSEGRRPL VFKRLPELSP
LAQLIFEKPV VKAGALASEL PVSSMVSAVV KRIENTHVLL YCVHCGNWHA TMRVVEAKAY
PSCPKCGSRA LAVLRPYEEE KLPVLEKWRK GGKLSPEEKK FVEKVRQSAS LVLSYGYPAV
FVLAGHGIGP TTAKSILSKG TDPETLARNI LVAEANYTRT RKYWEE