Gene Tpen_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0478 
Symbol 
ID4602037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp434006 
End bp435772 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content54% 
IMG OID639773246 
ProductDEAD_2 domain-containing protein 
Protein accessionYP_919890 
Protein GI119719395 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.738253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGG AATTGCGCGC ACAGGTGGAG GCTCTAATAC CCTATAAGAC TGTCAGGAAG 
GGCCAGCTCG AGCTAGCTTT GGAAGTTGCC AAGGCGTATG CCGAGAAGGC GATCCTGCTT
GCACGCTATC CTACTGGTAT AGGGAAGACG GCGGCTGTGC TTGCCGGAGC TCTCGCGTCG
GGTGCCCCCA AAGTCGTGTA TCTCGCGCGG TCTAAGTCAC AGTTCCAGGC TCCTTTAAGG
GAGGTTAAAA GGCTCTTGGA GAGGGGTATA AGCGTTCCCA CCGTAGTCCT CGTGAACAAG
AAGAACTACT GCCTACTGAG AGGGGCCCTT CCGCTGGACT ACGAGGAATT TCTCCACTTC
TGCCGGGTTA AGAGATTCAC GGGGGCGTGC CCCTACTCTT CGGAGTTTGA AGACGCAGAA
ATCCCCGTCC TGGTGACCCC TAAGAGTGCC AGGTCTTTAG GTGCGAAGCT AGGGGTATGC
CCCTTTGAGC TAGCCTGGAA GGCCTTAAGG AAAGCTAGGC TCGTCGTCGC ATCCTATCCT
TACGTCTTCC GCGAAGATCT GCGGAGGCTT CTCGTCGAAA GCCTAGGCAC CCCGATGGAT
GTTTTAATGC TCGTAGTCGA TGAGGCCCAC AACTTGCCGG ATCATATAGC CGAGTCTACC
GCGATAGCCG TCTCGGACGC AACGCTGAGA AGAGCTATTT CGGAACTGAG AGCGGCAGGC
ATAGGGCAGG AGCTTGCCTC CTCGCTTTCC AACCTATTAG GTTACATGAA GAGGATTAAA
GGAGCTGAGG AAGGTGTAAG AATACCTCTG GATGAGCTTA CGTACCTAGT TCCTCCAGCG
GACGAGGTTA AGAGGGTCGC CCTGGCTCTG GAGAAGCTTA CAGGCATGTA CTCAAGTTTG
TGGCAACTCT ACGCGTTCGC CGAGGCTTTG AGACGCGCGT CACACGAGTA TCTAGTAACC
GCAGTGTCGA GCGACGGATC GGTATCGTTG AAGCTTGTAC TTATAAATCC TGGGAAGGTG
TCTAGGGAAG TTTTCCCCAA AGTTAGAAGT GCCGTGCTGA TGTCGTCGAC GCTTATGCCG
GCAGAATACT ACGTGGCTGT CCTGGGCTTT CCCGGAGAGA GAGTAAGGGA AGTCTCCTAC
CCCTTTGTGT GGAGCGAAAA CGTTGACGTT GTCATAGTTA AGGGTATATC GTCCAGGTAC
GTCGAGCGCG GAGAGGAACT CTACAGGCGG TACGCAAGCG CAATAGACTA CATTTTCGAG
CTTCCCAGTA CTAGGAGGGC TCTAGCCATC TTCCCTTCCT ACTCCTTCAT GATGGGCGTT
TATCCCTACA TACGCTCAAA GCCCGTTATC ATCGAGAGGC GGGATTCCAC TATAGGCTCT
ATGCTTGGGA AGGTGCTCGA ACTCGAGAAA GCCCTGCTCC TCGTAGTTGC GCGCGGAAAG
TTCGCCGAGG GGGTTGAGTT CACGGTGCTT GGAAAGTCTC TCATAGACAC GATCGTGATA
GCCGGGCTAC CCGTCCCGGA GCCTTCGGTG GAGAACGAAA AGCTCTACGA GCTACTTCAA
GAAAGGCTGG GCGACAGGGA TCTTGCATGG AAGTACGTCT ACCTTTATCC CGCCTTTATG
CAAGTAGTGC AAGGCATAGG TAGGGGTGTT CGAAGCGAGA AGGATAAAGT GAAGGTGTTC
ATATTGGATG AGCGCATGAT AGGAGAGGGT GAAAAGTACC TGTCTATGTA CGGGCTCGTC
CCCCGCGTAG GGAAGCTCCC CCTGTAG
 
Protein sequence
MSEELRAQVE ALIPYKTVRK GQLELALEVA KAYAEKAILL ARYPTGIGKT AAVLAGALAS 
GAPKVVYLAR SKSQFQAPLR EVKRLLERGI SVPTVVLVNK KNYCLLRGAL PLDYEEFLHF
CRVKRFTGAC PYSSEFEDAE IPVLVTPKSA RSLGAKLGVC PFELAWKALR KARLVVASYP
YVFREDLRRL LVESLGTPMD VLMLVVDEAH NLPDHIAEST AIAVSDATLR RAISELRAAG
IGQELASSLS NLLGYMKRIK GAEEGVRIPL DELTYLVPPA DEVKRVALAL EKLTGMYSSL
WQLYAFAEAL RRASHEYLVT AVSSDGSVSL KLVLINPGKV SREVFPKVRS AVLMSSTLMP
AEYYVAVLGF PGERVREVSY PFVWSENVDV VIVKGISSRY VERGEELYRR YASAIDYIFE
LPSTRRALAI FPSYSFMMGV YPYIRSKPVI IERRDSTIGS MLGKVLELEK ALLLVVARGK
FAEGVEFTVL GKSLIDTIVI AGLPVPEPSV ENEKLYELLQ ERLGDRDLAW KYVYLYPAFM
QVVQGIGRGV RSEKDKVKVF ILDERMIGEG EKYLSMYGLV PRVGKLPL