Gene Tpen_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1875 
Symbol 
ID4600333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008696 
Strand
Start bp20180 
End bp22081 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content54% 
IMG OID639772473 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_919133 
Protein GI119709793 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGAGA AAAAGGCGTG GGGGGCTATA GGAAAAGAAG TTAGCCCCGA CTCTAATCCT 
CTAAGCGCCT ACAATAGCGA GGTAGAGCGA TTGCGCCTAG CCATAAGGCG TAACCGCGAT
TTAGCCCCTA GCCTTGAATT CGAAATAGCT AGGGCTATAG CCTCGCGCTT AGCCCTTATA
AGGGCTAAAG GGCTAGACAC ACAGGAAATC GTCAACGACA TATGCGACGC CGTAGATTCT
AGTCTTTCGC TAGGCGAGAA TTTGGCTATC GTCGAAAAAA TCCTCGCTAA ATGGCTAGGC
GGGGCTTATT ATTCTGATAA TAATAGCCCC GAGGAAGTCG AAGCGCAGTA CAATTCATGG
CTTGAGAGCC GCCTCGCGGT GTTAGCCGAC AAGGTAGAGC TAGGCGAGGC TAGCCCCGAG
GAAATCGAGG AAGCTAAGCA AATCCTAGCC CAGAATCCTC GCCTAAAAGA CAAATACGCA
GAGCTAGCCC AAAAGCTAGG CGTAGAGCAG TACGTAGACT GGAGACAAAT CCTAGCCCCA
CAAGCCATAA GCGAGCTAGA GCTAACCGAG GAAGAAGCCA GCACCCAGAT AATCCTAGGC
GTGGACACGT CGATATGGGG CTATGAACTG CTAGCCGACC TAGCAGGCGA CGAGGAAAGG
CTCAAAGAAC TTTTAAGGGA GCAGGGGCTA ACCGAGAGAG ACTACACGGC TAGAGTTTTA
AGCGAGCGCA TTAGGAAAAT ACGCGTGGCT AAAATTGTCG TGCCGAAGAA AATGCGCGAC
TACCAGAGGA TAGCGCTTAG ATGGCTACTC AGCGCGCACG GCGCCGTGCA AATCCCCACA
GGCGGCGGAA AGACGCTTAT CGGTAGCACG CTAGCCGTGA ACCTAGCGCT ATGGGGCTAT
GACACGGCTA TTATCGTGCC CACGCAAGTG CTGGCTGAGC AGTGGGCTAG CCACTTTGAG
AGGGAGTGGA GGCTCAGAGT GAACGTGAAG TACGGGGGAG TGAACAGGCG AGGACAAGGG
CTAGGCGAGC TAATCACGGT AATGGTCTAC AACACTTTCG TGAGCGAGGT TAAGCGGGGC
TACACGCCTT TCGCAGTGAT AGTTGACGAA AGCCATCATG CAGGGGCTAG GGAGCTAGCG
AGGACTCTGG CGAGGCTGGC TAGGCTAGGC GTGCACATCT ATGGCTTTAG CGCGACGCAC
AAGCGCGAGG ACACAGAAGA GCAGTGGGTG CTTGACATGC TACTACCTAA GCGCTATGAG
GTAAGCCCGA GCGAGCTACA GGAGCAAGGC TACATGGTGC CAATCGAGCT AGTGGGGATA
AGAGTGACGG AGACACAGGA GTTCTACCAG CAGTACGAGG AGATACAGGA GAAGATTAGG
CGCCTAAGCA GGCGCATAGA GGACGGGGAG AAACACTTGA AGAAGGACTT GATGAAGCTG
GTAAACTTAA GGAAACAGCT AGCCGCGAGG AGCCCTGTTA AGTGGAGCTC GGCGCTAGCG
CTTATCAACA GGCTAGCACA GGAGCACAAC AGAGTACTAG TGTGGACTGA GAGCCAAGAC
GTGGCGTTAA AGCTAGCCCA GACACTCGGA GGAAAAGCGA TACTCAGCAA GACGCCTAAG
ACCGAGCGCG CGAGGATACT CAAAGAGTGG GGCTCAACCT TCAGAGTACT AGTGACGTGC
AGAGTGCTAG ACGAGGGAGT AGACGTACCC GAGGTAAGCG TAGGCGTAAT GCTAGCGAGC
GGTACGACTG ACAGACAGCT CATACAGAGA GCAGGGAGGC TATTGAGACC GGCGCCTGGA
AAGACGAAAG CCACGCTGTT CTACGTCTAC GTGGCTTACA CGCACGAGGA GACGGCGTTT
CACAAGCTAA AGGCGATATT CGCGAGGAGG GGATTGCAGT GA
 
Protein sequence
MREKKAWGAI GKEVSPDSNP LSAYNSEVER LRLAIRRNRD LAPSLEFEIA RAIASRLALI 
RAKGLDTQEI VNDICDAVDS SLSLGENLAI VEKILAKWLG GAYYSDNNSP EEVEAQYNSW
LESRLAVLAD KVELGEASPE EIEEAKQILA QNPRLKDKYA ELAQKLGVEQ YVDWRQILAP
QAISELELTE EEASTQIILG VDTSIWGYEL LADLAGDEER LKELLREQGL TERDYTARVL
SERIRKIRVA KIVVPKKMRD YQRIALRWLL SAHGAVQIPT GGGKTLIGST LAVNLALWGY
DTAIIVPTQV LAEQWASHFE REWRLRVNVK YGGVNRRGQG LGELITVMVY NTFVSEVKRG
YTPFAVIVDE SHHAGARELA RTLARLARLG VHIYGFSATH KREDTEEQWV LDMLLPKRYE
VSPSELQEQG YMVPIELVGI RVTETQEFYQ QYEEIQEKIR RLSRRIEDGE KHLKKDLMKL
VNLRKQLAAR SPVKWSSALA LINRLAQEHN RVLVWTESQD VALKLAQTLG GKAILSKTPK
TERARILKEW GSTFRVLVTC RVLDEGVDVP EVSVGVMLAS GTTDRQLIQR AGRLLRPAPG
KTKATLFYVY VAYTHEETAF HKLKAIFARR GLQ