Gene Information |
Locus tag | Tpen_1875 |
Symbol | |
ID | 4600333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008696 |
Strand | - |
Start bp | 20180 |
End bp | 22081 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639772473 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_919133 |
Protein GI | 119709793 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
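
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon. Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(20180, 22081)
print(length_bp)                  # 1902
print(protein_length(length_bp))  # 633
```

Under those assumptions the computed values agree with the Gene Length (1902 bp) and Protein Length (633 aa) fields of this record.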
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGAGA AAAAGGCGTG GGGGGCTATA GGAAAAGAAG TTAGCCCCGA CTCTAATCCT CTAAGCGCCT ACAATAGCGA GGTAGAGCGA TTGCGCCTAG CCATAAGGCG TAACCGCGAT TTAGCCCCTA GCCTTGAATT CGAAATAGCT AGGGCTATAG CCTCGCGCTT AGCCCTTATA AGGGCTAAAG GGCTAGACAC ACAGGAAATC GTCAACGACA TATGCGACGC CGTAGATTCT AGTCTTTCGC TAGGCGAGAA TTTGGCTATC GTCGAAAAAA TCCTCGCTAA ATGGCTAGGC GGGGCTTATT ATTCTGATAA TAATAGCCCC GAGGAAGTCG AAGCGCAGTA CAATTCATGG CTTGAGAGCC GCCTCGCGGT GTTAGCCGAC AAGGTAGAGC TAGGCGAGGC TAGCCCCGAG GAAATCGAGG AAGCTAAGCA AATCCTAGCC CAGAATCCTC GCCTAAAAGA CAAATACGCA GAGCTAGCCC AAAAGCTAGG CGTAGAGCAG TACGTAGACT GGAGACAAAT CCTAGCCCCA CAAGCCATAA GCGAGCTAGA GCTAACCGAG GAAGAAGCCA GCACCCAGAT AATCCTAGGC GTGGACACGT CGATATGGGG CTATGAACTG CTAGCCGACC TAGCAGGCGA CGAGGAAAGG CTCAAAGAAC TTTTAAGGGA GCAGGGGCTA ACCGAGAGAG ACTACACGGC TAGAGTTTTA AGCGAGCGCA TTAGGAAAAT ACGCGTGGCT AAAATTGTCG TGCCGAAGAA AATGCGCGAC TACCAGAGGA TAGCGCTTAG ATGGCTACTC AGCGCGCACG GCGCCGTGCA AATCCCCACA GGCGGCGGAA AGACGCTTAT CGGTAGCACG CTAGCCGTGA ACCTAGCGCT ATGGGGCTAT GACACGGCTA TTATCGTGCC CACGCAAGTG CTGGCTGAGC AGTGGGCTAG CCACTTTGAG AGGGAGTGGA GGCTCAGAGT GAACGTGAAG TACGGGGGAG TGAACAGGCG AGGACAAGGG CTAGGCGAGC TAATCACGGT AATGGTCTAC AACACTTTCG TGAGCGAGGT TAAGCGGGGC TACACGCCTT TCGCAGTGAT AGTTGACGAA AGCCATCATG CAGGGGCTAG GGAGCTAGCG AGGACTCTGG CGAGGCTGGC TAGGCTAGGC GTGCACATCT ATGGCTTTAG CGCGACGCAC AAGCGCGAGG ACACAGAAGA GCAGTGGGTG CTTGACATGC TACTACCTAA GCGCTATGAG GTAAGCCCGA GCGAGCTACA GGAGCAAGGC TACATGGTGC CAATCGAGCT AGTGGGGATA AGAGTGACGG AGACACAGGA GTTCTACCAG CAGTACGAGG AGATACAGGA GAAGATTAGG CGCCTAAGCA GGCGCATAGA GGACGGGGAG AAACACTTGA AGAAGGACTT GATGAAGCTG GTAAACTTAA GGAAACAGCT AGCCGCGAGG AGCCCTGTTA AGTGGAGCTC GGCGCTAGCG CTTATCAACA GGCTAGCACA GGAGCACAAC AGAGTACTAG TGTGGACTGA GAGCCAAGAC GTGGCGTTAA AGCTAGCCCA GACACTCGGA GGAAAAGCGA TACTCAGCAA GACGCCTAAG ACCGAGCGCG CGAGGATACT CAAAGAGTGG GGCTCAACCT TCAGAGTACT AGTGACGTGC AGAGTGCTAG ACGAGGGAGT AGACGTACCC GAGGTAAGCG TAGGCGTAAT GCTAGCGAGC GGTACGACTG ACAGACAGCT CATACAGAGA GCAGGGAGGC TATTGAGACC GGCGCCTGGA 
AAGACGAAAG CCACGCTGTT CTACGTCTAC GTGGCTTACA CGCACGAGGA GACGGCGTTT CACAAGCTAA AGGCGATATT CGCGAGGAGG GGATTGCAGT GA
|
Protein sequence | MREKKAWGAI GKEVSPDSNP LSAYNSEVER LRLAIRRNRD LAPSLEFEIA RAIASRLALI RAKGLDTQEI VNDICDAVDS SLSLGENLAI VEKILAKWLG GAYYSDNNSP EEVEAQYNSW LESRLAVLAD KVELGEASPE EIEEAKQILA QNPRLKDKYA ELAQKLGVEQ YVDWRQILAP QAISELELTE EEASTQIILG VDTSIWGYEL LADLAGDEER LKELLREQGL TERDYTARVL SERIRKIRVA KIVVPKKMRD YQRIALRWLL SAHGAVQIPT GGGKTLIGST LAVNLALWGY DTAIIVPTQV LAEQWASHFE REWRLRVNVK YGGVNRRGQG LGELITVMVY NTFVSEVKRG YTPFAVIVDE SHHAGARELA RTLARLARLG VHIYGFSATH KREDTEEQWV LDMLLPKRYE VSPSELQEQG YMVPIELVGI RVTETQEFYQ QYEEIQEKIR RLSRRIEDGE KHLKKDLMKL VNLRKQLAAR SPVKWSSALA LINRLAQEHN RVLVWTESQD VALKLAQTLG GKAILSKTPK TERARILKEW GSTFRVLVTC RVLDEGVDVP EVSVGVMLAS GTTDRQLIQR AGRLLRPAPG KTKATLFYVY VAYTHEETAF HKLKAIFARR GLQ
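
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below is illustrative only: it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in which start codons are permitted). The helper name `translate` is hypothetical, and whitespace between the 10-base blocks is simply ignored.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# of translation table 11 are identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Whitespace is removed first, and a trailing incomplete codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First two 10-base blocks of the gene sequence in this record
print(translate("ATGCGAGAGA AAAAGGCGTG"))  # MREKKA
# Last twelve bases of the gene sequence in this record
print(translate("GGATTGCAGT GA"))          # GLQ*
```

The two outputs match the start (MREKKA...) and end (...GLQ) of the protein sequence listed above, with the final `*` corresponding to the TGA stop codon.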