Gene Information |
Locus tag | Tpen_1014 |
Symbol | |
ID | 4600687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 958256 |
End bp | 961096 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639773792 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_920417 |
Protein GI | 119719922 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTGGCG AAGTTGTCAA GGGGAGGGAT CGGGGAGGCT CCCGTAGGTT GGAGGAGCTC GGGGACCTAC TACCCGGAAG GATCCTCGAG GCTCTCTCCA GGTTTGGGTA CAGGGAGCTG AACTACGTGC AGGTGAAGTC GCTAGAGCTC TCCAGAGTGT ACCCCAACAT GCTCATCTCC GCCCCCACGG GTGCCGGGAA AACGGAGGCG GCGGTTCTGC CCGTTCTCCG GGATCTCGTC GAGAAGGGTG GGAAGCCCAT ATACGCGCTC TACGTCACGC CTCTACGTGC TCTCAACAGG GACATATACG ACAGGATGGT GGAGCTTTTT CGCGCCCTGG GCTTCGAGGC GGAAGTGAGG CACGGGGATA CGCCGCAGAG GGTTCGCAGG AGGATAGCCG AGTCTCCTCC GCACATGCTC ATCACCACGC CGGAGACTAC GCAGTTCCTG CTGGTGGACG GGAGGTACAG GGAGCACTTG AAGAATACCC GGTGGGTCGT AGTGGACGAG GTGCACGAGC TACTCGACGA CAAGAGGGGG GCGCAGCTAT CGCTGGCGCT CGAGAGGCTT AGGCTTATCT CGCCGGGGCT CCGCGTCGTC GGGCTCTCGG CTACGCTGAG GGACCCTGAG GCGGCGCTCA GGCTACTCTC AGGCGGGAGG GTGGGGACAG TCGTCGAGTG GTTCGAGCGT AAAGCCTACG AGCTGGTAGT AGAGGACATC GACGAGGAGG GAGATCTCGC CGGGAGGGTT AAGCGCGTCG CCGAGCTGTG CAGGGGCGGC GGCGTGATAG TGTTCACCAA TACGCGCGAC ACAGCGGAGT TCATCGGGAG GGTTCTCGCC CGCGACTACG GCTTGAACGT GAGGGTGCAC CACGGGAGCC TCTCCCGCCA GGAGAGGGAG GAGGCTGAGC GGCTCTTCAA GAGCGGCAGG GTAGACGCGA TCGTCGCTAC GTCGAGCCTA GAGCTCGGCG TGGACATAGG GTACGCGAGG CTCGTAGTGC AGTTCGGGTC GCCCAGGCAG GCAATAAAGC TGGCGCAAAG GGTGGGCAGG GCTGGGCACA GCCTCTCGGA GGTCTCCCGC GGGGTCATAG TCCCGCTGTA CCTGGAGGAC GCAGTCGAGT CCGCGGTCCT CGCGCGCAGA GTGTACCTGA GGGCGCTCGA GAAGCAGGGG TTCCACGAGA AGCCTCTCGA CGTGCTCTCG CACCAGGTGG CGGGCCTCGT GCTCGAGTAC CGGGACCTAG ACGTGTACAC GGTCCACGAG GTCGTCACGC GCTCCGCCCC GTTCCGCGAC CTTAGCCTGG GCGAGCTGAA GCAACTCCTC GAGTTCCTCG ACTCCCTAGG CGTCGTTAGG TTCGACGGAG AGCGCGTCAA GATGGGGAGG AGGACTATCT CCTACTACTA CGGCTCCGCC TCCATGATCC CGGAGACCCT ATCCTTCGAC GTCGTCGACA TGGCGTCTAG GAGGACTATA GGCGCGCTGG ACTACGCGTT CGCGTCCCTG GTCGACAAGG GCAGGGTGAT AATACTCGGC GGGAAAGCCT GGACGGTGGA GGAGGTAGAC GTGGATGCCA ACAAGGTCTA CGTAGTCGAG AACGTGGAGG AGTTCGGGGA GCCGCCCATC TGGACCGGCA TGACCCTACC CGTGGACGCG AAGGTAGCGA GGGAGGTGGG CTCCCTCTAC AGGAGGATCG GGGAAAGCCT CGGCAACCCC GAGGAGCTCG AGAAGCTGAG GAAGGAGTAC GCGATACCCG AGAAAGCCTT CGAGAAGCTC GTAAGGATAA TCGAGGAGGA GAAGAGGCTA 
CTGGGCGTAG TGCCCAGCGA CCGCAACGCC GTCGCGGAGG TCGCCAGCTA CAAGGGCAAG ACAGCTATAA CCCTGCACTC CTACCTGGGC ACGAAGGGCA ACAACCTGCT CGCGCTCCTA CTCGCACACG CTGTGCGGGG GTACACGGGG TCCTCAGCGA GGTACTTCGC AGACCCCTAC AGGGTCCTAG TGGTCACCGA GTACGAGGTA CCGGACCAGA GGGTGCTGGA GAGCCTGAGG AGCGGGCTCG AGTGGTCGCT CAGGAACCTA GAGGAGGTGG TCAGAGAGAG CAACTCGTAC GCGCTAGCGT TTTCGCACGT AGCCTCCAAG ATGGGGGTAG TCGACGTGAA GAAGTCTAAG CCGGAGCCGG GGCTGATGTC AAGCCTGAGG AAGAGGATGA GGGGAACACC CCTGGACAGG GAGGCGTTGA GGACCTGCCT CTTCGAGTAC TTCGACCTGG AGGCTGTCGA GGAGTTCGTC AGGGAGGTCT CGGAGGGCAG GAGACCCCTC GTGTTCAAGA GGCTCCCCGA GCTGAGCCCG CTGGCCCAGC TTATATTCGA GAAGCCCGTC GTAAAGGCCG GCGCGCTGGC CTCCGAGCTC CCCGTTTCCA GCATGGTCAG CGCGGTCGTG AAGAGGATTG AGAACACCCA CGTGCTACTC TACTGCGTCC ACTGCGGCAA CTGGCACGCC ACGATGAGGG TGGTGGAGGC GAAGGCGTAC CCCTCCTGCC CCAAGTGCGG CTCGAGGGCC TTAGCGGTGC TTAGGCCGTA CGAGGAGGAA AAGCTCCCGG TGCTGGAGAA GTGGCGGAAG GGTGGGAAGC TCTCCCCCGA GGAGAAGAAG TTCGTGGAGA AAGTCAGGCA GTCCGCCTCG CTGGTACTCT CCTACGGCTA CCCAGCGGTC TTCGTGCTAG CCGGCCACGG GATAGGGCCT ACCACAGCCA AGAGCATCCT TTCGAAGGGG ACGGACCCGG AGACCCTTGC GAGGAACATC CTCGTAGCCG AGGCTAACTA CACGAGGACG AGGAAATACT GGGAAGAGTA G
|
Protein sequence | MLGEVVKGRD RGGSRRLEEL GDLLPGRILE ALSRFGYREL NYVQVKSLEL SRVYPNMLIS APTGAGKTEA AVLPVLRDLV EKGGKPIYAL YVTPLRALNR DIYDRMVELF RALGFEAEVR HGDTPQRVRR RIAESPPHML ITTPETTQFL LVDGRYREHL KNTRWVVVDE VHELLDDKRG AQLSLALERL RLISPGLRVV GLSATLRDPE AALRLLSGGR VGTVVEWFER KAYELVVEDI DEEGDLAGRV KRVAELCRGG GVIVFTNTRD TAEFIGRVLA RDYGLNVRVH HGSLSRQERE EAERLFKSGR VDAIVATSSL ELGVDIGYAR LVVQFGSPRQ AIKLAQRVGR AGHSLSEVSR GVIVPLYLED AVESAVLARR VYLRALEKQG FHEKPLDVLS HQVAGLVLEY RDLDVYTVHE VVTRSAPFRD LSLGELKQLL EFLDSLGVVR FDGERVKMGR RTISYYYGSA SMIPETLSFD VVDMASRRTI GALDYAFASL VDKGRVIILG GKAWTVEEVD VDANKVYVVE NVEEFGEPPI WTGMTLPVDA KVAREVGSLY RRIGESLGNP EELEKLRKEY AIPEKAFEKL VRIIEEEKRL LGVVPSDRNA VAEVASYKGK TAITLHSYLG TKGNNLLALL LAHAVRGYTG SSARYFADPY RVLVVTEYEV PDQRVLESLR SGLEWSLRNL EEVVRESNSY ALAFSHVASK MGVVDVKKSK PEPGLMSSLR KRMRGTPLDR EALRTCLFEY FDLEAVEEFV REVSEGRRPL VFKRLPELSP LAQLIFEKPV VKAGALASEL PVSSMVSAVV KRIENTHVLL YCVHCGNWHA TMRVVEAKAY PSCPKCGSRA LAVLRPYEEE KLPVLEKWRK GGKLSPEEKK FVEKVRQSAS LVLSYGYPAV FVLAGHGIGP TTAKSILSKG TDPETLARNI LVAEANYTRT RKYWEE
|
| |
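The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon, neither of which is stated in the record itself. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 958256
END_BP = 961096
GENE_LENGTH_BP = 2841
PROTEIN_LENGTH_AA = 946


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the lengths reported in the record.
    print("gene length consistent:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length consistent:", coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for Tpen_1014: the coordinate span matches the reported 2841 bp, and 2841 bp corresponds to 946 coding codons plus one stop codon.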