Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0602 |
Symbol | |
ID | 4601222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 556017 |
End bp | 558131 |
Gene Length | 2115 bp |
Protein Length | 704 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773377 |
Product | CoA-binding domain-containing protein |
Protein accession | YP_920010 |
Protein GI | 119719515 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACG AACGTATCTT GAGGCTTTTA AGGCCGGAGA GCGTAGCTGT TGTTGGCGCG TCGCGGAATC CCGAGAAGAT AGGCTACCAG GTCGTTAAGA ACCTCTTGGA GGCAGGGTTC CCGCGGGAGA GAATTTTCCC CGTAAATCCC AATGCCGACG AGATACTGGG CTTGAAGTGC TATAAATCCG TCTCGGAGAT ACCTTACCAG GTCGACTTGG TGGTTGTCGC GGTCCCAGCC CCCGCCGTTC CGGGAGTTCT AGAGGACGCT GGTCGAAAGG GTGTTAAGGC AGTTGCCGTA ATAACGAGCG GCTTCAAGGA AATTGGGAAC GTAGACTTGG AGCAACGCAT AGCGAGCATA GCGAAGAACT ACGGGATGAG ACTCCTAGGC CCGAACATAG TCGGGATATG CGACACCGTG AAGAGAGTAA ATGCAAGCTT CTGTCAAGGG CTACCTAAAC CGGGGGAGAT AGCCTTCATT ACGCAGAGCG GAGCACTGGG AATAGCGCTG GTAGGGTGGA CGAAGCTGAA GGGCATAGGG CTCTCCGACC TGGTCAGCAT AGGCAACAAA GCCGATGTTG ATGAGACAGA CCTTGTAGAG TTTTTCGGGG AAGACCAGTA CACCAAGGTT ATAACCGCAT ACCTGGAAGG GGTGAGCGAC GGGAGAAGGT TCCTCGAGGC TGCACGCCGC GTAGCGAGGC GGAAGCCAAT AATCGTACTC AAGGCGGGTA GAGGTGTAAG AACCATAGGC GCCATCAAAT CGCACACGGG TTCGCTTGCA GGGAGCTTCG CAGCTTATGA GGCGGCGTTC AAGCAGTCGG GTATCCTGCT TGCGCGTAGC TTCGTAGAGC TCTTCGACTG GGCGACTGCT TTCGCGAAGA CGCACATTCC GAGAGGGGAG AACGTCGTTG TGCTTACGAA TGGCGGAGGA GCGGGTATAA TGGCGACCGA CGCGTTGGAA GACTATGGAA TCAGGCTGAT GGACATACCC CAGGACCTAG CCGAGAAGCT GAGGAAGCAC ATGCCGCCCT TCGGGAGCGT CTTCAACCCC GTAGACCTCA CGGGCATGGC AGACGCGGAA CAGTACTACG GCGCGCTCAA AGACCTGTTG ATGCATGACT CTGTGGACGC CGTGCTGGTA CTCTACTGCC ACACAGCCAT AACGAACCCA CAGGAGATAG CGGAAGCGCT ACTAAGGGCT GTCCGGGAGA CGGAAAGCAA GAAGACAGTT CTTGCGTCGT TCATAGGCGG CGAAGAGGTA AACGAGGCGT GCGGGAAGCT TACCGAAAAC GGTATTCCGT GCTACGAGTC GCCCGAGAAG GCGGCGTCCA GCCTGGGCGC CATCTACAGG TACAAACACA TGCTCGAAAA ACTCGACAGG AGGACCGAGG TAGCCGTAAG CGTTGACGTC GAAGCTGCGC GCCACGTGGT TAGGAGCGCC TTGAAGGAGG GGAGAACGAC GCTAACGCCC TCGGAGGCCG CGGCGCTGGT CGGGTACTAT GGGATCCCGG TCTTAGCGAA GAAGATCGCT AAAAGCCCCG AGGAAGCAGT GAGGATAGCG CGAGAGATCG GGTACCCCGT GGTTCTCGAG GTGGAGTCCC CCGACATCCT CCATAAAAGC GACATCGGCG GTATACTGGT AGGGCTTCAA AGCGACGCCG AGGTAGCCGA GGGGTACAGC AAGATACTCG ACAATGTATC CAAGAAAGCT CCCAGCGCCA GGGTTAACGG TATCATCGTT AGGAAGATGG CGGAGAAAGG CAAGGAGATA GCACTGGGCG TGCACAGAGA CCCCATTTTC GGACCGCTGG TAATGGTAGG CTCCGGAGGA GTCCTCGTGG AGCTCTACAG GGATGTGAGC TTCCGCGTAG CTCCCCTGAG CCTGGAGGAC GCCTACGAAA TGCTCGAAGA GACGAAGATC TACAAGGTGC TCAAAGGATA CAGAGGAGAA CCCCCCTCCG ACTACGAAAA GGTTGTCGAC GTTCTGATAA GGCTGTCAAA ACTCGCCTGC GACGTAGAGG AGATCGAGGA CATAGACATC AACCCGTTCT TCGTATACGA GAGAGGGAAA GGAGGGATTG CGGTAGACGT CAAAGTAACC CTCGTACAGC GCTAG
|
Protein sequence | MIDERILRLL RPESVAVVGA SRNPEKIGYQ VVKNLLEAGF PRERIFPVNP NADEILGLKC YKSVSEIPYQ VDLVVVAVPA PAVPGVLEDA GRKGVKAVAV ITSGFKEIGN VDLEQRIASI AKNYGMRLLG PNIVGICDTV KRVNASFCQG LPKPGEIAFI TQSGALGIAL VGWTKLKGIG LSDLVSIGNK ADVDETDLVE FFGEDQYTKV ITAYLEGVSD GRRFLEAARR VARRKPIIVL KAGRGVRTIG AIKSHTGSLA GSFAAYEAAF KQSGILLARS FVELFDWATA FAKTHIPRGE NVVVLTNGGG AGIMATDALE DYGIRLMDIP QDLAEKLRKH MPPFGSVFNP VDLTGMADAE QYYGALKDLL MHDSVDAVLV LYCHTAITNP QEIAEALLRA VRETESKKTV LASFIGGEEV NEACGKLTEN GIPCYESPEK AASSLGAIYR YKHMLEKLDR RTEVAVSVDV EAARHVVRSA LKEGRTTLTP SEAAALVGYY GIPVLAKKIA KSPEEAVRIA REIGYPVVLE VESPDILHKS DIGGILVGLQ SDAEVAEGYS KILDNVSKKA PSARVNGIIV RKMAEKGKEI ALGVHRDPIF GPLVMVGSGG VLVELYRDVS FRVAPLSLED AYEMLEETKI YKVLKGYRGE PPSDYEKVVD VLIRLSKLAC DVEEIEDIDI NPFFVYERGK GGIAVDVKVT LVQR
|
| |