Gene Information |
Locus tag | Tpen_0893 |
Symbol | |
ID | 4600470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 842012 |
End bp | 843928 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773672 |
Product | acetyl-CoA synthetase |
Protein accession | YP_920297 |
Protein GI | 119719802 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
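The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though they are consistent with the values listed. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 842012
END_BP = 843928
GENE_LENGTH_BP = 1917
PROTEIN_LENGTH_AA = 638


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # 843928 - 842012 + 1 = 1917 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1917 / 3 = 639 codons, 638 residues once the stop codon is excluded
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```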
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGTAC TCCCGGTCGA GGAGAGGAGG GTTTTAACGG CGAAGCTAGA CGAGATGCGG AGAAAAGCGC TTGAGGATCC CGAGTCCTTC TGGGACGAGC ATGCAAGGGC GCTGGAGTGG TACAAGATCT GGGATAAGGT CCTAGACGAT GGGGAGAAGC CTTTCTACAG GTGGTTCGTG GGGGGAAGGA TCAATGCTAG CTACAACGCC CTGGACAGGC ACGTGAAGAC CTGGAGGAAG AACAAGGTAG CGCTGATATG GGAGGGGGAG GACGGCTCGG TGAAGAAGTA CTCGTACAGG GACCTCTACG TAGAGGTAAA CAGGGTAGCG GCCCTTCTAA AGAACTTCGG GGTCAAGAAG GGGGATAGGG TCGCCCTCTA CCTTCCCATG ATCCCGGAGC TCCCGATCTT CATGCTCGCT GCCGCCAGGA TAGGGGCCCC CTTCACGGTG ATATTCTCGG GATTCTCGAG CGACTCTCTC GCCAAGAGGC TCAACGACTC TGGGGCGAAG CTCTTGGTAA CCGCCGACGG CTTCTGGAGG AGGGGGAGGG TCGTGAGGCT GAAGGATATA GCGGACAAGT CCCTAGAGCA GGCCCTGAGC GTGGAGAGTG TCCTCGTAGT GAGGCACGCG GGGGTAGACG TCGCGATGCA GGAGGGCCGC GACTACTGGT ACCACGAGGC CCTGGAGGGT ATAGGCAGGA ACACCTACGT AGAGCCGGAG AGGCTCGACT CCAACCACCC GCTCTTCATA CTCTACACCT CCGGAACTAC GGGTACGCCT AAAGGCATCT ACCACAGCAC GGGTGGCTAC CTTGTGTGGG TATACTGGAC GCTCAAGTGG GCTTTCAACC CGAACGACGA GGACATCTGG TGGTGCACGG CAGACATAGG GTGGATTACG GGGCACAGCT ACGTCGTCTT TGGCCCGCTC CTGCACGGCT TAACCACGCT TATGTACGAG GGTGCCCCGG ACTACCCGGC CCCGGACAGG TGGTGGAGCA TCATCGAGAG GCACGGGGTC ACCGTCTTCT ACACGTCTCC AACTGCCATT AGGATGTTCA TGCGCTACGG CTCGCACTGG GTCGAGAAGC ACGACCTGTC GAGCCTCAGG ATACTCGGGA GCGTCGGGGA GCCTATAAAC CCCGAAGCGT GGGAATGGTA CTTCAAGGTA GTCGGGAAGG GCAGGTGCCC GATAATCGAC ACCTGGTGGC AGACGGAGAC GGGCGGCTTC ATGATATCGC CTGCGGCGGG CATAGAGCTC GTACCACTCA AGCCGGGCTC GGCTACCCTG CCCCTACCGG GCGTAGACGC CGACGTAGTG GACGACAACG GGAACCCCGC GAAGCCAGGC GTGCAGGGCT ACCTGGTGAT CAAGCGCCCG TGGCCGGGCA TGTTGCTAGG CGTCTGGGGC GACCCGGAGA GGTACGTTAA AACGTACTGG GGCAGGTTCG ACGGGTACTA CTTCCCCGGG GACTACGCCA TGAAGGACGA GGACGGCTAC TTCTGGATCC TCGGCAGGGC CGACGAGGTC TTAAAGGTAG CCGCCCACAG GATAGGAACC ATGGAGCTTG AAAGCGCACT CGTCGAGCAC CCGGCCGTAA GCGAAGCCGC GGTCGTCGGG AAACCGGACC CCGTAAAGGG AGAGGTACCT GTCGCCTTCG TCGTACTCAA GGAAGGCTTC TCTCCGAGCG TGAAGCTCGA GGAAGAGCTT TCCAACCACG TCGCGGAGGT CATAGGTCCG ATAGCGAGGC CTGCCGCGAT AATCTTCGTC AAGAAGCTTC CGAAGACGAG GAGCGGCAAG ATAATGAGGA GGGTTTTGAA GGCGCTCGTC AGAGGGGAGG CGAGCCTCGG GGACCTCTCC ACGATAGAAG ACCCATCGGC CGTCGACGAA GTGAAGGCGG CTCTGAGGAT AGCCTAG
|
Protein sequence | MGVLPVEERR VLTAKLDEMR RKALEDPESF WDEHARALEW YKIWDKVLDD GEKPFYRWFV GGRINASYNA LDRHVKTWRK NKVALIWEGE DGSVKKYSYR DLYVEVNRVA ALLKNFGVKK GDRVALYLPM IPELPIFMLA AARIGAPFTV IFSGFSSDSL AKRLNDSGAK LLVTADGFWR RGRVVRLKDI ADKSLEQALS VESVLVVRHA GVDVAMQEGR DYWYHEALEG IGRNTYVEPE RLDSNHPLFI LYTSGTTGTP KGIYHSTGGY LVWVYWTLKW AFNPNDEDIW WCTADIGWIT GHSYVVFGPL LHGLTTLMYE GAPDYPAPDR WWSIIERHGV TVFYTSPTAI RMFMRYGSHW VEKHDLSSLR ILGSVGEPIN PEAWEWYFKV VGKGRCPIID TWWQTETGGF MISPAAGIEL VPLKPGSATL PLPGVDADVV DDNGNPAKPG VQGYLVIKRP WPGMLLGVWG DPERYVKTYW GRFDGYYFPG DYAMKDEDGY FWILGRADEV LKVAAHRIGT MELESALVEH PAVSEAAVVG KPDPVKGEVP VAFVVLKEGF SPSVKLEEEL SNHVAEVIGP IARPAAIIFV KKLPKTRSGK IMRRVLKALV RGEASLGDLS TIEDPSAVDE VKAALRIA
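The gene and protein sequences above are shown in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips the block spacing, computes the GC fraction, and translates codons until the first stop. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_fraction`, and `translate` are illustrative, not part of any database tool, and only the first 30 bases of the gene are used as sample input.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons from position 0 and stop at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, or T.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = clean_sequence("ATGGGTGTAC TCCCGGTCGA GGAGAGGAGG")
    print(translate(prefix))  # -> MGVLPVEERR
    print(round(gc_fraction(prefix), 3))
```

Applied to the full gene sequence, the same functions should let the listed protein length and GC content be checked against the record. That full-length result is not reproduced here.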