Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0792 |
Symbol | |
ID | 4601253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 745135 |
End bp | 746796 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773569 |
Product | starch synthase |
Protein accession | YP_920197 |
Protein GI | 119719702 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0297] Glycogen synthase |
TIGRFAM ID | [TIGR02095] glycogen/starch synthases, ADP-glucose type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACCCGG ATAGAACGGT CTTAATGGTA GCGTTCGAAG CAACGCCCTT CGTGAAGGTC GGCGGCCTCG CGGAGGTTCC GAGCAACCTG GCGAAGGAGC TGGTCGCCCT AGGGTGGAGA GCCTACCTCG CCTTGCCGTC GCACGCGCCT CCGGGCAGGA GGGGGGAGGA GCCAGTGGCC CGCTTCGCGA CGCCTCTAGG AGAAGTCTTC GTTGGCAGGG CTTCCTGGGG CGGCGTCACG TACCTGCTCT TCTCGGGATC CGCGCTGAGC GACGAGAGGG TGTACGCCGG CGAGGTAATG GACCAAAAGG TGAAGCTCTT CTCGTACGGG CTATCCCTCC TACTGCTGAA CGCTGAGAAG TACGGCGTAG TGTTCCCGGA CGTCATACAC TTCCACGACT GGCACAGCGT GTATGCGCTC GTGAAGGTGA AGCACGACTT CCAGCAGAGG AAGGCTCGGA CGCTGTTCCA TGTCCACCTG CTCGTGAAGA AGCACGTAGA CCCCTCGATT TTCGAGCAAC TAGGGGTACC CCTCGGCTGG AGGCACGAGG TGAGGGTTAA CGGTAGATCC ATGGACCTAA CCCTAGGCGA CGTGCTGAGA GTTTCAGGGG GTATCGCCGA GAAGATAGGA GCCCTCGAGG CCGACCGGCT AGTAACGGTG AGCGAGTCCT ACCTTCTGGA CGACGTGCTC CCGTTCGTGG GAGGCGAGTT CCGCGGCAAG TCTAGGGTAA TCTACAACGC CACCACATGG AGCCTGAGGG GGGCCTTGGA GGAGGTACTC GGCAAGCACG GGCAAAGGCT ACGCTCCTTC GCTGGGCGTA GCGAGGGCTT CAGGAGGACC GAGATAAGGA AGTACTTCCT GCTGAGGGCA CTTGGAGAGC TCGAGGAAGG CGAGCCCGAG GTACCGGACG AGAGGATAAA GAAGCTCGTC TACGACCTGG CGGATCCACC CATGCGCGGT CAAGGCAAGG TTGAACCCTT CATGTTTGAC GGCCCCCTAG CCATAACGAC CGGGAGGCTT GCCAGGCAGA AAGGCTTTGA CTTGATGGTA GAGGCTGTGC CGAGAGTGCT GAGGGAGCTC GGGGAGGCTA AGTTCGTCTT CCTCGTGCTA CCCGTCTGGG GCGGGGAGGA CTACGTGTAC CAGCTTGCAG ACCTCCAGCG CGAGTACCCC GAGAACGTGC GCGCAGTCTT CGGCGTAGCG CCCTCGATAT ACAAGCTGGC GCACCTCGCC TCCGATGTGT TCTTCGCTCC TTCGAGGTGG GAGCCCTTCG GGATAATGGC TCTGGAAGCC ATGTCAACCG GGAACCCCCT CGTGGCTTCG AGGACCGGGG GGTTGAAGGA GATAGTGCTG GACGTAAACG TGTACGCAGA GAGGGGTACA GGGATCCTCG TAAGACCCGA CGACCCCTAC GAGCTCGCAG AGGCGCTGAG AGACCTCCTC GCCTTCATGG AGGCTTCGAA CACGGGCAGG ATTGAGTGGT ACGCGGGCAA AATAGAGAAC AAGACGCTGA GGCGCATGCT CGAAGAGTAC CCGGACGCGG GCGAGATTCT CAGGAAGAAC TGCGTGGACA GGGTTGAAAA ACACTTCTCG TGGGCGGCCT CCGCGAGGAC GGCAGATTCT ATCTACAGAG AGTTGCTAGA GGGAGGACAG GAGACTCTGT AG
|
Protein sequence | MNPDRTVLMV AFEATPFVKV GGLAEVPSNL AKELVALGWR AYLALPSHAP PGRRGEEPVA RFATPLGEVF VGRASWGGVT YLLFSGSALS DERVYAGEVM DQKVKLFSYG LSLLLLNAEK YGVVFPDVIH FHDWHSVYAL VKVKHDFQQR KARTLFHVHL LVKKHVDPSI FEQLGVPLGW RHEVRVNGRS MDLTLGDVLR VSGGIAEKIG ALEADRLVTV SESYLLDDVL PFVGGEFRGK SRVIYNATTW SLRGALEEVL GKHGQRLRSF AGRSEGFRRT EIRKYFLLRA LGELEEGEPE VPDERIKKLV YDLADPPMRG QGKVEPFMFD GPLAITTGRL ARQKGFDLMV EAVPRVLREL GEAKFVFLVL PVWGGEDYVY QLADLQREYP ENVRAVFGVA PSIYKLAHLA SDVFFAPSRW EPFGIMALEA MSTGNPLVAS RTGGLKEIVL DVNVYAERGT GILVRPDDPY ELAEALRDLL AFMEASNTGR IEWYAGKIEN KTLRRMLEEY PDAGEILRKN CVDRVEKHFS WAASARTADS IYRELLEGGQ ETL
|
| |