Gene Tpen_0792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0792 
Symbol 
ID4601253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp745135 
End bp746796 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content61% 
IMG OID639773569 
Productstarch synthase 
Protein accessionYP_920197 
Protein GI119719702 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0297] Glycogen synthase 
TIGRFAM ID[TIGR02095] glycogen/starch synthases, ADP-glucose type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCCGG ATAGAACGGT CTTAATGGTA GCGTTCGAAG CAACGCCCTT CGTGAAGGTC 
GGCGGCCTCG CGGAGGTTCC GAGCAACCTG GCGAAGGAGC TGGTCGCCCT AGGGTGGAGA
GCCTACCTCG CCTTGCCGTC GCACGCGCCT CCGGGCAGGA GGGGGGAGGA GCCAGTGGCC
CGCTTCGCGA CGCCTCTAGG AGAAGTCTTC GTTGGCAGGG CTTCCTGGGG CGGCGTCACG
TACCTGCTCT TCTCGGGATC CGCGCTGAGC GACGAGAGGG TGTACGCCGG CGAGGTAATG
GACCAAAAGG TGAAGCTCTT CTCGTACGGG CTATCCCTCC TACTGCTGAA CGCTGAGAAG
TACGGCGTAG TGTTCCCGGA CGTCATACAC TTCCACGACT GGCACAGCGT GTATGCGCTC
GTGAAGGTGA AGCACGACTT CCAGCAGAGG AAGGCTCGGA CGCTGTTCCA TGTCCACCTG
CTCGTGAAGA AGCACGTAGA CCCCTCGATT TTCGAGCAAC TAGGGGTACC CCTCGGCTGG
AGGCACGAGG TGAGGGTTAA CGGTAGATCC ATGGACCTAA CCCTAGGCGA CGTGCTGAGA
GTTTCAGGGG GTATCGCCGA GAAGATAGGA GCCCTCGAGG CCGACCGGCT AGTAACGGTG
AGCGAGTCCT ACCTTCTGGA CGACGTGCTC CCGTTCGTGG GAGGCGAGTT CCGCGGCAAG
TCTAGGGTAA TCTACAACGC CACCACATGG AGCCTGAGGG GGGCCTTGGA GGAGGTACTC
GGCAAGCACG GGCAAAGGCT ACGCTCCTTC GCTGGGCGTA GCGAGGGCTT CAGGAGGACC
GAGATAAGGA AGTACTTCCT GCTGAGGGCA CTTGGAGAGC TCGAGGAAGG CGAGCCCGAG
GTACCGGACG AGAGGATAAA GAAGCTCGTC TACGACCTGG CGGATCCACC CATGCGCGGT
CAAGGCAAGG TTGAACCCTT CATGTTTGAC GGCCCCCTAG CCATAACGAC CGGGAGGCTT
GCCAGGCAGA AAGGCTTTGA CTTGATGGTA GAGGCTGTGC CGAGAGTGCT GAGGGAGCTC
GGGGAGGCTA AGTTCGTCTT CCTCGTGCTA CCCGTCTGGG GCGGGGAGGA CTACGTGTAC
CAGCTTGCAG ACCTCCAGCG CGAGTACCCC GAGAACGTGC GCGCAGTCTT CGGCGTAGCG
CCCTCGATAT ACAAGCTGGC GCACCTCGCC TCCGATGTGT TCTTCGCTCC TTCGAGGTGG
GAGCCCTTCG GGATAATGGC TCTGGAAGCC ATGTCAACCG GGAACCCCCT CGTGGCTTCG
AGGACCGGGG GGTTGAAGGA GATAGTGCTG GACGTAAACG TGTACGCAGA GAGGGGTACA
GGGATCCTCG TAAGACCCGA CGACCCCTAC GAGCTCGCAG AGGCGCTGAG AGACCTCCTC
GCCTTCATGG AGGCTTCGAA CACGGGCAGG ATTGAGTGGT ACGCGGGCAA AATAGAGAAC
AAGACGCTGA GGCGCATGCT CGAAGAGTAC CCGGACGCGG GCGAGATTCT CAGGAAGAAC
TGCGTGGACA GGGTTGAAAA ACACTTCTCG TGGGCGGCCT CCGCGAGGAC GGCAGATTCT
ATCTACAGAG AGTTGCTAGA GGGAGGACAG GAGACTCTGT AG
 
Protein sequence
MNPDRTVLMV AFEATPFVKV GGLAEVPSNL AKELVALGWR AYLALPSHAP PGRRGEEPVA 
RFATPLGEVF VGRASWGGVT YLLFSGSALS DERVYAGEVM DQKVKLFSYG LSLLLLNAEK
YGVVFPDVIH FHDWHSVYAL VKVKHDFQQR KARTLFHVHL LVKKHVDPSI FEQLGVPLGW
RHEVRVNGRS MDLTLGDVLR VSGGIAEKIG ALEADRLVTV SESYLLDDVL PFVGGEFRGK
SRVIYNATTW SLRGALEEVL GKHGQRLRSF AGRSEGFRRT EIRKYFLLRA LGELEEGEPE
VPDERIKKLV YDLADPPMRG QGKVEPFMFD GPLAITTGRL ARQKGFDLMV EAVPRVLREL
GEAKFVFLVL PVWGGEDYVY QLADLQREYP ENVRAVFGVA PSIYKLAHLA SDVFFAPSRW
EPFGIMALEA MSTGNPLVAS RTGGLKEIVL DVNVYAERGT GILVRPDDPY ELAEALRDLL
AFMEASNTGR IEWYAGKIEN KTLRRMLEEY PDAGEILRKN CVDRVEKHFS WAASARTADS
IYRELLEGGQ ETL