Gene Pars_1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1878 
Symbol 
ID5055719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1681267 
End bp1682715 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content59% 
IMG OID640469424 
Productstarch synthase 
Protein accessionYP_001154081 
Protein GI145592079 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0297] Glycogen synthase 
TIGRFAM ID[TIGR02095] glycogen/starch synthases, ADP-glucose type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCAC CTGAACACAT CCGGCGAGTC TACATCTTGG CTATGGAGTA CGGCGGCCTC 
ATAAAGGTGG GGGGACTGGG CGAGGCCGTT AGGCAATACG CAGTAGGCCT AGCGGCTAGG
GGGTACGACG TCACTGTGCT TATGCCGTCT CATGGCAGAC ACCTAGACCC AAACCGAGGC
TTTGACCTAT ACCCCCTAGA CTTCAGAACT TGCGGAGAGC GATGGGGTCT AGACGGGAAG
GCGTATCCAT ACTGCCTCGG CGCAGAGATT ACTTTTCAAG ACGGCGTTAA GATAGTAATG
TTTAAGGGGC TCGACTACGC CACGGGGCAC ATCTTCGACC GGTGGGGCGT TTACGAGTAC
ACGGAGGAGA AGGCGGCTCT CCTGGCCAGG GCAGTTGTGG CATTTGCCGA GAGGTTCGGC
CCCCCCGACC TAATACACAT GAACGACTGG CCCACCGTAC CTGCCGGCAT AGCCTTGAAA
GACCTTGGCG AGAGGAGGGG TCTCGCCATC CCCACGCTGT TCACGATACA CTTGTCCTGG
GACTACTCCT TCCCATGGCA CTACGCCGAG TGGTCAGGCC TGGCGGATAG GCCGCACCCA
GTGTGGCGGG TCTGTTGCCA CCGTTACGAG CACTACAGCG CCGTGTGGGA CGAGGGCGGG
GGGAGCGTGG AGAGGTTCGG CGTGGTTGAG GCAGACGCGG TGTCGACAGT GAGCTACGGG
TATCTCCAAG AGCTGTTTAG GAAATACGGA GAGTGGATTA GGGAGAAGTC GTGCGTGGTT
TACAACTCCA CTGACTGGTC TCTAAAAGAC GTAGAGGGGG TGTCGGAGTC GGACACATGG
CGTCTGGTAG AAGAGGTGGA GCGCATGGGC GTAGTGGGCT GGCTGGATAG GAGGGGCGTC
CTATTCCTAG CTGTGGGGAG AATAACATCG CAGAAGGGGT TTGACATAGC CGTCAAGGCG
CTTGACTACG CCCCCCATGC GCGGCTCTTG ATACTCGGCG TACCCGCAGG GGAGTGGGGC
TACGAGGAGT ACGTGAAGAG CCTCGTCTGG GAGCGGCGGG GCAGAGTAGC CCTCTCAACG
GCCAAAATCC CACCTAGACT CTACAAGGCG TTGCACTACG TGGCAAAGGC CTTGGTAATG
CCCTCAAGAT GGGAGCCCTT CGGCATCTCG GCCATCGAGG CTATGGCGCT GGGCACTCCA
GTAATAGCGC CGGCAGTTGG AGGACTCCCC GAGGTCGTGG GCGAATACGG CATATTAGTT
GACCCTGAAA ACCCCGAAAA GCTGGGCAAA GCCATGGAGG AGCTGGCAAC TGGCGCTGTC
AGCCTTCCCT CACGGGAGCG TATTGCCCAG TATGTCGATG CCAAGTTCAG GATGAGGAAT
ACGATAGACA TGCTCGAGCA GTGCTACCAG AGCGCGAGGC TCTTTGCATA TTACCGGGCT
CACAGCTAG
 
Protein sequence
MRAPEHIRRV YILAMEYGGL IKVGGLGEAV RQYAVGLAAR GYDVTVLMPS HGRHLDPNRG 
FDLYPLDFRT CGERWGLDGK AYPYCLGAEI TFQDGVKIVM FKGLDYATGH IFDRWGVYEY
TEEKAALLAR AVVAFAERFG PPDLIHMNDW PTVPAGIALK DLGERRGLAI PTLFTIHLSW
DYSFPWHYAE WSGLADRPHP VWRVCCHRYE HYSAVWDEGG GSVERFGVVE ADAVSTVSYG
YLQELFRKYG EWIREKSCVV YNSTDWSLKD VEGVSESDTW RLVEEVERMG VVGWLDRRGV
LFLAVGRITS QKGFDIAVKA LDYAPHARLL ILGVPAGEWG YEEYVKSLVW ERRGRVALST
AKIPPRLYKA LHYVAKALVM PSRWEPFGIS AIEAMALGTP VIAPAVGGLP EVVGEYGILV
DPENPEKLGK AMEELATGAV SLPSRERIAQ YVDAKFRMRN TIDMLEQCYQ SARLFAYYRA
HS