Gene Apar_1047 details

Gene Information

Locus tag: Apar_1047
Symbol:
ID: 8413920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1186100
End bp: 1187599
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 46%
IMG OID: 645022636
Product: glycogen/starch synthase, ADP-glucose type
Protein accession: YP_003180066
Protein GI: 257784849
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0297] Glycogen synthase
TIGRFAM ID: [TIGR02095] glycogen/starch synthases, ADP-glucose type
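
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record is laid out, not something the record states. A minimal sketch of that arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1186100
end_bp = 1187599

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1500 bp) and Protein Length (499 aa).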


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAG TCTCCAAGCG TCGCAAGCTT CGTGTTGTTT ATGCAACTTC TGAGGTAGCG 
CCTTTTTCCA AGACGGGTGG CCTTGGAGAT GTCTCGGGTT CATTGCCACA AGCACTTAAA
AAAGTAGGAG CTAGAGTGGC GGTAATTTCG CCACTCTATA GCGCTATCAA GCCTGAATGG
CGAGAAAAGA TGAAGAAAGT CTGTGAGCTT CAGGTGCCCC TTTCGTGGCG TTTTGAGTAT
TGTGGGCTTT GGCATCTTGT TTATGAGGGC GTTGATTTTT ACTTTGTAGA TAACGAGTTG
TACTTTGCTC GTGATGGTCT CTATGGTTAT TTTGATGATG GAGAGCGCTA TGCGTTCTTC
TCTAAGGCTC TTTGTGAGCT TATTGCACAC GTTCCAGAGC TCTCCTGTGA CGTTTTACAC
TGTAATGACT GGCAAACTGC GCTAGCACCT GTATTTTTGC GTGAGCAGTA TCAGAGCGTC
CCTGAGGTTC AGAACGTTAA GACGGTTTTC TCCATTCATA ATGTTATGTT CCAAGGTCAG
TTTACTGACA AGATGCTGAG CGATGTTCTT GGCCTTGCTG ATATCCCAGC TGCAGTTGAT
CAGCTGAGGT GTGATGCAAG TTCTATCAAT TTCATGAAGG GTGCGCTTTG CTACTCTGAT
TATCTACTTA CCGTTAGCCC AACGTATGCC CACGAGCTAC AGACAGAACA CTTTGGAGAG
GGAAGAGACG ATATCTTCCG TCGCCGTCAG AACGTTCTGC GTGGTATTCT GAACGGCATT
GACATGGGTA CATGGTCTCC TGCAAGTGAT CCTTATATTC CACAGAACTT CTCGGCACGC
CATATGGAAG GTAAGGCTGA GTGTAAGCGT CAGCTGCAAG AGGAGCTGGG TCTTGAGGTG
GCTTCAGATG CGCCACTTGC GGTAATGGTT ACACGTTTGA CCAATCAGAA GGGTCTTGGT
CTGATTCGCT ATGCTATGGA TCGTCTTATT CGTGCAGGTA TTGAAATTGC CGTTCTAGGA
ACTGGCGACG CTGAACAAGA AGACGCCATG CGCTACTTTG ACGAGCATTA TAAGAATCGC
ATGGCGGCAC GTATTGAGTT TGATATTGCG CTTTCTCACC GTATGTACGC AGGTGCTGAT
ATGTTTTTAA TGCCATCAGA GTTTGAGCCT TGCGGTCTTT CTCAGATGAT TTCTATGCGT
TACGGCACAC TTCCTGTTGT TCGCGAGACG GGTGGTCTGG CAGACTCCGT CAAACCATAT
AATCAGTTTA CGGGCGAAGG AACCGGCTTT AGTTTTGCTA ATCAAAACGC AGACGAAATG
GCAGATATTA TTTTGTACGC TGCTGACGTA TACAAGAACG ATAAGCAATC GTGGACAAAT
CTTGTAAAAC AGGCGATGGC AGAGGACTTT AGTTGGCATA ATGCTGCAAA TGAATATCTT
GACGTGTATC ATTTACTACA TCCTGAGATT ATCAGGTATG TTCGTCGTCG AGATTGGTAA
 
Protein sequence
MSQVSKRRKL RVVYATSEVA PFSKTGGLGD VSGSLPQALK KVGARVAVIS PLYSAIKPEW 
REKMKKVCEL QVPLSWRFEY CGLWHLVYEG VDFYFVDNEL YFARDGLYGY FDDGERYAFF
SKALCELIAH VPELSCDVLH CNDWQTALAP VFLREQYQSV PEVQNVKTVF SIHNVMFQGQ
FTDKMLSDVL GLADIPAAVD QLRCDASSIN FMKGALCYSD YLLTVSPTYA HELQTEHFGE
GRDDIFRRRQ NVLRGILNGI DMGTWSPASD PYIPQNFSAR HMEGKAECKR QLQEELGLEV
ASDAPLAVMV TRLTNQKGLG LIRYAMDRLI RAGIEIAVLG TGDAEQEDAM RYFDEHYKNR
MAARIEFDIA LSHRMYAGAD MFLMPSEFEP CGLSQMISMR YGTLPVVRET GGLADSVKPY
NQFTGEGTGF SFANQNADEM ADIILYAADV YKNDKQSWTN LVKQAMAEDF SWHNAANEYL
DVYHLLHPEI IRYVRRRDW