Gene Apar_1049 details


Gene Information

Locus tag: Apar_1049
Symbol:
ID: 8413922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1188705
End bp: 1189853
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 46%
IMG OID: 645022638
Product: glucose-1-phosphate adenylyltransferase
Protein accession: YP_003180068
Protein GI: 257784851
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0448] ADP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR02091] glucose-1-phosphate adenylyltransferase
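
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1188705
end_bp = 1189853

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the listed Gene Length (1149 bp) and Protein Length (382 aa).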


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAA AAGAATGTCT TGCAATGCTG CTTGCAGGGG GACAGGGAAG CCGTCTAGGC 
GCTCTAACCT CAAAAGTAGC TAAGCCAGCT GTTTCGTTTG GCGGCAAGTT CCGCATCATT
GACTTTGCAT TATCCAACTG CGCAAATTCA GGAATTTCTA CGGTGGGTGT TTTGACACAG
TATCGTCCGT ATTTGTTGCA CTCTTATGTG GGGTCAGGTA GTGCTTGGGA CCTTGATGAG
CTTGGCGGTG GAATTTCTAT TCTTCCTCCA TTTGCTACTC AGTCTGGTGG TGCATGGTAT
GCGGGCACTG CAGATGCTGT CACTCAGAAT ATTGGTTACA TCGAGCAGAA CAAACCTGAT
TACGTAATTA TCCTCTCTGG CGATCAGCTC TATCGTATGG ACTACGGTGA GATGCTTGCT
TGCCATAAAG ACAATAATGC CGACCTCACT ATTGCTGTTA TGCCTGTTCC TTGGGAGGAA
GCTTCCCGCT TTGGCATTAT GTCTGTTGAT GATGAAGGTA GAATCACTAA GTTTTCCGAG
AAACCCGCAG AACCTGAGTC CAATCTAGCT TCTATGGGCA TCTATATCTT CACCACTGAT
TTGCTTCTTG AAACACTCCG TGAAGATGCA AAGAATCCTG AGTCTTCTCA TGATTTTGGT
AAAGATATTA TTCCAACGCT GCTTGACGAT GGTAAGCGCC TGTTTACCTA TCGCTTTGAG
GGCTTCTGGC GTGATGTGGG TACTATTGCA AGTTATCATG AGACCAGTAT GGACCTGCTT
GGCTCCGAGC CTAAGTTTGA TATTTTTTCG GACAAGTTCC CCATTTTGTC TAATGCTTCT
ACCCGTCCAC CAGCGTATAT TGGCCCTTTT GGTGAGGTAG ACGATTGTCT AGTAAGCAAC
GGCTGTCAAG TCTTTGGCTA TTCTCGCCAT TCAATTTTGT CTACAGATGC CGTTGTTGGT
GAGGGAGCAC GTGTTATTGA CTCTGTGCTT CTTCCTGGCG CAGAGGTTAA GCCTGGTGCC
GTTGTGATTC GCGCAATTAT TGGTGAGAAC GCCGTGGTCG AGAAGAACGT TCACGTTGGT
AGTTCTGATT TAAACAAAGA GATTGCTGTT GTTGGAAATG ATGTAGTGGT TGAAAGAGGT
GAGCACTAA
 
Protein sequence
MSKKECLAML LAGGQGSRLG ALTSKVAKPA VSFGGKFRII DFALSNCANS GISTVGVLTQ 
YRPYLLHSYV GSGSAWDLDE LGGGISILPP FATQSGGAWY AGTADAVTQN IGYIEQNKPD
YVIILSGDQL YRMDYGEMLA CHKDNNADLT IAVMPVPWEE ASRFGIMSVD DEGRITKFSE
KPAEPESNLA SMGIYIFTTD LLLETLREDA KNPESSHDFG KDIIPTLLDD GKRLFTYRFE
GFWRDVGTIA SYHETSMDLL GSEPKFDIFS DKFPILSNAS TRPPAYIGPF GEVDDCLVSN
GCQVFGYSRH SILSTDAVVG EGARVIDSVL LPGAEVKPGA VVIRAIIGEN AVVEKNVHVG
SSDLNKEIAV VGNDVVVERG EH
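
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a rough sketch, not the database's own method: the `translate` helper and the codon-table construction are illustrative. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard code (the two tables differ in permitted start codons), and it strips the spaces that separate the 10-base groups.

```python
# Codon table in the conventional T, C, A, G ordering. The amino-acid
# assignments are those of the standard code, which translation
# table 11 shares; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCAAAA AAGAATGTCT TGCAATGCTG CTTGCAGGGG GACAGGGAAG CCGTCTAGGC"
prefix = translate(first_line)
print(prefix)

# Last line of the gene sequence; it ends in the stop codon TAA.
print(translate("GAGCACTAA"))
```

The translated prefix can be compared with the first two 10-residue groups of the protein sequence, and the final codons with its last two residues.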