Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16851 |
Symbol | glgC |
ID | 4777695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1474298 |
End bp | 1475593 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087194 |
Product | glucose-1-phosphate adenylyltransferase |
Protein accession | YP_001017694 |
Protein GI | 124023387 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0448] ADP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR02091] glucose-1-phosphate adenylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.647337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTG TTTTGGCCAT CATTCTCGGG GGAGGAGCTG GAACGCGCCT CTACCCCCTC ACCAAGATGC GGGCCAAGCC GGCCGTACCT CTTGCAGGCA AATATCGTCT GATTGACATC CCAATCAGCA ACTGCATTAA CTCCAACATC AACAAGATGT ATGTGTTGAC GCAGTTCAAC AGCGCTTCTC TGAATCGACA CCTTGGACAG AGCTACAACC TCAGTGCTGC CTTTGGTCAA GGTTTTGTAG AAGTTCTTGC CGCACAGCAG ACCCCAGAAA GTCCTTCATG GTTTGAGGGG ACTGCGGATG CAGTTCGTAA ATATCAGTGG TTGTTTCAGG AATGGGATGT TGATGAATAT CTGATCCTCT CTGGTGATCA GCTCTACCGT ATGGATTACA GCCTTTTTGT GGAGCACCAT CGTCGCTCAG GTGCTGATCT CACAGTGGCA GCGCTTCCCG TTGATGCAGA GCAGGCTGAG GGATTTGGTC TAATGCGCAC GGATAGCGAT GGCAACATCC AGGAGTTCCG TGAAAAGCCC AAGGGTGAAT CCCTGAAGGC CATGGCCGTC GATACCTCTC GATTTGGCTT GTCAGCTGAG TCAGCAAAGA ACAAGCCCTA TCTCGCCTCG ATGGGGATCT ATGTGTTCAG TCGCGCCACT CTCTTCGATT TGCTGCATAA GAACCCTTCT CACAAGGATT TCGGCAAGGA GGTGATTCCT GAGGCCCTGG CTCGAGGTGA CCGGTTGCAA AGCTATGTGT TCGATGAGTA CTGGGAAGAC ATCGGCACGA TAGGAGCTTT TTACGAGGCG AATTTGGCTC TTACCCAACA GCCCAATCCA CCGTTCAGTT TCTATGACGA GAAGTTCCCG ATCTATACAA GACCCCGTTA TCTACCGCCA ACCAAACTGG TCGATGCGCA AATCACGGAA TCGATTATTG GAGAGGGATC CATCCTCAAA TCCTGCAGTA TTCACCATTG CGTCCTTGGT GTGCGTAGCC GAGTTGAGAG CGATGTGGTC TTGCAAGATT CCCTCGTGAT GGGCTCTGAT TTCTATGAAT CATCCGAGGA ACGCACTCTC CTAAGGCAAG GAGGAGGCAT CCCGCTCGGC GTTGGCGAAG GCACCACCGT TAAGGGTGCG ATCCTCGACA AGAACACACG AATCGGCAAC AACGTCACGA TCGTCAACAA GGATCACGTT GAGGAAGCCG ATCGTGCTGA TGAGGGCTTC TACATCCGCA ATGGGATCGT TGTAGTGGTC AAAAATGCCA CCATTTCTGA TGGCACCGTG ATTTGA
|
Protein sequence | MKRVLAIILG GGAGTRLYPL TKMRAKPAVP LAGKYRLIDI PISNCINSNI NKMYVLTQFN SASLNRHLGQ SYNLSAAFGQ GFVEVLAAQQ TPESPSWFEG TADAVRKYQW LFQEWDVDEY LILSGDQLYR MDYSLFVEHH RRSGADLTVA ALPVDAEQAE GFGLMRTDSD GNIQEFREKP KGESLKAMAV DTSRFGLSAE SAKNKPYLAS MGIYVFSRAT LFDLLHKNPS HKDFGKEVIP EALARGDRLQ SYVFDEYWED IGTIGAFYEA NLALTQQPNP PFSFYDEKFP IYTRPRYLPP TKLVDAQITE SIIGEGSILK SCSIHHCVLG VRSRVESDVV LQDSLVMGSD FYESSEERTL LRQGGGIPLG VGEGTTVKGA ILDKNTRIGN NVTIVNKDHV EEADRADEGF YIRNGIVVVV KNATISDGTV I
|
| |