Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15071 |
Symbol | pgi |
ID | 4777734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1309142 |
End bp | 1310752 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640087015 |
Product | glucose-6-phosphate isomerase |
Protein accession | YP_001017516 |
Protein GI | 124023209 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.33386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCAGC TGATGAGTTT CCCGGATTTC AGCGCCAGCG ATGCCCATGT TCAGTGGCAG CGCTTTAACA ATTTGCTTTG GTATCACAAC GATCTTGGAA TTTGGCTGGA CATCAGCCGA ATGCATATCA ATGCAGAAGA TTTTGAGCGG CTAGGGCCAC GCTTTGATCA GGCTTTCAAG GCCATGCAGG CTTTGGAACA GGGATCTATC GCTAATGCTG ATGAACAGCG GATGGTCGGC CACTACTGGC TGCGTCAGCC GCAGCTGGCG CCTGATCAAG AGGTCCGTGA TCACATTGCC AAAGAAATTG ACCTGATTGA GACCTTTGGC AGCAATGTCG TCAATGGCCT CATCAAAGCC CCCAATGGCA AACAGTTCAC CGATGTGCTT TGGATCGGGA TTGGAGGCAG TGGCCTAGGG CCTTTATTAA TGATTCGTGC TCTTCAGAAT GCTGAGCAGG GATTGCGATT CCATTTTTTC GACAATGTGG ATCCTGATGG CATGAGTCGC GTTCTTGGCA ATCTTGGAGA TGCCCTGAGC ACGACTTTGG TTGTCACGGT AAGCAAATCT GGTGCGACTC CGGAACCACA CCTCGGCATG GAGCAGGCTC GTCAACGTCT CGAAGAGATG GGAGGTCGTT GGTCGGGTCA GGCCGTTGCC GTGACGATGC TCAATAGCCA GTTAGATCAG CTTGCACAGA AGGAATCCTG GCTTAAGCGT TTTGACATGT TCGATTGGGT TGGCGGTAGA ACAAGCATCA CGAGTGCTGT TGGCCTTTTG CCGGCTGCAT TGATTGGTTG TAATACCCGC GATTTTCTTG CTGGTGCTGC CCAAATGGAT GAGGCCACAC GTGTGCCTGA TCTGCACAGC AACCCAGCAG CTTTGATGGC CGCTGCTTGG TTTGTTGCAG GAGATGGGCT TGGACGTCGT GACATGGTTG TTCTTCCATA TAGAGATCGC CTTGAGGTCT TTAGTCGTTA CCTGCAACAG CTGGTAATGG AGTCGCTAGG CAAGCGTTTA GATCGAGATG GCAATGTTGT TCATCAAGGC CTTGCTGTAT ATGGCAACAA GGGTTCAACT GATCAACATG CTTATGTACA GCAATTACGC GATGGTGTTG ACAATTTCTT CGCCACATTT ATTGAGGTGC TTGAGGATAT TGAGAACATA CCGGCGATCA ACAATGAGCA TCCAGGTGAT TTTCTCGATG GCTTCCTGCA AGGCACTCGC GCAGCGCTCA GTCAGGGAGG TCGTCAGAGC CTCAGTATTT CAATGCGTCG ATTTGATCCT CGTCGACTCG GCGCATTAGT TGCTTTGTTT GAGAGGGCAG TTGGTTTATA TGGTGAACTT GTCAATATTA ATGCGTATCA CCAGCCTGGA GTGGAATCAG GTAAGAAAGC AGCCGCTGCC ATCCTTAACT TGCAATCTCG AGTTGAAGCC TTGCTTGCTG ATGGTGTCGA CCGTTCAGCT GGAGAGATCC ATCAAGTGAT TGGAGAAGGG TCAGAAGAAG CCATCTTTTG GATCATGAGG CATTTAACTG CCAATAAGCG TGGTTACGTT GCAGAAGGTG ATTGGGGAAT TCCGACTTCA CTACGTTTCA GCAAAGGCTG A
|
Protein sequence | MHQLMSFPDF SASDAHVQWQ RFNNLLWYHN DLGIWLDISR MHINAEDFER LGPRFDQAFK AMQALEQGSI ANADEQRMVG HYWLRQPQLA PDQEVRDHIA KEIDLIETFG SNVVNGLIKA PNGKQFTDVL WIGIGGSGLG PLLMIRALQN AEQGLRFHFF DNVDPDGMSR VLGNLGDALS TTLVVTVSKS GATPEPHLGM EQARQRLEEM GGRWSGQAVA VTMLNSQLDQ LAQKESWLKR FDMFDWVGGR TSITSAVGLL PAALIGCNTR DFLAGAAQMD EATRVPDLHS NPAALMAAAW FVAGDGLGRR DMVVLPYRDR LEVFSRYLQQ LVMESLGKRL DRDGNVVHQG LAVYGNKGST DQHAYVQQLR DGVDNFFATF IEVLEDIENI PAINNEHPGD FLDGFLQGTR AALSQGGRQS LSISMRRFDP RRLGALVALF ERAVGLYGEL VNINAYHQPG VESGKKAAAA ILNLQSRVEA LLADGVDRSA GEIHQVIGEG SEEAIFWIMR HLTANKRGYV AEGDWGIPTS LRFSKG
|
| |