Gene Information |
Locus tag | Synpcc7942_1057 |
Symbol | |
ID | 3773987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1067527 |
End bp | 1068558 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637799479 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_400074 |
Protein GI | 81299866 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
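The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1067527
end_bp = 1068558

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1032 bp) and Protein Length (343 aa).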
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.004946 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0997928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCTG AGTCGAGCAT GGATTGGGTT GAAACCCGTT GCCACCGCAT TCTGGATGCC AACCTCGATC GCGCCCGTGA AGGGTTGCGC ATTCTGGAAG AGTGGTGCCG CTTTGGCCTC GAGCGCGCTG ACTTGAGCGC GACTTGCAAG GCGCTTCGGC AAGAAGTCGG CAGTTGGCAT CGCCCAGAAT TTCGTCAAGC TCGCGACACA ACCCACGATC CCGGCACGAG CTTGAGTCAT CCCCAAGAGC GACAGCGGAC TGACCTCAAT GCCGTTTTAT TGGCGAACTG TGCGCGGGTT CAGGAAGCCC TGCGGGTGAT TGAGGAATAC GGCAAGCTGA TCGAGGGGGA TCTCAGCGAT CGCGCTAAGG CAATGCGCTA CCAGATCTAT GTCTTGGAGT CTCAGCTGCA AAGTCGCGAT CGCCTCAGTC GGTTGCGACA GGCACGGCTT TATCTTGTGA CCTCGCCCCA TCCTCGCTTG CTGGAGGTCG TTGAAGCGGC TCTTTCGGCA GGGCTGAAAT TGGTGCAGTA CCGCGACAAG CAGCAGGAGG ATGCAACCCG CCTAGAAACC GCCTGCCGCT TGGCGGAACT CTGCCAGCGC TACGGAGCCC TTTTCCTCGT CAACGATCGC GTCGATCTAG CACTTGCTTG CGGGGCTGAT GGCGTTCATC TTGGTCAGCA GGATGTGCCG ATGGACGTGG CTCGCCGCAT TCTCGGCCCC GATCGCATTG TTGGGCGATC GACGACCAGT CCTGAAGAGT TGGCCCGTGC CAATGCTGAA GGGGCGGACT ATGTCGGCGT GGGGCCCATC TTTGCCACAC CCACGAAACC CGGCAAAGCA GCAGCGGGCT TTGACTATTT AGGCTATGCC CGCCAGCAGG CTCAGCAGCC GTTTTATGCG ATTGGTGGAA TCGATGTCAG CAATGCCGCT GCGGTGGTTG CAGCCGGTGC CGATCGCCTC GCCGTGGTCC GCGCGATCAT GGAGGCGCCC GACCCGAAGG CCGCTACGGC AGAACTTTTA CAAATGCTTT AG
|
Protein sequence | MSSESSMDWV ETRCHRILDA NLDRAREGLR ILEEWCRFGL ERADLSATCK ALRQEVGSWH RPEFRQARDT THDPGTSLSH PQERQRTDLN AVLLANCARV QEALRVIEEY GKLIEGDLSD RAKAMRYQIY VLESQLQSRD RLSRLRQARL YLVTSPHPRL LEVVEAALSA GLKLVQYRDK QQEDATRLET ACRLAELCQR YGALFLVNDR VDLALACGAD GVHLGQQDVP MDVARRILGP DRIVGRSTTS PEELARANAE GADYVGVGPI FATPTKPGKA AAGFDYLGYA RQQAQQPFYA IGGIDVSNAA AVVAAGADRL AVVRAIMEAP DPKAATAELL QML
|
| |
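The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the codon assignments of NCBI translation table 11, which match the standard code for amino acids and differ only in the permitted start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 codon-to-amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGTCGTCTG AGTCGAGCAT GGATTGGGTT"
print(translate(fragment))  # MSSESSMDWV
print(gc_fraction(fragment))
```

The translated fragment matches the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence, rather than this fragment, would be needed to compare against the GC content field.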