Gene Information |
Locus tag | A9601_14731 |
Symbol | thiE |
ID | 4718194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1256992 |
End bp | 1258047 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640079194 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001009863 |
Protein GI | 123969005 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
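The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions agree with the listed values but are not stated in the record. The function names are hypothetical, chosen for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1256992
END_BP = 1258047


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1056
print(protein_length(gene_length(START_BP, END_BP)))  # 351
```

Both results match the Gene Length (1056 bp) and Protein Length (351 aa) fields.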
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.161501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATT CCAATACTAA AGACCATGAA GATTTAAGAA TTTATCAGAT TATTGACGCT AATCTAGATA GAGCAAGAGA AGGACTAAGA GTATTAGAGG ATTGGGCTAG ATTTGGTCTA GGCAAAGAAA AATATGTTGA AAAAATTAAA AATTTTAGAC AAATTTTAGG AAAAAATCAT TTAGAAGTTT ATAAACAATC TAGAAATCAG ATCGAAGACA ATTGTAAAGG ATTGACTCAT CAAGAGCAAT TAAACAGAAA AACTTCTGAG CAAATTATAA GTTCTAATTC AGCAAGAGTT CAAGAAGCAT TACGAGTCAT AGAGGAATTC TCAAGGCTGC ACAATAATGA GCTTTCAAAA ATCGCTTCTG AAATTAGATA TGAAATTTAT ACTGTTGAAA TTGACCTATT AAGTTTTAGC AAGTTTAAGA AGTCGGAGAA AATATTAAAA GAAAATGACT TATATGTAAT CACAGATCAA AAGGACAATT TATTAGAAAT AATTGAAGAG ATTTTAATTG CGGGAGTAAA AATTATTCAG CATAGATTTA AAACGGGAAC TGATCAAGAT CATCTTCAAG AAGCAATTGA GATTAAAAAT CTATGTAAAA GATATAATTC TTTGTTCATA GTTAACGATA GACTTGATAT AGCTCTAGCA TCTAACGCGG ATGGGATTCA TCTTGGACAA GACGATTTAG ACTTAAAAAC CACAAGAAAG CTATTTGGAT ATTCAAAAAT AATCGGTATA AGTGCAAATA ATGCAATTGA TATTTCAAAT GCTCTTGACG AGGGTTGTGA CTACATAGGG ATAGGGCCAG TATTTGAAAC TACGACAAAA AAGAATAAAA AACCTTTAGG TATTGAAAAT ATCAAAACAT TAACAAAGGA TTTAAATATT CCTTGGTTTG CTATCGGAGG AATCAAGTCA AATAACATTT CATATTTAAA AAGAAATGGG TTTAAAAAAG TTGCCTTAGT TTCGGAATTA ATGAATTCTG AAGATCCTAA AGAAGACGCT ATGATGATTT TAAAAGAATT GTCTCATGAA AATTAG
|
Protein sequence | MLNSNTKDHE DLRIYQIIDA NLDRAREGLR VLEDWARFGL GKEKYVEKIK NFRQILGKNH LEVYKQSRNQ IEDNCKGLTH QEQLNRKTSE QIISSNSARV QEALRVIEEF SRLHNNELSK IASEIRYEIY TVEIDLLSFS KFKKSEKILK ENDLYVITDQ KDNLLEIIEE ILIAGVKIIQ HRFKTGTDQD HLQEAIEIKN LCKRYNSLFI VNDRLDIALA SNADGIHLGQ DDLDLKTTRK LFGYSKIIGI SANNAIDISN ALDEGCDYIG IGPVFETTTK KNKKPLGIEN IKTLTKDLNI PWFAIGGIKS NNISYLKRNG FKKVALVSEL MNSEDPKEDA MMILKELSHE N
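The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard genetic code, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between display blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
prefix = "ATGCTGAATT CCAATACTAA AGACCATGAA"
print(translate(prefix))  # MLNSNTKDHE
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field (30%).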
|
| |