Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_16941 |
Symbol | thiE |
ID | 4779849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1380541 |
End bp | 1381593 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084978 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001015514 |
Protein GI | 124026399 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAA TACCTGTCAC CCCTCCTTCT GATAATCGTA TTGCTCAATT AATTGACGCG AACCTTGATC GCGCTAGAGA GGGGCTTAGA GTTATGGAAG ATTGGTGCAG ATTTGGTTTA AAGAGGAGTG ATTTTTCGAT TCAAATCAAA GATTGGAGGC AACAATTAGG AGTACATCAC CACAATATTT ATCGAAAAGA AAGGCTTACA TCCATCGATC CAGCTATGGG CATTTCACAT CCGTTACAAA CAGTCAGATC AACCCCAGAG GATGTATTTA TTGCAAACTC ATCCAGAGTT CAAGAAGCCC TAAGAGTAAT AGAGGAATTC ACTCGAAAAA CAGATCCGAA TCTTTGTGAA ATAGCCAGCA AAATTAGATA CGAAACTTAT GAAATCGAGA TAAAGGTGCT TAAATCGACA GAAGGCGTAA ATAAAAGAGA AACCTTAAAA TATTGTTCGG TATACTTAAT AACCTCAAAC AGGAGAGATA TAGAAGAGGT TGTTCTTCAT GCTCTAAAAG CTGGTGTAAA AATAGTTCAA TATAGAGAGA AATTTTTAAA CGATAATGAA AAAATTTCAC AAGCTAAATG TTTAGCCTCT CTTTGTAAAA AATTCAATTC ACTTTTTATA GTCAATGACC GTATTGATAT CGCACTCGCC GTTGACGCAG ATGGAATTCA TTTGGGACAA GAAGATATGC CAACAAAAAT CGCGAGACAA CTACTAGGGG CTGAAAAAAT CATTGGCAGA AGCACGCACT GTCTTGAAGA CATCAAAAAT GCCGAAGGAG AAGGCTGTGA TTATATTGGT ATAGGGCCAA TATTTCCTTC TGAAACAAAA AAGCAACTAA ATCCAATTGG AATTGACAAC CTAAAAAAAG GATTAAGTGA AACTATCCTT CCTGCTTTTG CTATTGGGGG AATTAATAAA TCAAATATCA CAAAATTAAA TCAGATAAAT AATCTTCGCA TTGCTGTCTC AAATGCAATC ATTAATTCAA ATGATCCCTT TTCAACAACT GAAGAGCTTA TCAAATTTCT AATATGCAAT TAA
|
Protein sequence | MKSIPVTPPS DNRIAQLIDA NLDRAREGLR VMEDWCRFGL KRSDFSIQIK DWRQQLGVHH HNIYRKERLT SIDPAMGISH PLQTVRSTPE DVFIANSSRV QEALRVIEEF TRKTDPNLCE IASKIRYETY EIEIKVLKST EGVNKRETLK YCSVYLITSN RRDIEEVVLH ALKAGVKIVQ YREKFLNDNE KISQAKCLAS LCKKFNSLFI VNDRIDIALA VDADGIHLGQ EDMPTKIARQ LLGAEKIIGR STHCLEDIKN AEGEGCDYIG IGPIFPSETK KQLNPIGIDN LKKGLSETIL PAFAIGGINK SNITKLNQIN NLRIAVSNAI INSNDPFSTT EELIKFLICN
|
| |