Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_17561 |
Symbol | thiF |
ID | 4719878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 1541555 |
End bp | 1542703 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640081451 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001012070 |
Protein GI | 123966989 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT CAATTGATCA CAAATCTGAT TCCTTAACTT TAGATGAAGA AGATAGATAC AAGAGACATT TAACACTCAA TGAAATAGGA TTAAAGGGAC AATTAAAACT TAAACGCAGT TCAGTAGTTT GTATTGGCGC AGGAGGTTTA GGCTCTTCTG TATTAATTTA TCTTGCCGCT GCAGGAATTG GAACAATAGG AATAGTTGAT AATGATCAAG TTGAGAAGTC GAATCTACAA AGACAAATAA TTCATGAAAC AAATACAGTT GGGGATTTAA AAATTGATTC TGCTCAAGAA AGAATTAGAA GATTGAATCC TAATATTGAA GTAATAACTT TTGCTGAACG AATTAACTCA AATAATATTC TCGATATTAT TAATCAATTT GATATTGTTT GTGATTGTTC AGATAACTTT GGTACTCGTT ATTTAATAAA CGATGCTTGC TTAATACTTG ATAAACCTTT AGTTTTTGGA AGCGTTCAAG GATTTGAAGG CCAAATCAGT GTTTTTAATC TAAAAAAAAA TAGTCCCAAT TTAAGAGACT TACTCCCGGA ATCGCCTTTA AAAAATAATA TTCCTAGCTG CGAAGAATTT GGCGTTATAG GAGTTTCAAC TGGTCTTATA GGAGTTTTAC AAGCAAATGA AGCAATAAAG ATTATTCTTA AAAAAGGGCA AATTCTTGAT GGGAAGATTT TGATTTTCAA CCTTCTCAAT ATGAATATAA AAACATTGAC TTTAAAAGCT GATAAATTTA CAAATACGAT TAATGACCTT TCTGAGTTTG AAGATTTTTA TAAAGACATT GAATGTCAAG ATAATATAAA AATTAATAAA ATAGATTCCA CGACATTTGA AACACTATAC AGATCAAATT ATAATAATCT ACTAATAATA GATGTTAGAG AAAAAGAGGA ATTTAATAAA TACTCTATTA AAGGAGCGAT ATCTATACCA CTTAATAATC TAGACCAAAA ACCACACCTA GAATTTATCA AACAAGAAAG TTTGGATAAA GAAGTATTTA CATTATGTCA AGCAGGAAAA CGATCTGAAA AAGCTTCAAA GATTTTGATG AAATTTAAAA TCTCATCAAA ATCAATTGAA GGAGGAATTG CAAATATTAA ACAACTAATT TTTCATTAA
|
Protein sequence | MKISIDHKSD SLTLDEEDRY KRHLTLNEIG LKGQLKLKRS SVVCIGAGGL GSSVLIYLAA AGIGTIGIVD NDQVEKSNLQ RQIIHETNTV GDLKIDSAQE RIRRLNPNIE VITFAERINS NNILDIINQF DIVCDCSDNF GTRYLINDAC LILDKPLVFG SVQGFEGQIS VFNLKKNSPN LRDLLPESPL KNNIPSCEEF GVIGVSTGLI GVLQANEAIK IILKKGQILD GKILIFNLLN MNIKTLTLKA DKFTNTINDL SEFEDFYKDI ECQDNIKINK IDSTTFETLY RSNYNNLLII DVREKEEFNK YSIKGAISIP LNNLDQKPHL EFIKQESLDK EVFTLCQAGK RSEKASKILM KFKISSKSIE GGIANIKQLI FH
|
| |