Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22891 |
Symbol | thiF |
ID | 4778710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2020918 |
End bp | 2022147 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640087808 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001018289 |
Protein GI | 124023982 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTGC GCATTTGGGA CAGTGGTAAG GGCATAGCTA GCGGCACTCT TTCCCCTATG GGTCTTCACG AAACCGTTGA TGTTGGCCTG AGCCCCGATG AGTTAGAGCG TTTTTCTCGC CATCTCACCC TGCCAGAGGT TGGGATGAAT GGTCAGAAAC GACTCAAGGC GGCTTCTGTG TTGTGCGTGG GTAGTGGCGG GCTCGGTTCG CCGTTGCTGC TTTATTTGGC TGCTGCTGGT GTAGGCCATA TCGGGATTGT TGATTTCGAT GTTGTTGAGC TTTCTAATTT GCAGCGCCAG GTGATCCATG GCACGAGTTG GGTAGGTCAA CCAAAAACGC ATTCTGCTCG GGCTCGCATC CTTGAAATCA ATCCCCATTG CCAAGTGGAT CTCTACGAGA AAGCGCTGAC GCGAGATAAC GCTTTTGAGA TCATTCATCC GTACGACATT GTTTGTGACT GCACTGATAA TTTCCCTAGC CGTTACCTGG TGAATGACGC TTGTGTGCTT TTAGGTAAGC CCAGTATTTA TGGATCGATT CATCGCTTTG AAGGTCAGGC CACGGTGTTT AACCTTGATG CTGAGAGTCC CAACTATCGT GATCTGGTGC CCGAACCGCC ACCCGCTGGC TTGGTGCCTT CTTGCGTCGA AGGCGGTGTG ATGGGGATCC TTCCGGGATT GATTGGTTTA ATTCAGTCCG CGGAGGTCAT TAAAATCATT ACTGGTATCG GCACGACGTT GAGTGGTCGG CTGTTGGTTG TTGATGCCTT GGCGATGACG TTTCGGGAGA TGGCTTTGCG CCCGAGTCAA CCGCGAGTTG TGATTGATCA GCTGATTGAT TATCAAGACT TTTGTGCTTC TGGTGCTGAT CAACCAGCTC AAGAGAAGGC TGCTGGGTTG GAGAGTATTT CAGTTCAGGA TCTCAAGTCT TTACTCGATC TTGGCGCTGA GGATTTTGCC CTGGTGGATG TGCGTAATCC CAATGAGGCT GAGATTGCTT GCATTGCTGG TTCAGAATTG ATCCCGTTGA ACAAGATTGA GAGTGGTGAG GCGATTGAGA AGGTTCGGCA GTTGGCCTCT GGCCGTCGAC TATATGTCTA CTGCAAGTTG GGTGGTCGCT CGGCAAAGGC TTTGACCACT CTCAAGCGTC ATGGCATTGA AGGTGTCAAT GTGACTGGTG GCATTGATGC TTGGGCCAAG GAAGTTGACA ACTCATTGCC TCGCTACTGA
|
Protein sequence | MRLRIWDSGK GIASGTLSPM GLHETVDVGL SPDELERFSR HLTLPEVGMN GQKRLKAASV LCVGSGGLGS PLLLYLAAAG VGHIGIVDFD VVELSNLQRQ VIHGTSWVGQ PKTHSARARI LEINPHCQVD LYEKALTRDN AFEIIHPYDI VCDCTDNFPS RYLVNDACVL LGKPSIYGSI HRFEGQATVF NLDAESPNYR DLVPEPPPAG LVPSCVEGGV MGILPGLIGL IQSAEVIKII TGIGTTLSGR LLVVDALAMT FREMALRPSQ PRVVIDQLID YQDFCASGAD QPAQEKAAGL ESISVQDLKS LLDLGAEDFA LVDVRNPNEA EIACIAGSEL IPLNKIESGE AIEKVRQLAS GRRLYVYCKL GGRSAKALTT LKRHGIEGVN VTGGIDAWAK EVDNSLPRY
|
| |