Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20151 |
Symbol | thiF |
ID | 4779548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1657930 |
End bp | 1659075 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640085307 |
Product | molybdopterin biosynthesis protein |
Protein accession | YP_001015835 |
Protein GI | 124026720 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.28602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAAA GCCAGAGTAA GGCAAATTTA AGTTCAGAAG AAATTGCCAG GTATGCAAGA CATATAAGTC TCCCAGAGAT AGGTATCAAA GGCCAAGAAA AATTGAAGAC AAGCTCAGTT GCTTGCATTG GGACAGGAGG GCTAGGATCT CCACTTTTAA TTTATCTTGC AGCAGCTGGA ATTGGACGTA TCGGAATAGT TGATTTTGAT GTCGTTGAAT ACTCAAATTT ACAAAGACAA ATCATTCATA CAACACATTC AATAGGTCTA TTAAAAACAG ATTCGGCCAA ACAAGCTATA CGCAAAATAA ATCCTTCTTG TCGAGTTGAT TTATTCAATC AAAAGCTAAC AAGTAGTAAT GCTTTGGAAA TACTTAAAGC TTATGATGTG ATATGTGATT GTTCAGACAA TTTCCCAACG CGTTACCTGA TTAATGATGC TTGTCTAATA CTTAACAAAC CTAATATATA TGGTTCAATC GCAAGATTCG AAGGACAAGT AAGTGTATTT AATTTGAAGG AAGATAGCCC TAACTATAGA GACCTTATCC CCATACCCCC TCCACAAGAG TTAATTCCAT CTTGCTCTGA AGCTGGAGTG ATGGGAATTC TTCCAGGAAT TATTGGTACA ATTCAAGCAG CAGAAGCTAT AAAGATAATA ACAAACATTG GTTATCCACT TAACGGTAGG ATTCTCATTT TTAATGCATT AAAAATGCAA TTTAAAGAAC TAACTTTGAA ATCCAATCCA GAAAATAAAA ATATCCATAA ATTAATAGAT TATAAAAGTT TCTGTTCAGA AATTTCAGTT AAAGATGAAG TAGAATGTGA TATAGAAAGT ATTTCAGTTA AAGAATTAAA AGTACTTCTT AGACAATCTT CAAAAGAAAT GTTATTAATA GATGTTCGCA ACCAAGATGA ATATCATCAA TGTTCAATTA CAGGTTCATT GCTCATACCT CTTAACTCTA TTGAAAGTGG TAAAGCCATT GATGAAATTA AAATCCTTAC CGCAAAAAAA AATCTTTATG TATTTTGTAA AAGTGGAAAA AGATCATTGC TTGCATTAAA GCATTTAAAC AAATTTGGAA TTAGAGGTAT TAATATTCTT GGAGGTATTG ATGCGTGGAA TAGCGAAAAA AATTAA
|
Protein sequence | MEQSQSKANL SSEEIARYAR HISLPEIGIK GQEKLKTSSV ACIGTGGLGS PLLIYLAAAG IGRIGIVDFD VVEYSNLQRQ IIHTTHSIGL LKTDSAKQAI RKINPSCRVD LFNQKLTSSN ALEILKAYDV ICDCSDNFPT RYLINDACLI LNKPNIYGSI ARFEGQVSVF NLKEDSPNYR DLIPIPPPQE LIPSCSEAGV MGILPGIIGT IQAAEAIKII TNIGYPLNGR ILIFNALKMQ FKELTLKSNP ENKNIHKLID YKSFCSEISV KDEVECDIES ISVKELKVLL RQSSKEMLLI DVRNQDEYHQ CSITGSLLIP LNSIESGKAI DEIKILTAKK NLYVFCKSGK RSLLALKHLN KFGIRGINIL GGIDAWNSEK N
|
| |