Gene P9303_22891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22891 
SymbolthiF 
ID4778710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2020918 
End bp2022147 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content51% 
IMG OID640087808 
Productmolybdopterin biosynthesis protein 
Protein accessionYP_001018289 
Protein GI124023982 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTGC GCATTTGGGA CAGTGGTAAG GGCATAGCTA GCGGCACTCT TTCCCCTATG 
GGTCTTCACG AAACCGTTGA TGTTGGCCTG AGCCCCGATG AGTTAGAGCG TTTTTCTCGC
CATCTCACCC TGCCAGAGGT TGGGATGAAT GGTCAGAAAC GACTCAAGGC GGCTTCTGTG
TTGTGCGTGG GTAGTGGCGG GCTCGGTTCG CCGTTGCTGC TTTATTTGGC TGCTGCTGGT
GTAGGCCATA TCGGGATTGT TGATTTCGAT GTTGTTGAGC TTTCTAATTT GCAGCGCCAG
GTGATCCATG GCACGAGTTG GGTAGGTCAA CCAAAAACGC ATTCTGCTCG GGCTCGCATC
CTTGAAATCA ATCCCCATTG CCAAGTGGAT CTCTACGAGA AAGCGCTGAC GCGAGATAAC
GCTTTTGAGA TCATTCATCC GTACGACATT GTTTGTGACT GCACTGATAA TTTCCCTAGC
CGTTACCTGG TGAATGACGC TTGTGTGCTT TTAGGTAAGC CCAGTATTTA TGGATCGATT
CATCGCTTTG AAGGTCAGGC CACGGTGTTT AACCTTGATG CTGAGAGTCC CAACTATCGT
GATCTGGTGC CCGAACCGCC ACCCGCTGGC TTGGTGCCTT CTTGCGTCGA AGGCGGTGTG
ATGGGGATCC TTCCGGGATT GATTGGTTTA ATTCAGTCCG CGGAGGTCAT TAAAATCATT
ACTGGTATCG GCACGACGTT GAGTGGTCGG CTGTTGGTTG TTGATGCCTT GGCGATGACG
TTTCGGGAGA TGGCTTTGCG CCCGAGTCAA CCGCGAGTTG TGATTGATCA GCTGATTGAT
TATCAAGACT TTTGTGCTTC TGGTGCTGAT CAACCAGCTC AAGAGAAGGC TGCTGGGTTG
GAGAGTATTT CAGTTCAGGA TCTCAAGTCT TTACTCGATC TTGGCGCTGA GGATTTTGCC
CTGGTGGATG TGCGTAATCC CAATGAGGCT GAGATTGCTT GCATTGCTGG TTCAGAATTG
ATCCCGTTGA ACAAGATTGA GAGTGGTGAG GCGATTGAGA AGGTTCGGCA GTTGGCCTCT
GGCCGTCGAC TATATGTCTA CTGCAAGTTG GGTGGTCGCT CGGCAAAGGC TTTGACCACT
CTCAAGCGTC ATGGCATTGA AGGTGTCAAT GTGACTGGTG GCATTGATGC TTGGGCCAAG
GAAGTTGACA ACTCATTGCC TCGCTACTGA
 
Protein sequence
MRLRIWDSGK GIASGTLSPM GLHETVDVGL SPDELERFSR HLTLPEVGMN GQKRLKAASV 
LCVGSGGLGS PLLLYLAAAG VGHIGIVDFD VVELSNLQRQ VIHGTSWVGQ PKTHSARARI
LEINPHCQVD LYEKALTRDN AFEIIHPYDI VCDCTDNFPS RYLVNDACVL LGKPSIYGSI
HRFEGQATVF NLDAESPNYR DLVPEPPPAG LVPSCVEGGV MGILPGLIGL IQSAEVIKII
TGIGTTLSGR LLVVDALAMT FREMALRPSQ PRVVIDQLID YQDFCASGAD QPAQEKAAGL
ESISVQDLKS LLDLGAEDFA LVDVRNPNEA EIACIAGSEL IPLNKIESGE AIEKVRQLAS
GRRLYVYCKL GGRSAKALTT LKRHGIEGVN VTGGIDAWAK EVDNSLPRY