Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09371 |
Symbol | folP |
ID | 5730527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 834383 |
End bp | 835282 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641285303 |
Product | putative dihydropteroate synthase |
Protein accession | YP_001550822 |
Protein GI | 159903478 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000783824 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGTTCT CGTTGGAGGA GCTTCCCTCG ATCCTGAAAG TTTCGCAAGG ATCTCCAATT ACAAAGTTGA ATAACAATAC GATCTGGCCG TCCGGGTGGG GTCAAAGAAC CGCAGTGATG GCAATTATAA ATATTACCCC TGACTCATTT AGTGATGGAG GTAAATATTT AATAGAGGAA GAAGCTTTGA AAAGAGCTCT AATGTCAATT AAAGATGGCG CAGACGTAAT TGACTTAGGG GCCCAAAGTA CTCGTCCAGG TGCAAATATT ATTAGCCCAG ATGAAGAATT AAAAAGACTA TTACCTATAC TAAAATCTAT ACGTTCAAAG TTACCTAATT CAATTATATC TGTAGATACA TTTCATTCAA GTGTAGCTGA GAAGGCTTTA GAAGAAGGGG CAGACTGGAT AAATGATGTT ACTGGTTCTA AATATGATAA AAGAATGGTT GATGTATTAT CTAGTGGAAA TTTTCCTTAC GTTTTAACAC ATAGCAAGTC TAACAATAAA ACCATACACT CTCAGGCAGA ATATAACAAT GTAGTAGAGG ATGTTTATCA GAGTTTAATC GAATTAACTG ATAATGCAAT CTCGAAGGGC ATTCGTGAAA AAAATATTAT CTGGGACCCT GGGATTGGGT TTTCTAAAAA CAGAGAACAT AATATTCAGA TATTAACTAA CATTGATAAA TTTTCTGGCG GAAATTTTCC TTTAATAATA GGAGCTTCCA GGAAAAGGTT TATTGGGGAG ATTATTAATG AGAAAAACCC ATTAAAAAGA AATGCAGGTA ATATTTCAGT TGTTTGTAAA TGTGTTGCGT CAAATGTGGA TATGGTCAGA GTTCATGATG TCAAAGAAAC TATTCAAACT ATTCGCCTAG CCAATGAATT ATGGAAGTAA
|
Protein sequence | MEFSLEELPS ILKVSQGSPI TKLNNNTIWP SGWGQRTAVM AIINITPDSF SDGGKYLIEE EALKRALMSI KDGADVIDLG AQSTRPGANI ISPDEELKRL LPILKSIRSK LPNSIISVDT FHSSVAEKAL EEGADWINDV TGSKYDKRMV DVLSSGNFPY VLTHSKSNNK TIHSQAEYNN VVEDVYQSLI ELTDNAISKG IREKNIIWDP GIGFSKNREH NIQILTNIDK FSGGNFPLII GASRKRFIGE IINEKNPLKR NAGNISVVCK CVASNVDMVR VHDVKETIQT IRLANELWK
|
| |