Gene P9211_13501 details


Gene Information

Locus tag: P9211_13501
Symbol: folC
ID: 5730990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1218587
End bp: 1219828
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 37%
IMG OID: 641285723
Product: putative bifunctional dihydrofolate/folylpolyglutamate synthase
Protein accession: YP_001551235
Protein GI: 159903891
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
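
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that contributes no residue. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1218587, 1219828)
print(length, protein_length_aa(length))  # prints "1242 413"
```

Under these assumptions the result matches the listed Gene Length (1242 bp) and Protein Length (413 aa).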


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATTAACT TCACAGTCGA TGGCGATGAC GAAATTGAAA ACCTTCTAAG TTTATATAAG 
GCTAGAGGTA TAAGTCTTGA ATTAAATAGG ATGCAAGCAG CTCTAAAAAA TCTTGGCAAT
CCTTACAACG AAATTCCTGC AATACAAGTT ATAGGAACAA ATGGGAAAGG TTCAATTGTA
AGTTTTCTTG AGAGCTGCCT AAAAGAAGCA AGAATTAAAA TTGGATGTAG CACCTCTCCT
CATCTCGTAA GCTGGCGCGA GCGAATTCGC ATTAATGGGC AAGAAATATC TTCTCAAGAC
TTTCTGAAAA TTCTTACCAA ATTCCAAGCA ATCGCAAAAA GCTACCGCTT AACCCCATTT
GAACTAATAA TACTTTCTGC ATTTGATTAT TTTTACAGCA ATCAAGTTGA GTTAATGGTT
TTAGAGGTGG GGCTAGGAGG AAAACTCGAT GCAACAACAG CACATCCTTT CAGACCTTTA
ATAGCTATTG GAGGAATTGG CTTAGACCAT TGTGAATATC TGGGAAATAC TTTAACAGCA
ATTACTAAAG AAAAAGCCGC TGTGATTTCA TATGGAAGTA CTGTAATTAG TTCACCCCAG
GAACCAGAAG TAAAAAGAGT TATTGAGAAA GTTGTATCTA AAAATAATGC AAGAATTATA
TGGGTAGAGC CATTATCTAA AGATTGGGAA TTAGGTATAG CTGGTGAAAT TCAAAGAACA
AACGCCGCAG TAGCTAAAGG AATTTTAGAA GCCTTACCAA GCTTTGGATG GGAAGTCAAT
CAAACAACAA TTCGTAGAGG GCTGTCCCTA GCAAAATGGC CAGGAAGGCT TCAAAAAGCA
AGCTGGGGGA ATATGCCATT AATTTTAGAT GGAGCCCATA ATGAACATGC AGCCAATCAA
TTAGCTAAAG AACGATTGCT CTGGCCATCA GAAAGCAATG GAATTTTTTG GATTTTTGGC
ATTCAAGCGC ATAAAGATGG TCCTGAGATA ATTAGGAAAT TGCTAAAAGT CAATGATCTT
GGATGGGTTG TTCCAGTTCC AAATACTAAA AGTTGGAGCA AAAGTAATCT TTGCAAAACA
TATCCTGAAA TGTCGAATCA ATTAAATGAA GCCAATAGCG TGGCAAAAGT CCTAGAGAAG
ATTTCATCTG GAGAGATGTG TAAGGATAAA AAAACCATTG TAATAACTGG TTCCTTACTT
CTTATAGGAA ACCTTTTAAG AAAAGACTTA CTTCTCTTTT AA
 
Protein sequence
MINFTVDGDD EIENLLSLYK ARGISLELNR MQAALKNLGN PYNEIPAIQV IGTNGKGSIV 
SFLESCLKEA RIKIGCSTSP HLVSWRERIR INGQEISSQD FLKILTKFQA IAKSYRLTPF
ELIILSAFDY FYSNQVELMV LEVGLGGKLD ATTAHPFRPL IAIGGIGLDH CEYLGNTLTA
ITKEKAAVIS YGSTVISSPQ EPEVKRVIEK VVSKNNARII WVEPLSKDWE LGIAGEIQRT
NAAVAKGILE ALPSFGWEVN QTTIRRGLSL AKWPGRLQKA SWGNMPLILD GAHNEHAANQ
LAKERLLWPS ESNGIFWIFG IQAHKDGPEI IRKLLKVNDL GWVVPVPNTK SWSKSNLCKT
YPEMSNQLNE ANSVAKVLEK ISSGEMCKDK KTIVITGSLL LIGNLLRKDL LLF
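
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not a tool provided by the database, of how the blocked text could be cleaned and how a GC percentage comparable to the GC content field could be computed. The function names are hypothetical, and the sketch assumes the pasted text contains only sequence letters and whitespace.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which handles both the block spacing and the line breaks.
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "TTGATTAACT TCACAGTCGA TGGCGATGAC GAAATTGAAA ACCTTCTAAG TTTATATAAG"
print(len(clean_sequence(first_line)))  # prints 60
```

Passing the full gene sequence text to `gc_percent` gives a value to compare against the reported GC content. Note that the gene begins with TTG, an alternative start codon under translation table 11, which is why the protein sequence still begins with M.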