Gene P9211_11201 details


Gene Information

Locus tag: P9211_11201
Symbol: folD
ID: 5730557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1023723
End bp: 1024631
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 42%
IMG OID: 641285488
Product: putative bifunctional methylenetetrahydrofolate dehydrogenase methenyltetrahydrofolate/cyclohydrolase
Protein accession: YP_001551005
Protein GI: 159903661
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0190] 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase
TIGRFAM ID:
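
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1023723, 1024631)
print(length)                           # 909
print(expected_protein_length(length))  # 302
```

Under these assumptions the computed values agree with the Gene Length (909 bp) and Protein Length (302 aa) fields of the record.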


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.466045
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCATA AACTTGATGG TAAGCAATTA GCTGGAGAGA TTGAGCAAAG ACTTAGTCAT 
GAGATAACTC TTTGCTTAAA AAAAGGTGTC CGCCCACCAG GGTTGGCGGT GATCCGAGTT
GGTGATGATC CTGCCAGTCA AGTTTATGTT TCGAATAAAG AAAAGGCCTG TAGAAGGGCT
GGTATTAAAA GTTTTGGTTG CCATCTTGAT GCAAATTCTT CTTTTCGTGA AATTGAGGAA
CAGATTATAA AGTTGAACTC CAACCAAGAA GTGGATGGCA TTTTGTTGCA GCTTCCTCTA
CCTATAGGAC TTGATGCAGG AAGACTTTTG AAGGTTATAG ACCCCAGGAA AGATGCTGAT
GGACTACACA CTTTAAATTT AGGAAGATTA CTCAAGGATG AAATAGGCCC TAGATCTTGT
ACTCCTGCTG GAGTTATGGC TTTGTTGGCT GCTAATCAGA TAGAGATTAA GGGTAAGAAC
ACTGTTGTCA TTGGTCGTAG CATTCTCGTA GGGAAACCAA TGGCATTAAT GCTTCAGGCT
GCGAATGCAA CTGTTACTCT TGTTCATTCT CATACAAGGG ATTTGATTGG CTTTACAAAA
CAAGCAGAAA TACTTGTTGT GGCTGCAGGG AAGCCTCAAT TGATTGGCCT AGAGCATGTC
AAGGAAAAAT CGGTAGTAGT AGATGTGGGA ATTCATAGGG TATTTAAGGA TCAAAACTTA
GGAGATGCTG GCGGTTACAA GCTTTGTGGT GATGTTCGTA GAGAAGAGGT TGATGATTTT
GTAAGTGCAA TTACACCAGT CCCTGGAGGC GTTGGCCCTA TGACTGTTGC AATGTTGCTT
GTAAATACTG TTAATAGTTG GCAGCAGCAT TGCGACTTAT CCTTGAGTTT GGATGATTTA
CTTCCATGA
 
Protein sequence
MAHKLDGKQL AGEIEQRLSH EITLCLKKGV RPPGLAVIRV GDDPASQVYV SNKEKACRRA 
GIKSFGCHLD ANSSFREIEE QIIKLNSNQE VDGILLQLPL PIGLDAGRLL KVIDPRKDAD
GLHTLNLGRL LKDEIGPRSC TPAGVMALLA ANQIEIKGKN TVVIGRSILV GKPMALMLQA
ANATVTLVHS HTRDLIGFTK QAEILVVAAG KPQLIGLEHV KEKSVVVDVG IHRVFKDQNL
GDAGGYKLCG DVRREEVDDF VSAITPVPGG VGPMTVAMLL VNTVNSWQQH CDLSLSLDDL
LP
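
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation routine. The sketch below is illustrative only: it builds the codon table from the usual TCAG ordering, and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as listed in this record.
first_row = "ATGGCCCATA AACTTGATGG TAAGCAATTA GCTGGAGAGA TTGAGCAAAG ACTTAGTCAT"
print(translate(first_row))  # MAHKLDGKQLAGEIEQRLSH
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.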