Gene P9303_19851 details


Gene Information

Locus tag: P9303_19851
Symbol: folC
ID: 4777373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1745460
End bp: 1746701
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 56%
IMG OID: 640087497
Product: putative bifunctional dihydrofolate/folylpolyglutamate synthase
Protein accession: YP_001017992
Protein GI: 124023685
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
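
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1745460
end_bp = 1746701

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1242 413"
```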


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0732491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCAAAC CCACAAGTCC AAGCAAGGAA GATTTAAGCG ATCTGATTCC TCGTTTTGAT 
CAGCGGGGGA TGGATCTTGG CCTTGAGCGC ATGCAACAAG CACTGCAGGC CATGGGAAAC
CCCTGTGCCT CTATTCCAGC CATTCAGGTC GTAGGGACCA ATGGCAAAGG GTCTATCGCC
AGCTTCATTG CCAGCAGCCT TAAGGCTGCT GGCATTCGCG TGGGGCTGAC CACCTCTCCT
CACCTCGTGA GCTGGTGTGA ACGGATCAGC AGCGATGGTG AGCTGATCTC AATCGTTGAG
CTACGCCAAC GACTCACAGC CCTGCAAGCC CTTGCCCAAA CCCACCGACT CACCCCCTTC
GAGCTGTTGA TGGCCACGGC TTTTGATCAC TTCAGATCCC GCGAGGTAGA GCTGCTTGTG
CTTGAGGTGG GCCTCGGAGG ACGCCTCGAT GCCACCACAG CTCATCCATG CAGACCAATC
ATTGCCATGG CCAACATCGG CCTCGACCAC TGCGAACACC TTGGTTACAG CCTTAAAGAG
ATAACAGCAG AAAAAAGTGC CGTCATCAGT CCTGGGGCTG CTGTCATCAG TGCTCGTCAA
CAAGCTGAAG TGGCGAGCAT TCTTGAAGAC AAGGCAAAAC ATCAACAGGC CCGCTTGCAA
TGGGTTTCAC CCCTCCCAGA TGACTGGACC TTGGGCTTGC CAGGAGTCCT TCAGCGGCAG
AACGCGGCGG TAGCCAAGGG TGCTCTTGAA TCCCTGGTCC CACTTGGCTG GCGGCTTAAT
GAAGACGTCA TTCGCACAGG GCTGGCCCAC GCCTACTGGC CTGCACGGCT GCAGACCGTT
CACTGGCAAA GCCAGCCAGT GCTCATTGAT GGTGCCCATA ACCCACCAGC CACCGAACGG
TTGGCCCACG AGCGGCAGCA GTGGAGCAAT CAAGAGTTTG GGGTTTGCTG GGTGCTTGGT
CTACAGGCCC ACAAGCAAGC ACCAGCCATG CTTCGCCATT TACTCAAGCC AAGCGATCTG
GCTTGGATCG TGCCGGTGCC GGAACACTGC AGCTGGACGC AGCACCAGCT CGCAGCACAC
TGCCCGGACC TATCAAGCCA ACTCCAATCT GCTGACAATG TTGAACAGGT CTTCTCCATC
CTGCTCAAAC AGAATCGATG GCCAAATCCT CCTCCAGTTG TAGCTGGTTC CCTCTATTTA
TTAGGTGATC TTCTCGCAAA GCAAACCATC AAGGCAGAGT GA
 
Protein sequence
MAKPTSPSKE DLSDLIPRFD QRGMDLGLER MQQALQAMGN PCASIPAIQV VGTNGKGSIA 
SFIASSLKAA GIRVGLTTSP HLVSWCERIS SDGELISIVE LRQRLTALQA LAQTHRLTPF
ELLMATAFDH FRSREVELLV LEVGLGGRLD ATTAHPCRPI IAMANIGLDH CEHLGYSLKE
ITAEKSAVIS PGAAVISARQ QAEVASILED KAKHQQARLQ WVSPLPDDWT LGLPGVLQRQ
NAAVAKGALE SLVPLGWRLN EDVIRTGLAH AYWPARLQTV HWQSQPVLID GAHNPPATER
LAHERQQWSN QEFGVCWVLG LQAHKQAPAM LRHLLKPSDL AWIVPVPEHC SWTQHQLAAH
CPDLSSQLQS ADNVEQVFSI LLKQNRWPNP PPVVAGSLYL LGDLLAKQTI KAE
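
The gene sequence begins with GTG while the protein sequence begins with M, which fits translation table 11, where GTG can serve as an initiation codon. The following is a minimal sketch of that translation for the first 30 nucleotides of the gene sequence. The function name `translate_cds` and the rule of reading the first codon as methionine are assumptions made for illustration, not something this record specifies; the codon assignments are the standard ones, which table 11 shares for elongation.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments listed in TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if index == 0:
            aa = "M"  # assumption: the initiation codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence above.
first_30_nt = "GTGGCCAAAC CCACAAGTCC AAGCAAGGAA"
print(translate_cds(first_30_nt))  # prints "MAKPTSPSKE"
```

The output matches the first ten residues of the protein sequence listed above.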