Gene NATL1_17221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17221 
SymbolfolC 
ID4781162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1407357 
End bp1408595 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content36% 
IMG OID640085007 
Productputative bifunctional dihydrofolate/folylpolyglutamate synthase 
Protein accessionYP_001015542 
Protein GI124026427 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATG AAACTTTCTT AGCACAATCG AATTTCATTT CTGAAAGCAA GAGCAAAGAA 
TATAAAGATA TGAATCTGGG CATAGATCGT ATGTCATTGG CTATTAATGC TATGGGAGAT
CCCTGCAAAA AAACACCTGC TATTCACATA GCAGGAACAA ATGGGAAAGG CTCTATTGGC
GCTTTTATCA ATAGTGTTCT TAGCTTAGTA AATATCAAAA CTGGAGTTAC CACTTCTCCC
CATTTAGTTG ATTGGGTCGA GAGGATATGT ATCAACAAGA CTCCAATATC AAAAGAAGAA
TTTCAATCAT TAAGCCTGTC TCTTTCTCCA ATTGCAAAAA AATATAGCTT GACTCCTTTT
GAATGTGTTA TTGCAATAGC ACTTAAATAT TTTACTTTAA GGGAAGTTGA ACTTCTGATC
CTTGAAGTAG GGCTTGGAGG TAGACTTGAT GCAACAACCG CGCATCAATA TAGACCTATT
ATTGCATTTG GAGCTATTGG GCTTGATCAT TGTGAATATT TAGGCAATAG TCTAGAAAAA
GTAGCTATTG AGAAAGCAGC TGTAATCACT ACAAAGAGCA CTGTCATTAC AGCCACACAA
AATAATATTG TAAAAAGAGT TTTAGAAGAG ACTGCCAAGA GGAAACAAGC AGTCATTCAC
TGGGTAGATC CAATTCCATT AGACTGGGAA CTCGGCTTAT CAGGAGTAAT ACAGAAAGAA
AATGCAGCTG TAGCTAAAGG GGTTATTGAA TCTTTAAAAA ATATAGGCTG GAATATTTCT
GAAGGACAAC TTCGAGAAGG CTTATCTCTT GCTAAATGGC CTGGAAGACT TCAAACAACA
AAATGGGAAG GGATGCCCAT AGTTGTTGAT GGTGCACATA ATCCTCATGC GGCTAATCAA
TTATCAATTG AAAGGGACGC ATGGACTAAT CAAGAAAGTG GAATTATTTG GATACTAGGG
ATTCAAAAAA GAAAAGACAT GAAAGGCATT TTGTATAAAC TAGTTAGAGA GAAAGATTTA
GCTTGGATAG TTCCTGTCCC TGGCCAACAA AGCTGGTCAA AAGATCAAAT TTTAAGTTTT
TGCCCTGAAT ATAAACACCA AATCAAAGAA GCTTTTAGTG TAGAGGAAGT TTTGTCAACA
CTTAAAAAAC AACATAGATG GCCATCTCCA CCACCAATCA TTTCAGGATC GCTATATTTA
ATAGGTGACC TTTTTCAAAG GAAAATCTTG ACAAGTTAA
 
Protein sequence
MSDETFLAQS NFISESKSKE YKDMNLGIDR MSLAINAMGD PCKKTPAIHI AGTNGKGSIG 
AFINSVLSLV NIKTGVTTSP HLVDWVERIC INKTPISKEE FQSLSLSLSP IAKKYSLTPF
ECVIAIALKY FTLREVELLI LEVGLGGRLD ATTAHQYRPI IAFGAIGLDH CEYLGNSLEK
VAIEKAAVIT TKSTVITATQ NNIVKRVLEE TAKRKQAVIH WVDPIPLDWE LGLSGVIQKE
NAAVAKGVIE SLKNIGWNIS EGQLREGLSL AKWPGRLQTT KWEGMPIVVD GAHNPHAANQ
LSIERDAWTN QESGIIWILG IQKRKDMKGI LYKLVREKDL AWIVPVPGQQ SWSKDQILSF
CPEYKHQIKE AFSVEEVLST LKKQHRWPSP PPIISGSLYL IGDLFQRKIL TS