Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_17221 |
Symbol | folC |
ID | 4781162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1407357 |
End bp | 1408595 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085007 |
Product | putative bifunctional dihydrofolate/folylpolyglutamate synthase |
Protein accession | YP_001015542 |
Protein GI | 124026427 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0285] Folylpolyglutamate synthase |
TIGRFAM ID | [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.254744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATG AAACTTTCTT AGCACAATCG AATTTCATTT CTGAAAGCAA GAGCAAAGAA TATAAAGATA TGAATCTGGG CATAGATCGT ATGTCATTGG CTATTAATGC TATGGGAGAT CCCTGCAAAA AAACACCTGC TATTCACATA GCAGGAACAA ATGGGAAAGG CTCTATTGGC GCTTTTATCA ATAGTGTTCT TAGCTTAGTA AATATCAAAA CTGGAGTTAC CACTTCTCCC CATTTAGTTG ATTGGGTCGA GAGGATATGT ATCAACAAGA CTCCAATATC AAAAGAAGAA TTTCAATCAT TAAGCCTGTC TCTTTCTCCA ATTGCAAAAA AATATAGCTT GACTCCTTTT GAATGTGTTA TTGCAATAGC ACTTAAATAT TTTACTTTAA GGGAAGTTGA ACTTCTGATC CTTGAAGTAG GGCTTGGAGG TAGACTTGAT GCAACAACCG CGCATCAATA TAGACCTATT ATTGCATTTG GAGCTATTGG GCTTGATCAT TGTGAATATT TAGGCAATAG TCTAGAAAAA GTAGCTATTG AGAAAGCAGC TGTAATCACT ACAAAGAGCA CTGTCATTAC AGCCACACAA AATAATATTG TAAAAAGAGT TTTAGAAGAG ACTGCCAAGA GGAAACAAGC AGTCATTCAC TGGGTAGATC CAATTCCATT AGACTGGGAA CTCGGCTTAT CAGGAGTAAT ACAGAAAGAA AATGCAGCTG TAGCTAAAGG GGTTATTGAA TCTTTAAAAA ATATAGGCTG GAATATTTCT GAAGGACAAC TTCGAGAAGG CTTATCTCTT GCTAAATGGC CTGGAAGACT TCAAACAACA AAATGGGAAG GGATGCCCAT AGTTGTTGAT GGTGCACATA ATCCTCATGC GGCTAATCAA TTATCAATTG AAAGGGACGC ATGGACTAAT CAAGAAAGTG GAATTATTTG GATACTAGGG ATTCAAAAAA GAAAAGACAT GAAAGGCATT TTGTATAAAC TAGTTAGAGA GAAAGATTTA GCTTGGATAG TTCCTGTCCC TGGCCAACAA AGCTGGTCAA AAGATCAAAT TTTAAGTTTT TGCCCTGAAT ATAAACACCA AATCAAAGAA GCTTTTAGTG TAGAGGAAGT TTTGTCAACA CTTAAAAAAC AACATAGATG GCCATCTCCA CCACCAATCA TTTCAGGATC GCTATATTTA ATAGGTGACC TTTTTCAAAG GAAAATCTTG ACAAGTTAA
|
Protein sequence | MSDETFLAQS NFISESKSKE YKDMNLGIDR MSLAINAMGD PCKKTPAIHI AGTNGKGSIG AFINSVLSLV NIKTGVTTSP HLVDWVERIC INKTPISKEE FQSLSLSLSP IAKKYSLTPF ECVIAIALKY FTLREVELLI LEVGLGGRLD ATTAHQYRPI IAFGAIGLDH CEYLGNSLEK VAIEKAAVIT TKSTVITATQ NNIVKRVLEE TAKRKQAVIH WVDPIPLDWE LGLSGVIQKE NAAVAKGVIE SLKNIGWNIS EGQLREGLSL AKWPGRLQTT KWEGMPIVVD GAHNPHAANQ LSIERDAWTN QESGIIWILG IQKRKDMKGI LYKLVREKDL AWIVPVPGQQ SWSKDQILSF CPEYKHQIKE AFSVEEVLST LKKQHRWPSP PPIISGSLYL IGDLFQRKIL TS
|
| |