Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_19121 |
Symbol | purU |
ID | 4718651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1651395 |
End bp | 1652267 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640079647 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001010302 |
Protein GI | 123969444 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.288968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATAGTTA AAGATTTTTT GGAACATCCT TCAATTATAT TCAAAATTGT TTGTCCTGAT CGTCCTGGCC TTGTGAGTTT ACTTACAAGT TGGATTTCAA ACTACGGTGG CAACATAAAA CATTCTGATC ATCATACAGA TCAAGATGCG GGTTTGTTCC TTAGTCGAAT TGAATGGAAT AGTAAAAATG CATTTTTGAA TAGAGATGAA ATTTATAAAG AATTTGAAAA AATTGCAGAT GAAGTAAATG GAAAATTTAA TGTAAATTAT TCTGATGAAA TTCCAAATGT TGCAATTTTT GTGAGTAAAC AAAATCATTG CTTGATTGAT TTACTTTGGC GAGTAAGAAA TGGTGAATTG AAAATGCAAG TGCCGGTAAT AATTTCGAAT CATTCTGACC TTGAAAATAT TGCAAACGAC TTTAATGCAA AATTTGTCTA TGTTGATACC TTTAATATTG ACAAATCTGT TGTTGAAGAT CAATTTTTAA ATTTATTAAA AGAATATGAA ATTGATCTTG TTGTGTTAGC CAAATATATG CAAATTTTGA GTGACTCTTT TTTAAAAAAG TTCTCTTCAA TAATCAATAT TCATCATTCT TTCTTACCTG CATTTAAGGG CGGTCAACCA TATCATCGAG CATGGAAGAG AGGTGTTAAA TTAATCGGTG CCACAGCTCA CTATGTTACT GAAGATCTTG ATGAAGGTCC GATCATAGAG CAATGCACAG TTAATGTAAG TCATAGGGAT GAAGTTGATG ATTTGATTAG AAAAGGAAGA GATATTGAAA GAATTGCTCT AGCAAGAGCA GTTAGATTAC ATCTGAATCA TCAAGTAATT GTTTATAATA GTAAAACTGC TGTTTTTGAT TGA
|
Protein sequence | MIVKDFLEHP SIIFKIVCPD RPGLVSLLTS WISNYGGNIK HSDHHTDQDA GLFLSRIEWN SKNAFLNRDE IYKEFEKIAD EVNGKFNVNY SDEIPNVAIF VSKQNHCLID LLWRVRNGEL KMQVPVIISN HSDLENIAND FNAKFVYVDT FNIDKSVVED QFLNLLKEYE IDLVVLAKYM QILSDSFLKK FSSIINIHHS FLPAFKGGQP YHRAWKRGVK LIGATAHYVT EDLDEGPIIE QCTVNVSHRD EVDDLIRKGR DIERIALARA VRLHLNHQVI VYNSKTAVFD
|
| |