Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_30021 |
Symbol | purU |
ID | 4776552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2656454 |
End bp | 2657344 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640088526 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001018997 |
Protein GI | 124024690 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.53116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTCCC CTCTTGGTGC TGTATGGAAT CCTTACGGTT ACAAGCTTCG TGTGATTCTG CAGCTCATCT GTCCAGATCG GCCTGCTTTG GTGAGTGAGT TGTCGGGATG GGTGGCTGTT AATGGAGGCA ATATTCTTCA TGCTGATCAC CATACGGATG TTGGGGCTGG ACTGTTTCTG AGCAGGATCG AATTTGGAAT TGAGGGCTTC GGATTGCCTA GAGAGGCGAT AGCACCAGCT GTGAATGCCC TGGCAGATCG CCTCGGGGGT CAGGCGCAAT TGCATTTTTC GGATGAGATC CCTCGGGTGG CAATCTTCGC CAGCAAGCAG AGTCACTGTC TGTTGGATTT GCTTTGGCGC ACTCGCAGTG GGGAGTTGCC GATGCAGGTG CCGCTTGTGA TTGCAAACCA CTCTCAATTG GAGCCTTTAT GCAGGGAGTT TGGTGTTTGT TTTGAGTGTG TTCCTATGAC GCCTGCTAGC AAGCCTGAGG CGGAACAAAC CATGCTGGAT TTGTTGGCTG AGCATCGGAT TGAGCTGGTT GTGTTGGCCA AGTACATGCA GGTCCTTAGT GGTGCTTTTC TAGAGCGTTT CCCCACCGTG ATTAATATTC ACCATTCCTT CCTGCCGGCT TTTAAGGGGG CACAGCCCTA TCACAGGGCC TGGGATCGAG GGGTGAAAGT GATTGGGGCT ACCGCCCATT ACGTCACTGA AGATCTGGAT GACGGGCCGA TTATTGAGCA GACCATCGAG CATGTAAACC ATCGTGATGA GGTGGAAGAT TTGATTCGTA AAGGACGTGA TACGGAACGT CTCGCTTTGG CAAGGGCCTT GAGATTGCAT CTTTGTCGCC AGGTGATGGT CTATCGCGGT AGAACCGCTG TTTTTGCATG A
|
Protein sequence | MSSPLGAVWN PYGYKLRVIL QLICPDRPAL VSELSGWVAV NGGNILHADH HTDVGAGLFL SRIEFGIEGF GLPREAIAPA VNALADRLGG QAQLHFSDEI PRVAIFASKQ SHCLLDLLWR TRSGELPMQV PLVIANHSQL EPLCREFGVC FECVPMTPAS KPEAEQTMLD LLAEHRIELV VLAKYMQVLS GAFLERFPTV INIHHSFLPA FKGAQPYHRA WDRGVKVIGA TAHYVTEDLD DGPIIEQTIE HVNHRDEVED LIRKGRDTER LALARALRLH LCRQVMVYRG RTAVFA
|
| |