Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_18381 |
Symbol | purU |
ID | 5731686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1669730 |
End bp | 1670584 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641286225 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_001551723 |
Protein GI | 159904379 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTTCTA TAACTGTAAT ATTGCAACTG ATATGTCCTG ATCGTCCAGG ATTAGTAAGT GCAATAGCTG GGTGGGTCGC AAAAAATGAT GGCAATATTC GGCATGCTGA TCATCATACA GATGAGGGGG CAGGTCTTTT TTTGAGTCGA ATTGAGTGGG ATCTTCAAGG TTTTAGCATG CCTAGACAAT CTATAGGGTC AGCAGTGAAT AAATTGGCAG ATCGATTGGG AGGTCAAGCA CAATTAAACT TTTCGGATGA GTATCCAAGG GTTGCAATTT TTGTTAGTAA GCAGAGTCAT TGCTTATTAG ACCTTCTTTG GAGGGTCCGA AGTGGAGAAA TTCAAATGAA AGTACCCTTG ATCATTTCTA ATCATCTTGA TTTGAGTTAT ATAACTAGAG ATTTTGATGT TGATTTCCAA CATATTCCTG TTAACTCGCA CAATAAATTG GAATCTGAAA AAATTATTTT AAATACGTTA TTAGATCATC GTATTGAATT AATAGTTTTA GCTAAATATA TGCAAGTTCT CAGCCCTGGA TTTTTGAAAA AATTCCCATT AATTATCAAT ATCCATCATT CATTTTTGCC AGCTTTCAAA GGAGCGCAAC CGTACCATCA AGCTTGGAAT CGAGGAGTTA AATTGATAGG TGCAACTGCT CATTATGTTA CTGAAGAACT AGATGATGGT CCGATTATTG AGCAAACGAC TCTACAAGTG AGTCATAGAG ATGAAGTAGA TGATTTAATT CGAAAAGGTC GTGATACAGA AAGGATTGCG CTTGCGAGAG CATTGAGATT ACATCTGCGT AGACAGGTAA TGGTTTATTC CGGTCGGACC GCTGTTTTTG CATGA
|
Protein sequence | MTSITVILQL ICPDRPGLVS AIAGWVAKND GNIRHADHHT DEGAGLFLSR IEWDLQGFSM PRQSIGSAVN KLADRLGGQA QLNFSDEYPR VAIFVSKQSH CLLDLLWRVR SGEIQMKVPL IISNHLDLSY ITRDFDVDFQ HIPVNSHNKL ESEKIILNTL LDHRIELIVL AKYMQVLSPG FLKKFPLIIN IHHSFLPAFK GAQPYHQAWN RGVKLIGATA HYVTEELDDG PIIEQTTLQV SHRDEVDDLI RKGRDTERIA LARALRLHLR RQVMVYSGRT AVFA
|
| |