Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11841 |
Symbol | purT |
ID | 5731016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1070244 |
End bp | 1071422 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285552 |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | YP_001551069 |
Protein GI | 159903725 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) |
TIGRFAM ID | [TIGR01142] phosphoribosylglycinamide formyltransferase 2 [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.703197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0751246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTT ATCCAAAGAC CTTAATGTTG CTAGGGAGTG GAGAGCTTGG GAAGGAAGTC GCTATTTCGG CTAAAAGACT TGGCTGCAAA GTGATTGCTT GCGATAGGTA CGCTGACGCT CCTGCAATGC AAGTGGCAGA TATTGCCGAA ATTTTGGATA TGACCAACTC CGGTGAGCTT AAGGATGTTA TTAATTCATA TAAGCCAGAT GTAATCATTC CAGAAATAGA AGCTTTAGCA GTTGATGCTT TGGATGAGAT TGAGAGAGAT GGGATAACTG TCATACCAAC AGCAAGAGCA ACACAAATAA CAATGAATAG AGACAAAATA AGAGATTTAG CATCTAATAA GCTTCATCTC AAAACTGCTA AATATGTTTA TGCATCCAAC AGTGATGAAG TCAAAAAAGC TGCTAAAGAA CTTGGGTTGC CCGTAATAGT TAAACCAATT ATGAGTTCTT CTGGGAAAGG TCAATCTTTC ATAAGGAAAG AAGAGGACAT TGAGCTTGCC TGGGAACTTG CTCTAAAGAA AGCAAGAGGA GTTTCAAGCA GGGTAATAGT AGAAGAGTTT CTCGAATTCG ACTTTGAGAT TACATTGTTG ACTATTCGTC AAAAAGACGG GTCCACACTT TTCTGTCCTC CAATTGGCCA CGAACAAAAA AATGGGGACT ATCAATCCAG TTGGCAACCT GTAGTAATTA CAGAGAGTCA ACTACAAGAA GCACAAGTTA TGGCAAAAGC TGTAACTCAA GAGCTTGGCG GAGTTGGAAT ATTTGGGGTT GAATTCTTTA TAACTAAAGA AAGTGTGATT TTTTCTGAAT TGTCGCCTAG GCCACATGAT ACAGGCCTTG TAACGCTTAT TAGCCAGGAC CTAAGTGAGT TTGACCTTCA TGTCAGGGCG ATCCTAGGCC TTCCAATTCC TTCAATCACA GTTAATCACC CTAGTGCTAG TAGAGTTATC TTGTCTGAGA AGAATTGCAC AAAAGTTGCG TATAGAGGAA TTGAAAAAGC TTTGGAAGAA GAAGGTACCA AAATACTAAT ATTTGGAAAG CCAAGTGCTA CAAAAGGCAG AAGGATGGGT GTGGCATTGG CAAAGTCTTC AACCTTAGAT AAAGCTCTAA TAAAAGCTAA TAATTCGGCA ATTAATGTAG AAGTAATTCA AGATGATTAC TCCAATTAA
|
Protein sequence | MNIYPKTLML LGSGELGKEV AISAKRLGCK VIACDRYADA PAMQVADIAE ILDMTNSGEL KDVINSYKPD VIIPEIEALA VDALDEIERD GITVIPTARA TQITMNRDKI RDLASNKLHL KTAKYVYASN SDEVKKAAKE LGLPVIVKPI MSSSGKGQSF IRKEEDIELA WELALKKARG VSSRVIVEEF LEFDFEITLL TIRQKDGSTL FCPPIGHEQK NGDYQSSWQP VVITESQLQE AQVMAKAVTQ ELGGVGIFGV EFFITKESVI FSELSPRPHD TGLVTLISQD LSEFDLHVRA ILGLPIPSIT VNHPSASRVI LSEKNCTKVA YRGIEKALEE EGTKILIFGK PSATKGRRMG VALAKSSTLD KALIKANNSA INVEVIQDDY SN
|
| |