Gene Information |
Locus tag | P9301_16551 |
Symbol | purB |
ID | 4911004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1383597 |
End bp | 1384892 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640161252 |
Product | adenylosuccinate lyase |
Protein accession | YP_001091879 |
Protein GI | 126696993 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
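The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1383597, 1384892)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both values agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields listed in this record.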
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGAGC GTTACACATT ACCCGAGATG GGAAAAATCT GGACTGATAG CGCAAAATTC CAGAGTTGGC TTAAGGTTGA AATAGCTGCA TGTGAAGCAA ATTTTTCCCT AGGAAAAATT CCTGAGAATG CCATGAAAGA GATACGTTCA AATGCAAAGT TTGATGAATC TAGAATTGCA GAAATTGAGA AAGAAGTTAA ACATGATGTC ATAGCATTTC TTACAAGCGT TAATGAATTT GTAGGAGATT CTGGAAGATA TATCCATGTT GGTATGACCA GTAGTGATGT ACTTGATACT GGCTTATCTC TTCAGTTAAA AGATTCTTGC GAATTGTTAT CAGAAGAAAT TGAGAACCTA GAAAATGAAG TCAGATTATT AGCAAGGAAG CACAAAAATA CATTAATGAT TGGTAGATCT CATGCAATTC ATGGGGAGCC AATCTCCTTC GGTTTTAAAC TTGCTGGATG GTTAGCAGAA ATAATAAGGA ACAAAAAAAG ATTGTTAACA CTGAAAGAAT CTGTAGCAAT TGGACAAATA AGTGGTGCAA TGGGAACTTA CGCTAATACG AATCCCAAAG TAGAACAAAT AACTTGTGAT TTACTCGGCT TAAAACCTGA TACAGCAAGT ACTCAGGTCA TATCGAGAGA CAGGCATGCA GAATATGTTC AAACTATTGC GCTAGTTGGC GCTTCTCTAG ATAGATTCGC AACTGAAATA AGAAATTTAC AAAGAACTGA TGTTTTAGAA GTTGAGGAGG GCTTTACAAA AGGGCAAAAA GGAAGTTCTG CCATGCCTCA TAAAAGAAAT CCTATTAGAA GTGAAAGAGT AAGCGGTTTA GCAAGAATTT TGAGGAGTTA CGTCTTAACA GCACTGGACA ATGTTCCACT TTGGCACGAA AGAGATATAA GCCATAGTTC AAATGAACGT ATCATGTTAC CTGACGTATC AATCTGTTTG CATTTTATGC TCAGGGAAAT GAAAGATATA GTAAGCAATT TGGAAGTTTA TCCAAAAAAT ATGCTTAAAA ATTTAAATAT ATATGGTGGT GTAATCTTTA GTCAGAAAGT TTTACTTTTG CTTGTAGAAA AGGGGTTGTC TAGAGAAAAA GCTTATAGCT TAGTCCAAAA AAATGCGCAT CAGGCGTGGA ATACTGAAAA TGGGAATTTC AAACAAAATA TAGAGAGAGA TAATGAAATA ATGGATTTTA TTGATCAAAG TGACTTGGAA GAATGTTTTA ATCCTTCAAT TCATCTCAAT AATTTAAGTG TAATATGGGA GAAGTTAGGT ATCTAG
|
Protein sequence | MIERYTLPEM GKIWTDSAKF QSWLKVEIAA CEANFSLGKI PENAMKEIRS NAKFDESRIA EIEKEVKHDV IAFLTSVNEF VGDSGRYIHV GMTSSDVLDT GLSLQLKDSC ELLSEEIENL ENEVRLLARK HKNTLMIGRS HAIHGEPISF GFKLAGWLAE IIRNKKRLLT LKESVAIGQI SGAMGTYANT NPKVEQITCD LLGLKPDTAS TQVISRDRHA EYVQTIALVG ASLDRFATEI RNLQRTDVLE VEEGFTKGQK GSSAMPHKRN PIRSERVSGL ARILRSYVLT ALDNVPLWHE RDISHSSNER IMLPDVSICL HFMLREMKDI VSNLEVYPKN MLKNLNIYGG VIFSQKVLLL LVEKGLSREK AYSLVQKNAH QAWNTENGNF KQNIERDNEI MDFIDQSDLE ECFNPSIHLN NLSVIWEKLG I
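The gene sequence begins with GTG while the protein begins with M, because translation table 11 permits alternative initiation codons that are read as methionine at the start of a CDS. The following is a minimal sketch of how the space-separated sequence blocks could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and `START_CODONS` is only an illustrative subset of the initiation codons allowed by table 11, not the full list.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Sense codons in table 11 match the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: illustrative subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an allowed first codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):  # trailing partial codon is ignored
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
fragment = "GTGATCGAGC GTTACACATT"
print(translate(fragment))  # MIERYT
```

The output for this fragment matches the first six residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.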