Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04631 |
Symbol | purB |
ID | 4776521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 466458 |
End bp | 467753 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085967 |
Product | adenylosuccinate lyase |
Protein accession | YP_001016480 |
Protein GI | 124022173 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCGAGC GCTACACACT GCCCGAAATG GGCGAGATTT GGACCGACAG GGCCAAGTAT CAAAGCTGGC TAGATGTAGA GATCGCTGCT TGTGAGGCCA ACTGTCAACT GGGGAAGATC CCAGAGGCTG AAATGCAGCA GATTCGTGAA CGCGCAACCT TCGAACCACA GCGCATCTTG GAGATTGAGG CAGAGGTTCG CCACGACGTC ATCGCCTTCC TGACCAACGT TAATGAAAAC GTAGGGGATG CCGGCCGCTA CATCCACGTC GGCATGACCA GCAGCGATGT GCTGGATACG GGTCTGGCCC TGCAGTTAAA AAACTCTGTG GCATTGCTAC AACAAGAACT GGGCAGCCTT CAAGAGGCGA TCCGCAGCTT GGCAGTGGAG CACAAGGGCA CAGTCATGAT CGGCCGCTCC CATGCCATCC ATGGCGAACC AATCACCTTC GGTTTCAAAC TGGCCGGTTG GCTAGCAGAA ACAATGCGCA ATGCCGAGCG ACTGGAGAGG CTGGAGAGGG ATGTGGCTGT AGGCCAGATC AGTGGCGCCA TGGGCACCTA CGCCAACACG GATCCAAAGG TTGAGCAGCT CACATGCGAG CGCCTTTGCC TCATCCCAGA CACCGCTAGT ACCCAGGTCA TCTCTCGCGA TCGTCATGCG GACTATGTAC AGACCCTGGC ATTAGTGGGG GCGTCTCTAG ATCGATTCGC GACAGAGATC CGCAACTTGC AGCGAACCGA TGTGCTGGAA GTGGAGGAGA GCTTTGCTAA GGGACAAAAG GGAAGTTCGG CGATGCCACA CAAACGCAAC CCGATTCGGG CTGAGCGGAT TAGTGGTCTT GCAAGGGTCC TACGCAGCTA TGTCGTCGCA GCACTCGAGA ACGTGGCCCT CTGGCATGAG CGTGATATCA GTCACAGCTC CACTGAGCGA ATGATGCTGC CGGATTGCTC CGTCACACTC CACTTCATGT TGCGAGAGAT GACCCAAGTC GTGCAGGGCC TTGGCGTCTA CCCAGCAAAC ATGCGCCGCA ACATGAATAT CTATGGCGGC GTGGTGTTCA GTCAGCGGGT GCTATTGGCG CTTGTTGAGA ACGGCATGAA CAGAGAAGAT GCCTACAGTG TTGTCCAGCG CAACGCCCAT GCTGCGTGGA ATACCGAAGG GGGTAATTTC CGCGCCAATC TTGAGGCCGA TCCTGAAGTA TCGACCCTTC TCAATGCCAA GGCGCTAGCC GAATGCTTCA GCACAGAGCT ACACCAAGCC AACCTGGACG TGATCTGGCA ACGGCTCGGA CTCTGA
|
Protein sequence | MIERYTLPEM GEIWTDRAKY QSWLDVEIAA CEANCQLGKI PEAEMQQIRE RATFEPQRIL EIEAEVRHDV IAFLTNVNEN VGDAGRYIHV GMTSSDVLDT GLALQLKNSV ALLQQELGSL QEAIRSLAVE HKGTVMIGRS HAIHGEPITF GFKLAGWLAE TMRNAERLER LERDVAVGQI SGAMGTYANT DPKVEQLTCE RLCLIPDTAS TQVISRDRHA DYVQTLALVG ASLDRFATEI RNLQRTDVLE VEESFAKGQK GSSAMPHKRN PIRAERISGL ARVLRSYVVA ALENVALWHE RDISHSSTER MMLPDCSVTL HFMLREMTQV VQGLGVYPAN MRRNMNIYGG VVFSQRVLLA LVENGMNRED AYSVVQRNAH AAWNTEGGNF RANLEADPEV STLLNAKALA ECFSTELHQA NLDVIWQRLG L
|
| |