Gene P9303_04631 details

Gene Information

Locus tag: P9303_04631
Symbol: purB
ID: 4776521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 466458
End bp: 467753
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 56%
IMG OID: 640085967
Product: adenylosuccinate lyase
Protein accession: YP_001016480
Protein GI: 124022173
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
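
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 466458
end_bp = 467753

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1296 bp) and Protein Length (431 aa).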


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCGAGC GCTACACACT GCCCGAAATG GGCGAGATTT GGACCGACAG GGCCAAGTAT 
CAAAGCTGGC TAGATGTAGA GATCGCTGCT TGTGAGGCCA ACTGTCAACT GGGGAAGATC
CCAGAGGCTG AAATGCAGCA GATTCGTGAA CGCGCAACCT TCGAACCACA GCGCATCTTG
GAGATTGAGG CAGAGGTTCG CCACGACGTC ATCGCCTTCC TGACCAACGT TAATGAAAAC
GTAGGGGATG CCGGCCGCTA CATCCACGTC GGCATGACCA GCAGCGATGT GCTGGATACG
GGTCTGGCCC TGCAGTTAAA AAACTCTGTG GCATTGCTAC AACAAGAACT GGGCAGCCTT
CAAGAGGCGA TCCGCAGCTT GGCAGTGGAG CACAAGGGCA CAGTCATGAT CGGCCGCTCC
CATGCCATCC ATGGCGAACC AATCACCTTC GGTTTCAAAC TGGCCGGTTG GCTAGCAGAA
ACAATGCGCA ATGCCGAGCG ACTGGAGAGG CTGGAGAGGG ATGTGGCTGT AGGCCAGATC
AGTGGCGCCA TGGGCACCTA CGCCAACACG GATCCAAAGG TTGAGCAGCT CACATGCGAG
CGCCTTTGCC TCATCCCAGA CACCGCTAGT ACCCAGGTCA TCTCTCGCGA TCGTCATGCG
GACTATGTAC AGACCCTGGC ATTAGTGGGG GCGTCTCTAG ATCGATTCGC GACAGAGATC
CGCAACTTGC AGCGAACCGA TGTGCTGGAA GTGGAGGAGA GCTTTGCTAA GGGACAAAAG
GGAAGTTCGG CGATGCCACA CAAACGCAAC CCGATTCGGG CTGAGCGGAT TAGTGGTCTT
GCAAGGGTCC TACGCAGCTA TGTCGTCGCA GCACTCGAGA ACGTGGCCCT CTGGCATGAG
CGTGATATCA GTCACAGCTC CACTGAGCGA ATGATGCTGC CGGATTGCTC CGTCACACTC
CACTTCATGT TGCGAGAGAT GACCCAAGTC GTGCAGGGCC TTGGCGTCTA CCCAGCAAAC
ATGCGCCGCA ACATGAATAT CTATGGCGGC GTGGTGTTCA GTCAGCGGGT GCTATTGGCG
CTTGTTGAGA ACGGCATGAA CAGAGAAGAT GCCTACAGTG TTGTCCAGCG CAACGCCCAT
GCTGCGTGGA ATACCGAAGG GGGTAATTTC CGCGCCAATC TTGAGGCCGA TCCTGAAGTA
TCGACCCTTC TCAATGCCAA GGCGCTAGCC GAATGCTTCA GCACAGAGCT ACACCAAGCC
AACCTGGACG TGATCTGGCA ACGGCTCGGA CTCTGA
 
Protein sequence
MIERYTLPEM GEIWTDRAKY QSWLDVEIAA CEANCQLGKI PEAEMQQIRE RATFEPQRIL 
EIEAEVRHDV IAFLTNVNEN VGDAGRYIHV GMTSSDVLDT GLALQLKNSV ALLQQELGSL
QEAIRSLAVE HKGTVMIGRS HAIHGEPITF GFKLAGWLAE TMRNAERLER LERDVAVGQI
SGAMGTYANT DPKVEQLTCE RLCLIPDTAS TQVISRDRHA DYVQTLALVG ASLDRFATEI
RNLQRTDVLE VEESFAKGQK GSSAMPHKRN PIRAERISGL ARVLRSYVVA ALENVALWHE
RDISHSSTER MMLPDCSVTL HFMLREMTQV VQGLGVYPAN MRRNMNIYGG VVFSQRVLLA
LVENGMNRED AYSVVQRNAH AAWNTEGGNF RANLEADPEV STLLNAKALA ECFSTELHQA
NLDVIWQRLG L
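
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases of the gene sequence to illustrate this. The codon dictionary is deliberately partial (it covers only the codons in that prefix), the start-codon set is a subset of those permitted by table 11, and the function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons that occur in the first 30 bases.
CODONS = {
    "TTG": "L", "ATC": "I", "GAG": "E", "CGC": "R", "TAC": "Y",
    "ACA": "T", "CTG": "L", "CCC": "P", "GAA": "E", "ATG": "M",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short DNA prefix, reading a recognized start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First three 10-base groups of the gene sequence, as listed.
prefix = "TTGATCGAGC GCTACACACT GCCCGAAATG"
peptide = translate_prefix(prefix)
print(peptide)
```

The result corresponds to the first ten residues of the listed protein sequence, MIERYTLPEM.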