Gene P9211_05071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_05071 
SymbolpurA 
ID5730549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp474314 
End bp475627 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content43% 
IMG OID641284866 
Productadenylosuccinate synthetase 
Protein accessionYP_001550392 
Protein GI159903048 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0104] Adenylosuccinate synthase 
TIGRFAM ID[TIGR00184] adenylosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.22054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAAATG TTGTAGTCAT AGGTGCTCAA TGGGGTGACG AGGGGAAAGG GAAGATCACA 
GATTTACTAA GTAGATCTGC AGATGTTGTA GTGCGTTACC AAGGTGGTGT TAATGCGGGC
CACACAATAG TTGTAGAAGA CAAGGTACTT AAGCTTCATC TAATTCCTTC TGGCATTTTG
TATCCAGAAA CGATTTGCCT AATTGGATCA GGAACAGTAG TCGATCCCAA GGTAATGCTT
AAGGAGATCG AAATGCTGAT TGAAAACGAT ATTGATATTT CAGGTCTTCA ACTTGCTTCC
ACGGCCCACG TAACAATGCC ATATCACCGA TTAATAGATC AAGCAATGGA AAAGCGAAGA
GGTGAGCAGA AGATAGGGAC AACAGGCCGA GGGATTGGTC CTACTTATGC AGACAAAGCT
CAAAGGAATG GGATTCGAGT TATTGACTTG TTAGATGAAC AAAAGTTAAG GGAAAGGCTA
AGGATCCCTC TTGCAGAAAA AAACAATGTA CTTCAGAAAA TTTATAAAGA GCTCCCTCTA
GACCAAGAAA AGGTCATTGA AGAGTATTTA GAGTATGGAG ATCGCCTAAG GCCGCATGTA
GTCGATTGTT CTAGGGCAAT CCACCAAGCT GCCCGTAATC GAAAAAATAT TCTTTTTGAA
GGAGCTCAAG GAACTCTCCT AGATCTTGAC CACGGAACAT ATCCATTCGT AACATCTTCC
AATCCTGTCT CAGGAGGCGC TTGTATTGGA GCAGGAGTTG GACCAACATT AATAGACAGA
GTCATTGGGG TAGCAAAGGC CTATACAACT AGAGTTGGAG AAGGTCCTTT CCCTACTGAG
CTAGAAGGCA GTTTAAATGA CCAACTTTGT GACAGGGGTG GAGAATATGG AACAACTACA
GGGCGTCGAA GAAGGTGTGG TTGGTTTGAC GGAGTAATTG GCAAATATGC TGTGGAAGTA
AATGGGTTGG ACTGTATCGC CATCACCAAA CTTGACGTAC TTGACGAGCT TGAGGAAATT
AAAGTGTGCG TGGCTTACCA ATTAGACGGA CAAAAAATCG AATATTTCCC AAGTAGTGCG
GAAGATTTTA GTAGATGCAA ACCAATCTTT AAATCATTAC CTGGCTGGAA ATCCTCAACA
GCAGAGTGCA AACGCCTAGA GGATCTACCA CCATCTGCCA TGGCCTATCT GAGATTCCTA
GCAGAGCTTA TGGAAGTCCC AATTGCAATA GTCTCACTAG GAGCAAGTAG AGACCAAACT
ATTGTTGTAG AAGATCCAAT CCATGGACCC AAACGTGCTT TATTAAATAT ATAA
 
Protein sequence
MANVVVIGAQ WGDEGKGKIT DLLSRSADVV VRYQGGVNAG HTIVVEDKVL KLHLIPSGIL 
YPETICLIGS GTVVDPKVML KEIEMLIEND IDISGLQLAS TAHVTMPYHR LIDQAMEKRR
GEQKIGTTGR GIGPTYADKA QRNGIRVIDL LDEQKLRERL RIPLAEKNNV LQKIYKELPL
DQEKVIEEYL EYGDRLRPHV VDCSRAIHQA ARNRKNILFE GAQGTLLDLD HGTYPFVTSS
NPVSGGACIG AGVGPTLIDR VIGVAKAYTT RVGEGPFPTE LEGSLNDQLC DRGGEYGTTT
GRRRRCGWFD GVIGKYAVEV NGLDCIAITK LDVLDELEEI KVCVAYQLDG QKIEYFPSSA
EDFSRCKPIF KSLPGWKSST AECKRLEDLP PSAMAYLRFL AELMEVPIAI VSLGASRDQT
IVVEDPIHGP KRALLNI