Gene P9211_00021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00021 
Symbol 
ID5731464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp2130 
End bp4544 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content34% 
IMG OID641284344 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001549887 
Protein GI159902543 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.327511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00549488 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTAGAAA ACTCCTCAAT CGAATTTGAA AATTTAGTCA AATTATCTTC TTTTAAAACC 
GATTATGATG TTTTAAGTTC TTTATCACAT GAAGGATTAA AAAAGGAAGA TTACATTGAA
ATCTGCAAAC GTCTAGAACG AGCTCCAAAC CGGACTGAAT TGGGAATGTT TGGAGTTATG
TGGTCTGAAC ATTGTTGTTA TAGAAATTCA AAACCATTAT TACAAAATTT GCCTACTTCA
GGTAAGCGTA TTCTTGTAGG TCCTGGAGAA AATGCTGGAG TTGTTGATAT AGGAGATGGT
CAAAGGATAG CGTTTAAAAT AGAAAGTCAT AACCATCCTT CTGCAGTTGA ACCATTTCAA
GGTGCTGCTA CTGGAGTTGG GGGGATTTTA CGTGATATTT TTACAATGGG TGCACGTCCT
ATTGCTTTAT TAAATGCACT TAGGTTTGGT TTGCTAGAAG ATAAAAAAAA TGTTGGCTTA
TTAGAGGGTG TTGTTGCTGG AATAGCGCAT TATGGCAACT GTGTTGGTGT ACCAACAATT
GGAGGTGAGA CATCTTTTAA TACGAGCTAT TCAGGTAATC CTTTAGTGAA TGCTATGGCT
ATTGGCTTAC TAGAAACAAA AGATATTGTT TGTTCTGGAG CAAAAGGTAT CGATTATCCA
GTCATTTATG TTGGTAGTAC TACAGGAAGA GATGGAATGG GAGGTGCTAG TTTTGCTAGT
GCGGAATTAA GTAACACTTC TCTTGATGAT CGTCCAGCTG TTCAAGTTGG TGACCCATTT
CTTGAGAAAG GTTTAATTGA AGGATGTTTA GAAGCTTTTA AAACAGGTAA TGTTATAGCT
GCTCAAGATA TGGGAGCTGC AGGACTTACT TGTAGTTGCT CTGAAATGGC TGCGAAGGGA
GAAGTAGGTA TTGAGTTGGA TTTGGACCTT GTTCCTGCTC GTGAACAAGG TATGACACCA
TATGAATTTT TACTTTCAGA ATCTCAAGAA AGAATGCTTT TTGTAGTTAA GCCTGGCAAA
GAAAATGATG TGATGAAAAT TTTTACAAAT TGGGGACTAA AAGCTCAAGT TGTAGGAAAA
GTTCTCAAGG AAAGAGTTGT AAGAGTTTTA TATCAGAAAA AAATAGTTGT AGATTTACCA
GCTGACGCTT TAGCTGAAGA TACTCCAGTT AATGAACATT CTTTACTATC TAATCCTCCA
AAATATATCT TAGATCATTG GAAATGGACT GAAGAAAACC TTCCTTTTTC TAATGAAAAA
GGTATAACAG TGCAAGATAA AATAAAAAAA GACAGATTTT TTACATGGAA TAATATCTTA
TTAAAATTAT TAGATGATCC AACAATTGCT TCTAAGAAAT GGATATATAA TCAGTATGAT
TACCAGGTTC AGAATAATAC AATTATTCCA CCTGGAGCTG CTGATGCGGC TGTTATTCGC
GTAAGAGATA TAGATTCAAA AAAAAGTGAT AAATTGATTA ATAAAGGTTT AGCAGCAGTG
GTTGATTGTC CTAATCGTTG GGTTGCTTTA GATCCTGAAC GTGGCGCTAT AGCAGCAGTT
GCTGAAGCAT CAAGGAATAT TTCTTGTGTT GGAGGAGAAC CTTTGGCTTT AACTGATAAT
TTGAATTTCT CATCTCCTGA TCATGCTATC GGTTATTGGC AATTAGCTAA AGCTTGTGAA
GGTATTTCTA AAGCATCCTT ACATCTTGAT ACTCCTGTAA CTGGAGGAAA TGTTTCCTTA
TATAATGAGA TACGTCTTTC TACTGGAGAA ATACAACCTA TTCAACCAAC ACCAGTCATA
GGAATGATAG GTTTAATAGA CGATATTAAT ACTATTGTAG GTCAATCATG GATTAATGAA
GGTGATCTTA TTTGGTTATT AGGTGTTCCT TTAGAAGCAT CATCATTATT AGATAATAGA
ATTAGCTTAT CTTGTACTGC ATATTTAGAG AATATTTTTA ATTTACATAC AGGCAGACCC
CCTGAAATAG ATTTGAATTT GGAAAAATTA ATACAATCAT TTCTCCGAAA ATCAATTTCT
AATCAATTAA TACTCTCTGC TCATGATGTC AGTGATGGAG GTATAGCGAC TGCTTTAGCA
GAATCTGTTA TATCTTCTGG ATTAGGAGCA AAATGTATCT TTCCTAATAC TTCAAATAGA
ATTGATAGTT TATTATTTGC TGAAGGTGGA TCTAGAATTG TTATAAGCAT TTCTCCTAGT
AAACTTTCTG AATGGAAATT AAATTTAAGG ACTTTTGCTA GAGAGAATAG TTTTTCCATA
CCTGCTATGC AAATAGGTCA TGTTCAACGT GACCCTAGTC TTTCCATTTC TCAAGCTAAT
GTTGAATTAA TACAATTATC AATTTCTCAA TTAGCATCTT CATTTAATAA TGCAATTCCT
CGTAGAATGT CGTGA
 
Protein sequence
MLENSSIEFE NLVKLSSFKT DYDVLSSLSH EGLKKEDYIE ICKRLERAPN RTELGMFGVM 
WSEHCCYRNS KPLLQNLPTS GKRILVGPGE NAGVVDIGDG QRIAFKIESH NHPSAVEPFQ
GAATGVGGIL RDIFTMGARP IALLNALRFG LLEDKKNVGL LEGVVAGIAH YGNCVGVPTI
GGETSFNTSY SGNPLVNAMA IGLLETKDIV CSGAKGIDYP VIYVGSTTGR DGMGGASFAS
AELSNTSLDD RPAVQVGDPF LEKGLIEGCL EAFKTGNVIA AQDMGAAGLT CSCSEMAAKG
EVGIELDLDL VPAREQGMTP YEFLLSESQE RMLFVVKPGK ENDVMKIFTN WGLKAQVVGK
VLKERVVRVL YQKKIVVDLP ADALAEDTPV NEHSLLSNPP KYILDHWKWT EENLPFSNEK
GITVQDKIKK DRFFTWNNIL LKLLDDPTIA SKKWIYNQYD YQVQNNTIIP PGAADAAVIR
VRDIDSKKSD KLINKGLAAV VDCPNRWVAL DPERGAIAAV AEASRNISCV GGEPLALTDN
LNFSSPDHAI GYWQLAKACE GISKASLHLD TPVTGGNVSL YNEIRLSTGE IQPIQPTPVI
GMIGLIDDIN TIVGQSWINE GDLIWLLGVP LEASSLLDNR ISLSCTAYLE NIFNLHTGRP
PEIDLNLEKL IQSFLRKSIS NQLILSAHDV SDGGIATALA ESVISSGLGA KCIFPNTSNR
IDSLLFAEGG SRIVISISPS KLSEWKLNLR TFARENSFSI PAMQIGHVQR DPSLSISQAN
VELIQLSISQ LASSFNNAIP RRMS