Gene P9301_00021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_00021 
Symbol 
ID4912392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp2040 
End bp4379 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content33% 
IMG OID640159566 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001090226 
Protein GI126695340 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAATC AAGAAAATAA TGATCTATAT GATCTTAATG AAGCACTACA AGTTGAAAAT 
TTAACACTTA ATGATTACGA AGAAATTTGC AAAAGATTAA AGAAAAAACC TAACAGAACA
GAATTAGGCA TGTTTGGCGT TATGTGGTCC GAACATTGTT GTTATAGAAA TTCAAAACCT
TTACTATCTA AGTTTCCCAC TAAAGGTAAA AATGTTTTAG TTGGACCTGG AGAAAATGCT
GGAGTTATTG ATGTTGGAAA TAATCAAAAA CTTGTTTTTA AAATAGAAAG TCATAATCAT
CCTTCTGCTA TTGAACCTTT TCAAGGAGCA GCAACAGGGG TAGGAGGAAT TCTAAGAGAT
ATATTTACAA TGGGTGCAAG GCCCATTGCA GTCTTGAATT CATTGAGATT CGGCAATCTT
GATAAATCAT CAAATGTTGA TTTACTACGA GGAGTTGTAT CCGGTATTGC ACATTATGGA
AATTGCGTAG GTGTGCCGAC TGTTGGAGGT GAAATTGACT TTGATGATAG TTACTCTGGA
AATCCTTTAG TGAATGTTAT GGCTTTAGGA CTTTTAGAGA CCGAAGAAAT CGTTTGTTCT
GGAGCTAAAA ATGTAGGATC ACCAGTCTTA TATGTTGGTA ATACAACTGG CAGAGATGGT
GTTGGTGGTG CTAGTTTTGC TAGTTCAGAA TTAACTACAA CTTCATTGGA TGATAGACCT
GCAGTTCAGG TAGGTGATCC ATTTGTTGAG AAAAGTCTTA TTGAAGCTTG TTTGGATGCT
TTCAAGACAG GGGATGTTAT TGCAGCTCAA GATATGGGTG CTGCAGGTTT AACATGCAGT
AGCGCGGAAA TGGCTGCAAA TGGAAATTTA GGGATTTCAA TTGATTTAGA TTTGGTCCCT
TCCAGAGAAG ATGATATGTC TTCATACCAA TATTTATTAT CTGAATCGCA AGAAAGAATG
TTGTTTGTCG TTAAAGAAGA AAAAATTAAT GATCTTATTG AAAAATTTAA TAAATGGGGA
TTATATGCCA GTGTTATTGG TGAAGTGATA GGAACTAATG AGGTAATTAT TTCTCATAAA
GGTAAAATTG TGGCTCAAAT ACCTACTTCT GCCTTATCTG ATGATACTCC TGTAAATTTC
CATAATGTGA TTAATAACCC ACCCGACGAT CTTTTAAATA AATGGGAATG GAAAGAAAAT
GATTTACCAG AAATTCATGA GCAAAAAATA TTTTCATTGA AGGAAAATAA GAAATTTTCT
TTTCAAGAAA TCATTTTAAA ACTACTTTCT AATCCATCAA TAGCTTCTAA ACGATGGATT
TATAAACAAT ATGACTCTCA AGTTCAAGCA AATACAGTTT TTAAACCTGG AAAATCAGAT
GCAGCCGTAG TAAGACTAAG GGAACAATAT AAAAAAAATA AAAGTAAAGT ATTTTCTGGT
GTCGCTGCTT CAGTTGATTG CAATAGTAGA TGGGTTGCGC TTGATCCTTT TAGAGGATCT
ATCGCTGCCG TTGCAGAGTC CGCTAGAAAC GTTAGTTGTG TTGGTGCTGA ACCTGTAGCA
ATTACAAATA ATTTAAATTT TTCTTCCCCT GAGAGTGAAA TAGGATATTG GCAACTCTCA
TCTTCATGTA ATGGAATTAC AGAAGCTTGT AAAGCTTTAG AAACTCCTGT TACAGGAGGT
AATGTATCTT TATATAATGA ATCTAAAAAC AAAGATAATC TAATTACTCC TATCAATCCT
ACTCCTGTTA TTGGAATGGT TGGAAAGATA GATAATATCG AAAAAGCTAT AAGTTGTGAA
TGGAAAAATA TTGATGATCA AATCTGGTTA ATTGGCTCCT ATAAATCAGA TACGACAATT
GCAGCTAGTT CTTATTTGGA ATATTTTCAT GGAGAAATTA CTGGTCGGCC TCCAAAAATA
GATTTGTTGG ATGAAAAGTT TTGCCAGAGT TTTTTAAGAA ATGCGATTTT AAATAGTCTT
GTAGTTTCTT CTCACGATAT TAGCGATGGC GGTTTAGCTA TAGCTTTAGC AGAGTCTTGT
ATTTTATCTG CAAAAGGTGC AACGATAGAA TTAAAAAAAG ATATAAACAG AGAAGATAAT
TTATTGTTTG CAGAAGGAGG TTCTAGAATT ATTTTTTCAA TAAACAAAAT TAAACAAAAT
GAATGGCTTA ATTATTTAAA ACAAAATCAA ATAAACTTTC CATCAAGTGT ATATGTAAAA
AAAATAGGAC ATGTGTCTAG TGAAACTCTT AAGATAAATA TCCAAGATAA AAATATTTGC
AATATTAGGG TTGAGGAATT ATCTGAAAAA TTTAATAATA GTATTTCAGA TTACTTTTAA
 
Protein sequence
MINQENNDLY DLNEALQVEN LTLNDYEEIC KRLKKKPNRT ELGMFGVMWS EHCCYRNSKP 
LLSKFPTKGK NVLVGPGENA GVIDVGNNQK LVFKIESHNH PSAIEPFQGA ATGVGGILRD
IFTMGARPIA VLNSLRFGNL DKSSNVDLLR GVVSGIAHYG NCVGVPTVGG EIDFDDSYSG
NPLVNVMALG LLETEEIVCS GAKNVGSPVL YVGNTTGRDG VGGASFASSE LTTTSLDDRP
AVQVGDPFVE KSLIEACLDA FKTGDVIAAQ DMGAAGLTCS SAEMAANGNL GISIDLDLVP
SREDDMSSYQ YLLSESQERM LFVVKEEKIN DLIEKFNKWG LYASVIGEVI GTNEVIISHK
GKIVAQIPTS ALSDDTPVNF HNVINNPPDD LLNKWEWKEN DLPEIHEQKI FSLKENKKFS
FQEIILKLLS NPSIASKRWI YKQYDSQVQA NTVFKPGKSD AAVVRLREQY KKNKSKVFSG
VAASVDCNSR WVALDPFRGS IAAVAESARN VSCVGAEPVA ITNNLNFSSP ESEIGYWQLS
SSCNGITEAC KALETPVTGG NVSLYNESKN KDNLITPINP TPVIGMVGKI DNIEKAISCE
WKNIDDQIWL IGSYKSDTTI AASSYLEYFH GEITGRPPKI DLLDEKFCQS FLRNAILNSL
VVSSHDISDG GLAIALAESC ILSAKGATIE LKKDINREDN LLFAEGGSRI IFSINKIKQN
EWLNYLKQNQ INFPSSVYVK KIGHVSSETL KINIQDKNIC NIRVEELSEK FNNSISDYF