Gene A9601_00021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00021 
Symbol 
ID4716684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp2038 
End bp4377 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content32% 
IMG OID640077699 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001008397 
Protein GI123967539 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAATC ATGAAAATAA TGATCTATTT GATCTTAATG AAGCATTAAA AGTTGAAAAT 
TTAACACTTA ATGATTACGA AGAAATTTGC AAAAGATTAA AGAGAAAACC TAATAGAACG
GAATTAGGCA TGTTTGGCGT TATGTGGTCT GAACATTGTT GTTATAGAAA TTCAAAACCT
TTACTATCTA AGTTTCCTAC TAAAGGTAAA AATGTTTTAG TTGGACCTGG AGAAAATGCT
GGCGTTATTG ATGTTGGAAA TAATCAAAAA CTTGTTTTTA AAATAGAAAG TCATAATCAT
CCATCTGCTA TTGAACCTTT TCAAGGCGCA GCAACAGGTG TAGGAGGAAT TTTAAGAGAT
ATATTTACAA TGGGTGCAAG GCCAATAGCA GTGTTGAATT CATTGAGATT TGGTAACCTT
GATAAATCAT CAAATGTTGA TTTACTACGA GGAGTTGTAT CGGGTATTGC ACATTATGGA
AATTGTGTAG GGGTGCCTAC TGTTGGAGGT GAAATTGACT TCGATGATAG TTACTCTGGA
AATCCTCTAG TGAATGTCAT GGCTTTAGGA CTTTTAGAGA CCGAGGAAAT CGTTTGTTCT
GGAGCTAAAA ATGTAGGATC ACCAGTATTA TATGTTGGTA ATACAACTGG CAGAGACGGT
GTTGGAGGTG CTAGTTTTGC TAGTTCAGAA TTAACTACAA CTTCATTGGA TGATCGACCT
GCAGTTCAGG TAGGTGATCC ATTTATTGAG AAAAGTCTTA TTGAAGCTTG TTTGGATGCT
TTCAAGACAG GGGATGTAAT TGCAGCTCAA GATATGGGTG CTGCAGGTTT AACATGCAGT
AGCGCAGAAA TGGCCGCAAA TGGAAATTTA GGGATATCTA TTGATTTAGA TTTGGTCCCT
TCTAGAGAAG ATGATATGTC TTCATATCAA TATTTATTAT CTGAATCGCA AGAAAGAATG
TTGTTTGTGG TTAAGGAAGA AAAAATTAGT GATCTTATTG AAAAATTTAA TAAATGGGGA
TTATATGCCA GTGTTATTGG TGAAGTTATA GGAACTAATG AGGTAATTAT TTCTCATAAA
GGTAATATTG TGGCCCAAAT ACCTACTTCT GCCTTATCTG ATGATACTCC TGTAAATTTT
CACAATGTGA TTAATAATCC ACCCGACGAT CTTTTAAATA AATGGGAATG GAAAGAAAAT
GATTTACCAG AAATTAACGA GCAAAAAATA TTTTCATTGA AGGAAAATAA AAAATTTTCT
TTTTCAGAAA TCATTTTAAA GCTACTCTCT AATCCATCAA TAGCTTCTAA AAGATGGATT
TATAAACAAT ATGACTCTCA AGTACAAGCA AATACAGTAT TTACACCTGG AAAATCAGAT
GCAGCTGTAG TAAGACTAAG GGAACAAAAT AAAAAAAGTA AAAATAAAGT ATTTTCTGGT
GTTGCTGCTT CAGTTGATTG TAATAGTAGA TGGGTTGCGC TTGATCCTTT TAGAGGATCT
ATCGCTGCTG TAGCAGAATC AGCTAGAAAT GTTAGTTGTG TTGGTGCTGA ACCAGTAGCA
ATTACAAATA ATTTAAATTT TTCTTCTCCT GAGAATGAAA TAGGATATTG GCAACTCTCA
TCTTCATGTA ATGGAATTGC TGAAGCCTGT AAAGCTTTAG AAACTCCTGT TACAGGAGGT
AATGTATCTT TATATAATGA ATCAAAAAAT AAAGATAATC TAATTACTCC TATTAATCCT
ACTCCTGTTA TTGGAATGGT TGGAAAGATA GATAATGTCG AAAAAGCTAT AAGTAGTGAA
TGGAAAAATA TTGAAGATCA AATCTGGTTA ATTGGTTCTT ATAAATCAGA TACGACAACT
GCAGCTAGTT CTTATTTGGA ATATTTTCAT GGAGAAATTA CAGGTCGGCC TCCAAAAATA
GATTTGTCGG ATGAAAAGTT TTGTCAGAGT TTTTTAAGAA ATGCGATTTT AAACAGTCTT
GTAGTTTCTT CTCACGATAT AAGTGACGGA GGTTTAGCTA TAGCTTTAGC AGAGTCTTGT
ATTTTGTCCG CAAGGGGTGC AACTATAGAA TTAGAGAAAG ATTTAAATAG AGTTGATAAT
TTATTATTTG CCGAAGGGGG GTCAAGAATT ATTTTTTCAA TTAGTAAAAT GAAACAAAAT
GAATGGTTTA ATTATTTAAA ACAAAATCAA ATAAATTTTC CATCAAGTGT TTATGTAAAA
AAAATAGGAT ACGTATCTAG TGATACGCTG AAGATAAAAA TCAACGAAAA AAATATTTGC
AATATTAGGG TTGAGGAATT AACCGAAAAA TTTAATAATA GTATTTCAGA TTACTTTTAA
 
Protein sequence
MINHENNDLF DLNEALKVEN LTLNDYEEIC KRLKRKPNRT ELGMFGVMWS EHCCYRNSKP 
LLSKFPTKGK NVLVGPGENA GVIDVGNNQK LVFKIESHNH PSAIEPFQGA ATGVGGILRD
IFTMGARPIA VLNSLRFGNL DKSSNVDLLR GVVSGIAHYG NCVGVPTVGG EIDFDDSYSG
NPLVNVMALG LLETEEIVCS GAKNVGSPVL YVGNTTGRDG VGGASFASSE LTTTSLDDRP
AVQVGDPFIE KSLIEACLDA FKTGDVIAAQ DMGAAGLTCS SAEMAANGNL GISIDLDLVP
SREDDMSSYQ YLLSESQERM LFVVKEEKIS DLIEKFNKWG LYASVIGEVI GTNEVIISHK
GNIVAQIPTS ALSDDTPVNF HNVINNPPDD LLNKWEWKEN DLPEINEQKI FSLKENKKFS
FSEIILKLLS NPSIASKRWI YKQYDSQVQA NTVFTPGKSD AAVVRLREQN KKSKNKVFSG
VAASVDCNSR WVALDPFRGS IAAVAESARN VSCVGAEPVA ITNNLNFSSP ENEIGYWQLS
SSCNGIAEAC KALETPVTGG NVSLYNESKN KDNLITPINP TPVIGMVGKI DNVEKAISSE
WKNIEDQIWL IGSYKSDTTT AASSYLEYFH GEITGRPPKI DLSDEKFCQS FLRNAILNSL
VVSSHDISDG GLAIALAESC ILSARGATIE LEKDLNRVDN LLFAEGGSRI IFSISKMKQN
EWFNYLKQNQ INFPSSVYVK KIGYVSSDTL KIKINEKNIC NIRVEELTEK FNNSISDYF