Gene NATL1_00021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00021 
Symbol 
ID4780626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp2124 
End bp4535 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content34% 
IMG OID640083265 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001013831 
Protein GI124024715 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTAT CCTTTGAAAA TAAAGATTTC AACCAATTCA TTAATTCATC AAAATTTCTT 
GTTGAATATG ATGTTATGTC CGCTCTTAAG CAAGAAGGGT TAAAACAATC TGATTATGTA
GAAATTTGTA GAAGACTTAA TCGGGGTCCA AATAGAAATG AGTTAGGAAT GTTTGGTGTT
ATGTGGTCTG AACATTGTTG CTATCGAAAC TCTCGTCCTT TACTAAAAAA CTTGCCTACA
ACAGGTAGTC GAATCCTTGT AGGGCCTGGA GAAAATGCAG GCGTAGTTGA TATTGGTTTT
GGACAAAGAT TAGTTTTTAA AATTGAAAGT CATAACCATC CATCAGCCGT CGAGCCTTTT
CAAGGAGCAG CAACTGGTGT AGGTGGAATT CTTAGAGATA TTTTTACTAT GGGAGCTAGG
CCAATAGCCT TATTGAATGC TTTAAGGTTT GGTCCATTAG ATGATGAAAA AAATATTAGT
TTATTGGAAG GTGTCGTTGC AGGTATTTCT CACTATGGAA ATTGCGTAGG AGTTCCTACT
ATTGGAGGAG AAGTGGGATT TGATAGTAGT TATTCAGGTA ACCCCTTAGT TAATGTAATG
GGGTTGGGTT TGATGGAAAC AGAAGAAATA GTTTGTTCAG GTGCCAGTGG AATAGACTTT
CCAGTTCTTT ACGTAGGAAA TACAACTGGT AGAGATGGTA TGGGAGGAGC AAGCTTTGCT
AGTTCTGAAC TTTCTAAAAC TTCAATAGAT GATCGACCAG CCGTTCAAGT TGGTGATCCT
TTTCTTGAGA AAGGATTAAT AGAAGCTTGT TTAGAGGCTT TTAAAACAGG ATATGTAATT
GCTGCACAGG ATATGGGAGC GGCGGGGTTG ACTTGTAGTT GTTCTGAAAT GGCTTCAAAA
GGTGAGGTAG GAATTGAATT GAATCTTGAC CTTGTTCCGG CTAGGGAAAA GGGCATGACT
GCATATGAAT TTTTGTTGTC TGAATCACAA GAACGAATGC TGTTTGTTGT GAAGCCTGGT
TCAGAAGAAG AATTGAGAGA ATTATTTATA AGATGGGGAT TATATGTTGA AGTTGTTGGA
AAAGTTTTAA AAGAAAAAGT TGTTAGAGTG ATTCATAAAG GTGAAGTTGT AGCAAATCTT
CCCGCATCTG CGCTTGCTGA TGATACGCCT ATAGAAGAAC ATTTATTAAT AAATTCAACA
CCTGAATATT TGCAAGAACA TTGGAAATGG ACTGAAGATT TATTACCTAA AACTTTAGAT
AATGGAATTA TTAATATAAA GAATAATTTA TTTATTAGTT GGAATAATGT TCTATTAGAT
TTATTAAGTA TGCCTTCAAT AGCTTCAAAA AATTGGATTT ATAAACAATA TGATTATCAA
GTACAATCTA ATACAGTTGT TTCTCCTGGA GAAGCTGATG CTGCTGTTAT TAGGATTCGA
TCTCAAAATG ACTTTTTAAC CAAGCCAAAG AAAGATAGAG GAATAGCTTC AGTCGTAGAT
TGTAATGATA GATGGGTTTA TTTAGATCCT CTAAGAGGAA GTATGTCTGC TGTCGCAGAA
GCAGCTAGGA ATTTAAGTTC TGTGGGAGCT GAACCTATAG CAATTACAAA TAATTTAAAT
TTTTCTTCTC CAGACAAACC AGTTGGTTTT TGGCAATTAT CAATGTCATG TGAAGGGATA
ACTAAAGCTT GTTTAGCATT GAGTACTCCT GTTACAGGAG GAAATGTGTC ATTATATAAT
GATACTAAAT TGCAAAATAA TACCGTTTTA CCTATACATC CAACTCCAGT AATTGGTATG
GTTGGGTTAA TAGAAGATAT CAATAAAATT TGTAAAAAAT CTTGGGTTAA AGCTGAAGAT
CAAATATGGA TGATTGGGTT ACCTTTAGAA AATAATATAA ACCAAGATGA AAGAATTTCA
CTATCTGCTT CTTCTTTTTT AGAGTATATT CATGGATTGA AAACAGGAAG ACCTCCTGAA
ATAGATTTAA ATTTAGAAAA ACAAGTTCAT GCATTTCTAA GAGAAGTAAT AAAACAAGGT
ATTGTTAATT CTGCTCATGA TTTAGGTGAT GGTGGTCTGG CAGTTGCTAT AGCTGAATGT
TGTATCTCTT CTGGCTATGG AGCAAATATT ATTCTCCCTC CTAGCCAATC AAGGTTAGAT
AGACTTTTAT TTGCTGAAGG AGGCGCAAGA GTTTTAGTGA GTTGTTCAAC TGATCAAAGC
GAAGAATTAA AGAAATATTA TAAAAATATT TCCTTACAAG GATCTAATCT TTTTTCAATA
TCTCATTTAG GTAATGTAAA TAATCAAAAA AAACTATTAG TGTCTCAATC GAATAATACA
ATTATAGATG TTAATATTCT AGACTTGAAA GATACATATA AAGATGCCAT CCATAAAAAA
ATAACTAAAT AA
 
Protein sequence
MTVSFENKDF NQFINSSKFL VEYDVMSALK QEGLKQSDYV EICRRLNRGP NRNELGMFGV 
MWSEHCCYRN SRPLLKNLPT TGSRILVGPG ENAGVVDIGF GQRLVFKIES HNHPSAVEPF
QGAATGVGGI LRDIFTMGAR PIALLNALRF GPLDDEKNIS LLEGVVAGIS HYGNCVGVPT
IGGEVGFDSS YSGNPLVNVM GLGLMETEEI VCSGASGIDF PVLYVGNTTG RDGMGGASFA
SSELSKTSID DRPAVQVGDP FLEKGLIEAC LEAFKTGYVI AAQDMGAAGL TCSCSEMASK
GEVGIELNLD LVPAREKGMT AYEFLLSESQ ERMLFVVKPG SEEELRELFI RWGLYVEVVG
KVLKEKVVRV IHKGEVVANL PASALADDTP IEEHLLINST PEYLQEHWKW TEDLLPKTLD
NGIINIKNNL FISWNNVLLD LLSMPSIASK NWIYKQYDYQ VQSNTVVSPG EADAAVIRIR
SQNDFLTKPK KDRGIASVVD CNDRWVYLDP LRGSMSAVAE AARNLSSVGA EPIAITNNLN
FSSPDKPVGF WQLSMSCEGI TKACLALSTP VTGGNVSLYN DTKLQNNTVL PIHPTPVIGM
VGLIEDINKI CKKSWVKAED QIWMIGLPLE NNINQDERIS LSASSFLEYI HGLKTGRPPE
IDLNLEKQVH AFLREVIKQG IVNSAHDLGD GGLAVAIAEC CISSGYGANI ILPPSQSRLD
RLLFAEGGAR VLVSCSTDQS EELKKYYKNI SLQGSNLFSI SHLGNVNNQK KLLVSQSNNT
IIDVNILDLK DTYKDAIHKK ITK