Gene NATL1_12301 details


Gene Information

Locus tag: NATL1_12301
Symbol: purT
ID: 4781016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1071892
End bp: 1073055
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 35%
IMG OID: 640084509
Product: phosphoribosylglycinamide formyltransferase 2
Protein accession: YP_001015053
Protein GI: 124025937
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)
TIGRFAM ID: [TIGR01142] phosphoribosylglycinamide formyltransferase 2
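
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1071892, 1073055)
    print(length)                  # 1164
    print(protein_length(length))  # 387
```

Both results agree with the Gene Length and Protein Length fields of this record.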


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.795061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGTTT TTCCAAAGAA AATAATGCTT TTAGGTAGTG GAGAATTGGG GAAAGAGGTA 
GCAATAGCAG CAAAGAGATT AGGTTGTTAT GTTATTGCAT GTGATAGGTA CAATGATGCT
CCAGCAATGC AAATAGCTGA TCAATTTAAT GTATTTAATA TGAATAATGG TAGTGAATTA
AAAGAAGTGA TATATAAATG TAATCCTGAT ATTATTATTC CTGAAATTGA AGCTCTTGCA
GTTGATGTAC TAAAGGAAAT CGAACAAAAG ATAACAGTAA TACCCAACTC AAGAGCTACT
GCAACAACAA TGAATAGAGA TAAAATTAGA GATTTGGCTT CTAACGAATT AAATATAAGA
ACTGCAAAAT TTAGTTATGC GGTTAATCAA TCTGAGCTTG ATTTGCATGC AGAGACGATT
GGGTATCCTT TATTAATTAA GCCAGTCATG AGTTCTTCGG GTAAAGGGCA AAGCTTAGTT
AAAAATAAGA ATGAATTAGC ACAGGCTTGG AATTTGGCTA TTGAAAAATC AAGAGGTAAA
TCAAATAAAA TCATATTAGA AGAATTTATT GATTTTGATT TGGAAATTAC TTTATTAACT
ATTAGACAAT CTAATGGTAA GACATTATTT TGTGCTCCTA TAGGACATGA GCAAAAAAAT
GGAGATTATC AATGTAGTTG GCAACCAGCT GAATTAACTG AAAGTGTCTT AGAGAAGGCT
CAACAGATTG CCAAACGTGT TACCGATAAT CTTGGTGGAG TAGGTTTGTT TGGTGTTGAA
TTTTTTATTA AGGGTGAGGA GGTGATTTTT TCAGAGCTAT CACCAAGACC GCATGACACA
GGCTTAGTAA CTTTGATTAG CCAAAATTTA AATGAGTTTG AATTACACCT TAGAGCTGTT
TTAGGTATCC CAATCCCAGA AATAGTTTGC CATGAAGCTT CTGCAAGTAG AGTGATTTTG
GCTTCCATGG AAACCACAGA CGTTGCTTTT ACGGGACTTG AGCAAGCCTT AAGCCAATCG
AATACAAATG TATTCATGTT TGGCAAGCCG AGCTCAACGG AAGGTCGTCG AATGGGAGTA
GCCGTAGCAA AGGCTGAAAC AATCGATGAG GCAAGAATTA AAGCCGACAA TGCAGCGCAA
TCAATCCAAT TCATAAATGA ATAA
 
Protein sequence
MNVFPKKIML LGSGELGKEV AIAAKRLGCY VIACDRYNDA PAMQIADQFN VFNMNNGSEL 
KEVIYKCNPD IIIPEIEALA VDVLKEIEQK ITVIPNSRAT ATTMNRDKIR DLASNELNIR
TAKFSYAVNQ SELDLHAETI GYPLLIKPVM SSSGKGQSLV KNKNELAQAW NLAIEKSRGK
SNKIILEEFI DFDLEITLLT IRQSNGKTLF CAPIGHEQKN GDYQCSWQPA ELTESVLEKA
QQIAKRVTDN LGGVGLFGVE FFIKGEEVIF SELSPRPHDT GLVTLISQNL NEFELHLRAV
LGIPIPEIVC HEASASRVIL ASMETTDVAF TGLEQALSQS NTNVFMFGKP SSTEGRRMGV
AVAKAETIDE ARIKADNAAQ SIQFINE
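
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the sequences are pasted in as shown (10-character groups separated by spaces) and contain only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard codon layout is used here. The names `clean`, `gc_content` and `translate` are hypothetical helpers, not functions from an existing library.

```python
# Standard codon-to-amino-acid layout (NCBI ordering, bases in T, C, A, G order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence in this record.
    first_row = "ATGAATGTTT TTCCAAAGAA AATAATGCTT TTAGGTAGTG GAGAATTGGG GAAAGAGGTA"
    print(translate(first_row))  # MNVFPKKIML LGSGELGKEV without the spaces
    print(round(gc_content(first_row), 2))
```

Applying `translate` to the full gene sequence and comparing the result with the protein sequence (after `clean`) gives a quick consistency check of the record, and `gc_content` on the full gene sequence can be compared with the GC content field.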