Gene A9601_10931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_10931 
SymbolpurT 
ID4717804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp934894 
End bp936069 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content31% 
IMG OID640078808 
Productphosphoribosylglycinamide formyltransferase 2 
Protein accessionYP_001009484 
Protein GI123968626 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) 
TIGRFAM ID[TIGR01142] phosphoribosylglycinamide formyltransferase 2 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.673369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAT CAATTTTTTC TAAAAAGAGA ATTTTATTAC TTGGTAGTGG CGAGCTTGGA 
AAAGAATTAG TAATAGAATC CAAAAGATTA GGATTAGAAG TCATTGCAAT TGATCGATAT
GAAAAAGCTC CTGCTATGCA AGTTGCTGAT TATTCAAGAG TAATTGAAAT GGGAGATAAA
AATATTTTAA AAAATGTAAT AAAAGAATTT AAGCCTGACT ATGTTGTCCC AGAAATAGAG
GCACTTTCAA TTGAAGCCCT AAAAGAACTC GAGGATGAAG GATTCAATAT TGTTCCCAAT
GCTAGAACTG TAGAAATTAC AATGAATAGA GATAAAATTA GAGACTTAGC TTCTAAAGAT
TTAAAAATTA AAACTGCAAA GTTTGATTAT ATTTTTGAAT TTGATGATTT AGAAAAAAAA
GCAGATGAAA TTGGATTCCC ACTTTTACTT AAACCTTTAA TGAGCTCTTC AGGAAAAGGG
CAAAGTTTGG TTGAAACAAA AAATGATTTA CAAAATGCTT GGAAACAGGC ACAAGCAAAT
TCAAGAGGAA AGGTTAAAGG TGTAATTATT GAAGAATTTA TTAATTTTGA TTTTGAGTTT
ACTCTTTTAA CTGTAAGAAA AGAAAATGGT GAAAATATTT TTTGTTTACC AATTGGACAT
CTTCAATCTA ATGGAGACTA TCAATGTAGT TGGCAACCTT TAGAGATCAA GGAGTCCTTA
ATTATTGAAG CTAAGAGAAT GACTAGTAGA ATATTAAATA ACCTTAATGG AGCTGGATTA
TACGGAGTAG AATTTTTTAT AAAAGGAAGT GAGGTTATCT TTTCAGAATT ATCTCCAAGA
CCTCACGACA CTGGTATGGT TACATTAGTT AGTCAAAATA TTAATGAATT TGAATTACAT
TTAAGGGCTT TTTTAAATTT ACCAATACCG CGTATCGATC TAATAGAGCC CTCTGCAACC
AGAGTTATAC TCTCTAACCA AGAGTATCTA AATCCTATTT ATGAGGGTCT TTATGAAGCA
TTAGAATTTG AAAAGACCAA AGTGCTCATA TTTGGCAAAC CAGTTTCCAG AAAAGGCAGA
AGAATGGGTG TTGTTCTCTC TTCAAATACT GACATAAATT TGGCCAGAAA AAATGCAGAT
GAAGCTGCTC TTAAAATAAA AGTCAGTACT ACATAA
 
Protein sequence
MKESIFSKKR ILLLGSGELG KELVIESKRL GLEVIAIDRY EKAPAMQVAD YSRVIEMGDK 
NILKNVIKEF KPDYVVPEIE ALSIEALKEL EDEGFNIVPN ARTVEITMNR DKIRDLASKD
LKIKTAKFDY IFEFDDLEKK ADEIGFPLLL KPLMSSSGKG QSLVETKNDL QNAWKQAQAN
SRGKVKGVII EEFINFDFEF TLLTVRKENG ENIFCLPIGH LQSNGDYQCS WQPLEIKESL
IIEAKRMTSR ILNNLNGAGL YGVEFFIKGS EVIFSELSPR PHDTGMVTLV SQNINEFELH
LRAFLNLPIP RIDLIEPSAT RVILSNQEYL NPIYEGLYEA LEFEKTKVLI FGKPVSRKGR
RMGVVLSSNT DINLARKNAD EAALKIKVST T