Gene P9211_11841 details


Gene Information

Locus tag: P9211_11841
Symbol: purT
ID: 5731016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1070244
End bp: 1071422
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 39%
IMG OID: 641285552
Product: phosphoribosylglycinamide formyltransferase 2
Protein accession: YP_001551069
Protein GI: 159903725
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)
TIGRFAM ID: [TIGR01142] phosphoribosylglycinamide formyltransferase 2
            [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
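
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1070244
end_bp = 1071422
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that does not contribute an amino acid.
codons = span_bp // 3
expected_aa = codons - 1

print(span_bp, span_bp % 3, expected_aa)  # prints: 1179 0 392
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.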


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.703197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0751246
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTT ATCCAAAGAC CTTAATGTTG CTAGGGAGTG GAGAGCTTGG GAAGGAAGTC 
GCTATTTCGG CTAAAAGACT TGGCTGCAAA GTGATTGCTT GCGATAGGTA CGCTGACGCT
CCTGCAATGC AAGTGGCAGA TATTGCCGAA ATTTTGGATA TGACCAACTC CGGTGAGCTT
AAGGATGTTA TTAATTCATA TAAGCCAGAT GTAATCATTC CAGAAATAGA AGCTTTAGCA
GTTGATGCTT TGGATGAGAT TGAGAGAGAT GGGATAACTG TCATACCAAC AGCAAGAGCA
ACACAAATAA CAATGAATAG AGACAAAATA AGAGATTTAG CATCTAATAA GCTTCATCTC
AAAACTGCTA AATATGTTTA TGCATCCAAC AGTGATGAAG TCAAAAAAGC TGCTAAAGAA
CTTGGGTTGC CCGTAATAGT TAAACCAATT ATGAGTTCTT CTGGGAAAGG TCAATCTTTC
ATAAGGAAAG AAGAGGACAT TGAGCTTGCC TGGGAACTTG CTCTAAAGAA AGCAAGAGGA
GTTTCAAGCA GGGTAATAGT AGAAGAGTTT CTCGAATTCG ACTTTGAGAT TACATTGTTG
ACTATTCGTC AAAAAGACGG GTCCACACTT TTCTGTCCTC CAATTGGCCA CGAACAAAAA
AATGGGGACT ATCAATCCAG TTGGCAACCT GTAGTAATTA CAGAGAGTCA ACTACAAGAA
GCACAAGTTA TGGCAAAAGC TGTAACTCAA GAGCTTGGCG GAGTTGGAAT ATTTGGGGTT
GAATTCTTTA TAACTAAAGA AAGTGTGATT TTTTCTGAAT TGTCGCCTAG GCCACATGAT
ACAGGCCTTG TAACGCTTAT TAGCCAGGAC CTAAGTGAGT TTGACCTTCA TGTCAGGGCG
ATCCTAGGCC TTCCAATTCC TTCAATCACA GTTAATCACC CTAGTGCTAG TAGAGTTATC
TTGTCTGAGA AGAATTGCAC AAAAGTTGCG TATAGAGGAA TTGAAAAAGC TTTGGAAGAA
GAAGGTACCA AAATACTAAT ATTTGGAAAG CCAAGTGCTA CAAAAGGCAG AAGGATGGGT
GTGGCATTGG CAAAGTCTTC AACCTTAGAT AAAGCTCTAA TAAAAGCTAA TAATTCGGCA
ATTAATGTAG AAGTAATTCA AGATGATTAC TCCAATTAA
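
A GC content figure like the one listed in the gene information can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases among all bases; the record does not say how its value was computed or rounded. The function name `gc_percent` is hypothetical. Whitespace is removed first because the sequence is printed in blocks of ten bases.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to print the sequence in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first printed line only; pass the full sequence for the gene-wide value.
first_line = "ATGAATATTT ATCCAAAGAC CTTAATGTTG CTAGGGAGTG GAGAGCTTGG GAAGGAAGTC"
print(round(gc_percent(first_line), 1))
```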
 
Protein sequence
MNIYPKTLML LGSGELGKEV AISAKRLGCK VIACDRYADA PAMQVADIAE ILDMTNSGEL 
KDVINSYKPD VIIPEIEALA VDALDEIERD GITVIPTARA TQITMNRDKI RDLASNKLHL
KTAKYVYASN SDEVKKAAKE LGLPVIVKPI MSSSGKGQSF IRKEEDIELA WELALKKARG
VSSRVIVEEF LEFDFEITLL TIRQKDGSTL FCPPIGHEQK NGDYQSSWQP VVITESQLQE
AQVMAKAVTQ ELGGVGIFGV EFFITKESVI FSELSPRPHD TGLVTLISQD LSEFDLHVRA
ILGLPIPSIT VNHPSASRVI LSEKNCTKVA YRGIEKALEE EGTKILIFGK PSATKGRRMG
VALAKSSTLD KALIKANNSA INVEVIQDDY SN
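
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal illustration, not the method used to generate this record. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 differs mainly in which codons may act as start codons, and that does not matter here because the gene starts with ATG. The names `BASES`, `AMINO` and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the gene sequence above.
first_line = "ATGAATATTT ATCCAAAGAC CTTAATGTTG CTAGGGAGTG GAGAGCTTGG GAAGGAAGTC"
print(translate(first_line))  # prints: MNIYPKTLMLLGSGELGKEV
```

The result matches the first two blocks of the protein sequence. Translating the last printed line of the gene sequence in the same way gives INVEVIQDDYSN followed by the TAA stop codon, which matches the end of the protein.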