Gene P9211_07981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07981 
SymbolpurK 
ID5730075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp703189 
End bp704388 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content40% 
IMG OID641285162 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001550683 
Protein GI159903339 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00471712 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAATG AATCATTAAC AAGAAATAAC AAAGTAGGCC CAACAGTTGG TGTCGTTGGT 
GGAGGACAAC TTGCGCAGAT GTTGGCTAAA GCTGCCAGAG AGAGAGGGAT TGATTTAATT
GTTCAGACTG GATTGAGGAG TGACCCTGCC GTTCAATATG CTAAGGGATT AGTACTTTCG
GATACAAGCG ATATTAATGG AACAAAAGAA CTTGCAAGCA AATGTAGCTG CTTGACCTTT
GAGAATGAAT GGATTGATGT AACGTCTCTC TCTTCGCTGG CTAACGATAG CTCCTTATTT
CAGCCTAGCT TAAATTCAAT TAAGCCATTA GTAGACAAAC TTTCTCAAAG AAAGCTTTTA
AACGATTTGA ATATCCCTGG TCCAGAATGG CTGCCCCTAG CATGTATTAA AAAAAAGGAT
TTAGAGCTTC CAGACGGTTG GGCATATCCA GTAATGGCAA AGGCAGGTAA AGGAGGATAT
GACGGTAAAG GAACAAGAGT TATAAATGAT GCCAATGAAC TTAAGGAGCT GTTTTTCTCT
GTTGATGTTT CTAATTGGTT CTTAGAGAAA TGGATTAGCT ATCAAAAAGA GTTGGCAATT
GTTGTCAGTC GAGATACCTG TGGACGAATA AACTCTTATC CCTTAACAGA GACTTTCCAA
CATAAGCAAG TCTGTGATTG GGTTGTTGCA CCTGCGAATG TAAGTCATTC AGTCCTTGTC
ACGGCTTATA ACGTAGGTGC TTCGTTATTG AGAGAGCTAA ATTATGTAGG TGTACTTGCG
ATTGAATTCT TTTATGGGGA TGAGGGATTG CTCGTCAATG AAATAGCCCC ACGTACTCAC
AACTCTGCTC ATTTTACAAT TGACGCATGC AGTAGTAGTC AATTTGACCA ACAAATATGC
ATAGCTGCCG GTCTACCAGC CCCTCCGGTT AAATTAGTCG TACCAGGCGC AATCATGATC
AATTTGTTAG GCTTGCGAGG CAAAGCAAAT TCCTTAGATG AACGATTGGA GAAGTTAAAG
CAAATAAATG GCGCAAAACT TCATTGGTAT TCTAAAGATA AAGTCTTGCC TGGAAGAAAA
TTAGGTCATT TAACAGTTCC CTTACTCGAT TTAGACCCCA CTTCAAGAAT TAATAAGGCG
ACAAGCATTT TAAAAAAAGT AAGAGCAATC TGGCCCTTTT TTGTTCCCGA TATCAATTAG
 
Protein sequence
MRNESLTRNN KVGPTVGVVG GGQLAQMLAK AARERGIDLI VQTGLRSDPA VQYAKGLVLS 
DTSDINGTKE LASKCSCLTF ENEWIDVTSL SSLANDSSLF QPSLNSIKPL VDKLSQRKLL
NDLNIPGPEW LPLACIKKKD LELPDGWAYP VMAKAGKGGY DGKGTRVIND ANELKELFFS
VDVSNWFLEK WISYQKELAI VVSRDTCGRI NSYPLTETFQ HKQVCDWVVA PANVSHSVLV
TAYNVGASLL RELNYVGVLA IEFFYGDEGL LVNEIAPRTH NSAHFTIDAC SSSQFDQQIC
IAAGLPAPPV KLVVPGAIMI NLLGLRGKAN SLDERLEKLK QINGAKLHWY SKDKVLPGRK
LGHLTVPLLD LDPTSRINKA TSILKKVRAI WPFFVPDIN