Gene NATL1_07431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07431 
SymbolpurK 
ID4781291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp684650 
End bp685795 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content37% 
IMG OID640084018 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001014566 
Protein GI124025450 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0031088 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATTGGGG TCGTGGGTGG AGGACAGCTT GCAATGCTTT TGATTGAGGC TGGAAAGAAA 
AGAAATGTTG ATGTCGTTGT TCAGACGGCT GCTAAAACTG ATCCTGCTGC TAAAAAGACA
AATCAACTCG TTTTGCATGA CCCTACGAAT CCTGTGGGTA CAAAACTTCT TGCAGAAAAG
ACCCGCTTGA TTACTTTTGA AAATGAATGG GTTGATATCT CAAGTTTACT TTCTCTTGAA
AATAATGGAG TTTCTTTTGT CCCTAGACTT CAATCAATAA GACCTTTAAT TAATAAAATA
ACTCAAAGGG AGCTATTAAA CAGTCTTGAT ATTCCCTGCC CTGATTGGTT GTCTATACCA
TTAAAAAAAT CAACAGAAAT TGATCTTCCT GCAGATTGGG GATTTCCTTT GATGGCAAAA
GCTGCCAAAG GTGGATATGA CGGGAAAGGA ACTAAAATTA TTAAAAATCT AAAGCAACTT
CAAGAATTTC TATCAGTTGA AAGAGAAGGG CAATGGATGT TAGAGAAATG GATCTCTTTT
GATAAGGAAT TATCCATTGT TTCTAGTAGG GATTCAAAAG GAATTGTACG TAGTCTGCCA
ATCGTAGAGA CATATCAATC TAAACAAGTA TGTGACTGGG TCCTTGCTCC AGCTGATATC
AATCATGACG TTGATCTTAT GGTTAGAAAT ATCGCAGCTT CGTTGCTTGC TGAGTTGCAA
TATGTTGGAG TTATTGCTAT TGAATTTTTC TATGGATCTG AAGGATTACT TGTAAATGAA
ATAGCTCCAA GAACTCATAA CTCAGGTCAT TTTTCTATTG ATGCTTGTAG CAGCAGTCAG
TTTGATCAAC AAATATGTAT CACCTCTGGT ATTGATGTAC CCATGCCTGA AATGCTTGTT
AATGGTGCTT TAATGGCAAA CTTGCTTGGT TTGCAAAGTA ACTATCCAAC ATCACTTACC
CAAAGATTGA ATGATTTGAG GGGTATTCCT GGCTTGAATG TTCATTGGTA TGAAAAAGAG
GAAGAAAAAA AGGGCAGGAA GCTTGGTCAC GTTACATATC TCTTGAATAA TAAGGACGCT
TTGTCTAGAA AAAAAGAAGC ATTAGATGTT TTACAAACCA TACGGTCAAT TTGGCCGACC
TCTTGA
 
Protein sequence
MIGVVGGGQL AMLLIEAGKK RNVDVVVQTA AKTDPAAKKT NQLVLHDPTN PVGTKLLAEK 
TRLITFENEW VDISSLLSLE NNGVSFVPRL QSIRPLINKI TQRELLNSLD IPCPDWLSIP
LKKSTEIDLP ADWGFPLMAK AAKGGYDGKG TKIIKNLKQL QEFLSVEREG QWMLEKWISF
DKELSIVSSR DSKGIVRSLP IVETYQSKQV CDWVLAPADI NHDVDLMVRN IAASLLAELQ
YVGVIAIEFF YGSEGLLVNE IAPRTHNSGH FSIDACSSSQ FDQQICITSG IDVPMPEMLV
NGALMANLLG LQSNYPTSLT QRLNDLRGIP GLNVHWYEKE EEKKGRKLGH VTYLLNNKDA
LSRKKEALDV LQTIRSIWPT S