Gene NATL1_17671 details


Gene Information

Locus tag: NATL1_17671
Symbol: purD
ID: 4779675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1448378
End bp: 1449682
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 37%
IMG OID: 640085055
Product: phosphoribosylamine--glycine ligase
Protein accession: YP_001015587
Protein GI: 124026472
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0151] Phosphoribosylamine-glycine ligase
TIGRFAM ID: [TIGR00877] phosphoribosylamine--glycine ligase
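
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the stop codon; the record does not state either convention, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 1448378
end_bp = 1449682
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```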


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA ATCAAAAAGC TGAAGAATTG AAAAGGATAT TAATAGTTGG AAGCGGAGGT 
AGAGAAAACT CTATTGCTTG GGCTCTATCA AAAAATCAAT CTATTGAGCA AATATATGTT
TGTCCTGGAA ATGGTGGGAC AGCCAAATTT GAAAAATGTA TTTGCCTAAA ACCCAACTCT
GAAGATGAAA AAATAATCAT TAATGAGTGT AAGCGACTTG CCATTGATTT AGTCATCATT
GGTCCTGAAG ACCCTTTGGC TGAGGGCTTA GCAAATAAAA TGCGTGAGGC AGGGTTAACA
GTTTTTGGTC CAGGCAAAGA TGGTGCTCAA TTAGAGGCAA GTAAAGACTG GTCTAAGGCT
CTGATGATAG AGAACAATAT CCCTACAGCA AAGTACTGGT CTGCAAAATC TAAAGAAGAA
GCACTAGAAA TTCTTAAAAG ATTTAACCAA CCTCTTGTTA TCAAAGCAGA TGGTCTTGCG
GCAGGTAAAG GTGTCACTGT TTGTGAAACC ATTGAGGAAT CTAAGGAAGC TATTAAGGAC
ATATTTTCTG GAAAGTTTGG CTCTGCAGGA AATAAAGTTA TTCTCGAAGA AAAAATTGAA
GGGCCAGAAG TTTCTATTTT TGCCCTTTGT GATGGAGAGA AATTAATTGT TCTTCCTCCA
GCCCAAGATC ATAAAAGATT ACTTGATGGA GACAATGGTC CAAATACTGG CGGAATGGGA
GCTTATGCAC CTGCACTTTT AATTAATGAA CAAGATCTTA AGGACTTAAC TGAACTGGTT
CTTATCCCAA CTTTAAAAGG TTTGAAAAAA AAGAATATCA ATTATATCGG TGTCATTTAT
GCCGGTTTAA TGCTTACATC GTCAGGGCCA AAAGTTATTG AATTCAATTG CCGCTTTGGA
GATCCAGAAT GTCAAGCTTT AATGCCCTTA ATGGGAGAGG AATTTGCTTC TGTCCTTTTT
GCTTGCGCTC GAGGAGAGAT TGAGAATGCA CCAAAACTTA CATTTAACTC TGAATGCAGT
GCTTGTATAG TTGCAGCCTC TAAGGGATAC CCTGAAAGCC CACAAAAAGG TGAAAAGATT
GCCATCAATG TTGAATCAAA TTCTTCACTC CAAATTTTTC ACGCAGGCAC CACAATTGAC
AAATTTGACA ATATAATTAC CTCAGGTGGA AGAGTTCTTT CAGTAGTTGC TCAGGGAGAA
AGTTTCGACA AAGCTTTCGA TTTAGCCTAT TCTAACCTTA AAAAAATTAA CTTTAATGGT
ATGCATTTTC GCGAAGATAT AGGTTACCAA GTTAGGAATA TTTGA
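
The GC content reported under Gene Information is presumably the fraction of G and C bases in the gene sequence; the page does not define it, so treat that as an assumption. A minimal sketch of the calculation follows. The helper names `clean_sequence` and `gc_fraction` are illustrative, not part of any database tooling, and the sample string is only the first two blocks of the sequence above.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustrative input: the first two 10-base blocks of the gene sequence.
# Paste the full sequence here to compare against the reported value.
sample = "ATGAAAAAGA ATCAAAAAGC"
seq = clean_sequence(sample)
print(f"{len(seq)} bases, GC fraction {gc_fraction(seq):.2f}")
```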
 
Protein sequence
MKKNQKAEEL KRILIVGSGG RENSIAWALS KNQSIEQIYV CPGNGGTAKF EKCICLKPNS 
EDEKIIINEC KRLAIDLVII GPEDPLAEGL ANKMREAGLT VFGPGKDGAQ LEASKDWSKA
LMIENNIPTA KYWSAKSKEE ALEILKRFNQ PLVIKADGLA AGKGVTVCET IEESKEAIKD
IFSGKFGSAG NKVILEEKIE GPEVSIFALC DGEKLIVLPP AQDHKRLLDG DNGPNTGGMG
AYAPALLINE QDLKDLTELV LIPTLKGLKK KNINYIGVIY AGLMLTSSGP KVIEFNCRFG
DPECQALMPL MGEEFASVLF ACARGEIENA PKLTFNSECS ACIVAASKGY PESPQKGEKI
AINVESNSSL QIFHAGTTID KFDNIITSGG RVLSVVAQGE SFDKAFDLAY SNLKKINFNG
MHFREDIGYQ VRNI
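
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which start codons it permits; this gene starts with ATG). The function name `translate` is illustrative, and the example translates only the first and last few codons of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final 15 bases of the gene sequence.
print(translate("ATGAAAAAGA ATCAAAAAGC TGAAGAATTG"))
print(translate("GTTAGGAATA TTTGA"))
```

The two printed fragments should correspond to the start (MKKNQKAEEL) and end (VRNI) of the protein sequence listed above.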