Gene NATL1_18651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18651 
SymbolpurB 
ID4780294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1522143 
End bp1523438 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content38% 
IMG OID640085154 
Productadenylosuccinate lyase 
Protein accessionYP_001015685 
Protein GI124026570 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.537762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTGAGC GTTACACAAA CCCAGAGATG GGAAATATTT GGTCTGATCA AGCCAAATAC 
CAAACATGGC TTGATGTTGA AATTGCCGCA TGTGAGGCTA ATTGCAAATT AGGGAAAATC
CCTCAAAGTG CAATGGAAAC AATTCGAACA AAGGCAAGAT TCAAGCCAGA ACGCATACTC
GAAATAGAGG AGGAAGTTCG CCATGACGTA ATTGCCTTTC TAACAAATGT AAATGAATAT
GTTGGGGATG CTGGCCGTTA CATTCACGTT GGAATGACCA GTAGCGATGT CCTTGATACT
GGTCTTGCAC TGCAATTAAA GTCATCCGTC AAACTTTTAA GAAAAGAGCT TTTATTACTT
GAAGAAGCTA TTAGAGATTT AGCAAGTCAG CATAAAAAAA CCGTAATGAT TGGACGTTCT
CATGCCATTC ATGGAGAACC TATTACCTTT GGATTCAAGT TGGCGGGATG GCTAGCTGAA
ACTCTCAGGA ACAAAGATAG GCTAAACAGT CTTGAGAAAG ATATTTCTGT TGGTCAAATC
AGCGGAGCTA TGGGCACTTA TGCCAATACT GATCCAGAAA TAGAAAAAAT AACTTGCGAA
CTTTTGGAGC TTGATTGTGA CACTGCTAGC ACTCAAGTTA TCTCAAGAGA TAGGCATGCT
AATTATGTGC AGATTCTTGC TTTGATTGGA TCTTCACTAG ATCGTTTTTC TACAGAAATT
AGAAACCTTC AAAGAACTGA TGTTCTTGAA GTAGAGGAAA ACTTTGCTAA AGGCCAAAAA
GGAAGCTCTG CAATGCCTCA TAAAAGAAAT CCTATACGTA GTGAACGGGT AAGTGGGCTT
TCCAGGGTTT TAAGAAGTTA TGTAGTTGCA GCTCTTGAAA ATGTAGCCCT ATGGCACGAA
AGAGATATAA GCCACAGCTC CAATGAAAGA TTAATGCTGC CAGACACATC TATTACTCTT
CATTTCATGC TCACAGAAAT GACCGCAATA ATTAAAGGTC TTGGAGTATA TCCAAATAAT
ATGCTGAAAA ATTTGAACAT TTATGGAGGA GTAGTGTTTA GTCAAAGAGT ACTTTTGGCT
TTAGTTGAGA ATGGAATGAG TAGAGAAGAT TCTTATAGAT TAGTTCAAAA AAATGCTCAT
TCAGCCTGGA ATCAACCCGA AGGAAATTTC AAAAAGAACC TTGAGAATGA CCCAGAGGTA
ATGAATAGTC TCTCTACTGA AAAACTCTCT GATTGCTTCT CAACCGAATT ACATCAATCA
AATTTGAGAG TTATTTGGGA AAGACTTGGC ATATAA
 
Protein sequence
MIERYTNPEM GNIWSDQAKY QTWLDVEIAA CEANCKLGKI PQSAMETIRT KARFKPERIL 
EIEEEVRHDV IAFLTNVNEY VGDAGRYIHV GMTSSDVLDT GLALQLKSSV KLLRKELLLL
EEAIRDLASQ HKKTVMIGRS HAIHGEPITF GFKLAGWLAE TLRNKDRLNS LEKDISVGQI
SGAMGTYANT DPEIEKITCE LLELDCDTAS TQVISRDRHA NYVQILALIG SSLDRFSTEI
RNLQRTDVLE VEENFAKGQK GSSAMPHKRN PIRSERVSGL SRVLRSYVVA ALENVALWHE
RDISHSSNER LMLPDTSITL HFMLTEMTAI IKGLGVYPNN MLKNLNIYGG VVFSQRVLLA
LVENGMSRED SYRLVQKNAH SAWNQPEGNF KKNLENDPEV MNSLSTEKLS DCFSTELHQS
NLRVIWERLG I