Gene NATL1_18451 details

Gene Information

Locus tag: NATL1_18451
Symbol: nadE
ID: 4780578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1507085
End bp: 1508794
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 35%
IMG OID: 640085134
Product: carbon-nitrogen hydrolase:NAD+ synthase
Protein accession: YP_001015665
Protein GI: 124026550
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG0171] NAD synthase; [COG0388] Predicted amidohydrolase
TIGRFAM ID: [TIGR00552] NAD+ synthetase
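
The coordinates, gene length, and protein length in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Coordinates copied from the record (assumed 1-based and inclusive).
start_bp = 1507085
end_bp = 1508794

# With inclusive coordinates both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1710 bp
print(protein_length, "aa")  # 569 aa
```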


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.522379
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAG CTCTGGCCCA ACTAAATCCA GTTATTGGAG ATCTCGATGG TAATTCCGAG 
AAAATATTTG AAGTATGCGA AGAGATCAAA GACAAGGATG TGGATCTAGT AATAACACCA
GAGCTTTCTT TAATCGGCTA CCCTCCAAAA GACTTATTAT TTAATCCAAG ACTTTTTGAA
GAGGAAAAAG ATGCCCTAAA TAAATTAAGC AAAAGATTAA GTAATCTTAG TCAAAATTTA
TCTGTATTAA TTGGTATTGC AGAACCAACT CCAGATATAG AAATTCCTAA ACTCTACAAT
TCAGTTGTGC TTTTAGAAAA AGGTACCTGG AGAATCGTAG CCAGGAAGCA ACTCCTACCT
ACTTACGATG TATTTGATGA AAAACGTTAT TTTCGTTCAG CAGAGAATAG TAGTATTTTA
AGTTTTAATT TTCAAAAAAA ACTTTGGAAG ATAGGAATCA CTATTTGTGA AGATATATGG
GTTGAGCAAA ATCTACAAAA TGAAAAGATC CTAGGGAAAG ATCCTTTAAA ATTCTTAGAA
AACGAAAAAT TAGATTTGCT TATCAATCTT TCAGCTTCTC CATTTATTGA ATCGAAAAGT
TTAATACGTC AACGCATAGC AGCTAAGGCA GCCATGCGTC TTTCATGCCC AGTGATCTAT
GTCAATCAAG TAGGAGGGAA TGATGAATTG ATCTTTGACG GATCTAGTTT TGCGATTAAT
AAAGAAGGCG AGTTAAAACA AGAACTTCCT AGTTTTAAAG AATCAGTTGG TCTTTGTGAT
ATATCTTCCC TCAAGCAACA ACCTTCGATA TTAAGTAAAT ACCCAACATC ACAAGAAGTC
ATTTTCAGGG CTCTAGTGCT TGGAGTTAAA GATTATGCAA GAAAATGTAA TTTCCACAGT
GCCGTTATTG GATTAAGTGG GGGTATTGAC TCAGCCCTAG TAGCAACTAT TGCAGTAGCA
GCTTTAGGTA TCTCTAAAGT TCATGCGATT TTAATGCCTT CTCCATGGAG TTCAGCAGGG
TCCGTAAAAG ATGCAACCAC ATTAGCTAAT CGACTGGGAA TAAATCATCA AAAGATTCCG
ATCTCAGATG TAATGAATAG TTTTAACGAT GTTTTATCCA ATTCAATATG GGGAATACCA
ATCGGTATCA CAGCTGAGAA TCTACAATCT CGCATTAGGG GAACAATATT GATGGCCATA
GCAAATCAAA AAAAATATTT ACTTCTATCA ACAGGAAATA AGTCAGAACT TGCCGTAGGA
TATTGCACAC TTTACGGAGA CATGAATGGA GGCTTGTCTG TAATTGGAGA TCTTTATAAG
ACAAGTGTAT TTGATCTATG TGATTGGATA GATAGTGAAT CGTCATCTAG TTGTAGAAAT
GATTTTTCTC TCCCTGAAAA AGGAGAAATT ATAAGCTCTG AGATAAGAGA CAAGCCTCCA
AGTGCTGAAT TACGGCCTGA ACAATTAGAT AGCGACTCTT TGCCTGAGTA TTCCATTCTT
GATCAAATAC TAAAAGGATT AATTGAACAG CACTTACCAA CTGAGGTGTT AATAGAAAAA
GGATTTGATA AAAAAATTGT TCATAAAGTA GCCAATCTTT TAAAGAACGC TGAATTCAAA
CGTTATCAAG CACCTCCTTT ATTGAAAATT AGTAATCAAG CATTTGGAAG TGGGTGGAAA
AAACCTATAG CCTCAGGACA AATCCTTTAA
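
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before any computation. The following is a minimal sketch of how a GC percentage like the one reported in the record could be computed from such text. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC figure was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the first three blocks only; paste the full gene
# sequence here to compare against the value in the record.
fragment = "ATGAAATTAG CTCTGGCCCA ACTAAATCCA"
print(len(clean_sequence(fragment)), round(gc_percent(fragment), 1))
```

A short fragment is not expected to match the whole-gene figure, since base composition varies along the sequence.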
 
Protein sequence
MKLALAQLNP VIGDLDGNSE KIFEVCEEIK DKDVDLVITP ELSLIGYPPK DLLFNPRLFE 
EEKDALNKLS KRLSNLSQNL SVLIGIAEPT PDIEIPKLYN SVVLLEKGTW RIVARKQLLP
TYDVFDEKRY FRSAENSSIL SFNFQKKLWK IGITICEDIW VEQNLQNEKI LGKDPLKFLE
NEKLDLLINL SASPFIESKS LIRQRIAAKA AMRLSCPVIY VNQVGGNDEL IFDGSSFAIN
KEGELKQELP SFKESVGLCD ISSLKQQPSI LSKYPTSQEV IFRALVLGVK DYARKCNFHS
AVIGLSGGID SALVATIAVA ALGISKVHAI LMPSPWSSAG SVKDATTLAN RLGINHQKIP
ISDVMNSFND VLSNSIWGIP IGITAENLQS RIRGTILMAI ANQKKYLLLS TGNKSELAVG
YCTLYGDMNG GLSVIGDLYK TSVFDLCDWI DSESSSSCRN DFSLPEKGEI ISSEIRDKPP
SAELRPEQLD SDSLPEYSIL DQILKGLIEQ HLPTEVLIEK GFDKKIVHKV ANLLKNAEFK
RYQAPPLLKI SNQAFGSGWK KPIASGQIL
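
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds a codon table and translates a block-formatted DNA string. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The function name `translate` is hypothetical and not part of any database tool.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, with the
# third base varying fastest; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAATTAG CTCTGGCCCA ACTAAATCCA"))  # MKLALAQLNP
```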