Gene A9601_16481 details

Gene Information

Locus tag: A9601_16481
Symbol: nadE
ID: 4718378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1392965
End bp: 1394662
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 31%
IMG OID: 640079374
Product: carbon-nitrogen hydrolase:NAD+ synthase
Protein accession: YP_001010038
Protein GI: 123969180
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG0171] NAD synthase; [COG0388] Predicted amidohydrolase
TIGRFAM ID: [TIGR00552] NAD+ synthetase
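
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1392965
END_BP = 1394662


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1698 565
```

Both results agree with the Gene Length (1698 bp) and Protein Length (565 aa) fields.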


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTTT TACTTGCCCA AATAAATCCA GTTGTTGGAG ATTTGGAGGG CAATGCAAAG 
AAAATACTTA ATATTGCCTC AAAAGCTAGC TCAATTTCTG CTGATTTTGT TCTTACTCCT
GAATTGTCAT TATGGGGATA TCCCGCAAAT GACTTGCTTT TAAAAAAAAA TTTAATCAAA
AATCAATACC AAATTCTTGA TAAATTAGCT TCAGATACTA ATAAAAAATA TGGAAACTTA
AGTATCACAG TCGGAATAGC TGAGTTAATA AACGATTCTT TTTTTCCTAA TCTTTATAAT
TCTATTGCGT TAGTTGAGGG CGGTGAATGG AGAATAATTG CTCGTAAAAT TATTCTTCCC
ACTTATGAAG TATTTGATGA GAAAAGATAT TTTAGATCAG AAGAAAAAGT TTCTGTATTG
ATTAAAAAAA TTAAAAATCA AACTTGGCGA CTTGGCTTCA CTATATGTGA GGATTTATGG
GTCAACAAAA ATATAGAGGG TAGAGGAATT CATAGAAAAA ATCCCATTGT TGATTTAAAG
AAAAAAGAAG TTGATATTTT AGTTAATTTA TCAGCCTCCC CATACACTCT GAAAAAGTTA
GAGCTAAGAT CCAAGGTTTC TAGTTTTGCT GCACAATATC TTCAAGTACC GCTGATTTAT
GTCAACCAGA TTGGAGCGAA TGATAATTTA ATTTTTGATG GAAATAGTTT TATTCTTGAT
AAGAATGGAT CTACAATTAA GCAATTAAAA TCTTTTTCAG AAGATCTCTC CAGCTGGGAA
ATTGAGCAAA CAAAACCTGA AAAAAAAGAA TTCAAAAACT CCGAAATGTC ATCAATTTTT
GATGCATTAG TCCTAGGTGT TAAAGATTAT GCAAAAAAAT GTAGATTTAA AACAGCTTTA
GTTGGATTAA GTGGTGGTAT TGATTCAGCA CTTGTCTCAG CAATTGCTAC AGCAGCTTTA
GGAAGTGAAA ATGTTTATTG CGTTTCAATG CCCTCTAAGT GGAGCTCATC TCATTCAAAG
ATTGATGCAA AAGACTTAGC AAGAAGATTA AAGATAAGTT TCAAGAGCAT TCCAATTGAA
AACTTGATGA ACTCTTTTGA AGAATCATTT ATAAAAACCA TAGATTTTGA AATGGCTGAA
ATAACAAATC AAAATATTCA GTCAAGGATA AGAGGCACAC TTCTTATGGC TTTAGCAAAT
CAAGAAAAGC ATTTGTTACT TTCAACAGGC AATAAATCTG AGCTGGCCGT AGGATATTGC
ACACTTTATG GAGATATGAA TGGGGGATTA TCGGTAATTG GGGATTTATA TAAAACAAAT
GTTTTTAAAT TATGCAATTG GCTTGATAGT GAAGATTCAA TTAATCATAG AAAATCATAT
ATGTTAGATA CTAATGTAGA TATAATTGGA AAAAATATTC GTACTAAAGC GCCAAGTGCC
GAACTAGGTC CAGATCAATT AGATACTGAT TCTCTCCCAC CTTATTCAAT TTTAGATAAT
ATTCTTAAAG GAATTATTGA AGAGAAAAAA GATTTACAAC AACTAGAAAA AGATGGTTAT
AAAAAAGATT TAATTTTAAA AATTATCTCA CTCATAAAAA AAGCGGAATT CAAAAGAAAG
CAGGCTCCTC CAATCCTAAA ACTAAGCAGT CAATCATTAG GAAGTGACTG GAGAGTTCCC
ATAGCAATAT CTTATTGA
 
Protein sequence
MKFLLAQINP VVGDLEGNAK KILNIASKAS SISADFVLTP ELSLWGYPAN DLLLKKNLIK 
NQYQILDKLA SDTNKKYGNL SITVGIAELI NDSFFPNLYN SIALVEGGEW RIIARKIILP
TYEVFDEKRY FRSEEKVSVL IKKIKNQTWR LGFTICEDLW VNKNIEGRGI HRKNPIVDLK
KKEVDILVNL SASPYTLKKL ELRSKVSSFA AQYLQVPLIY VNQIGANDNL IFDGNSFILD
KNGSTIKQLK SFSEDLSSWE IEQTKPEKKE FKNSEMSSIF DALVLGVKDY AKKCRFKTAL
VGLSGGIDSA LVSAIATAAL GSENVYCVSM PSKWSSSHSK IDAKDLARRL KISFKSIPIE
NLMNSFEESF IKTIDFEMAE ITNQNIQSRI RGTLLMALAN QEKHLLLSTG NKSELAVGYC
TLYGDMNGGL SVIGDLYKTN VFKLCNWLDS EDSINHRKSY MLDTNVDIIG KNIRTKAPSA
ELGPDQLDTD SLPPYSILDN ILKGIIEEKK DLQQLEKDGY KKDLILKIIS LIKKAEFKRK
QAPPILKLSS QSLGSDWRVP IAISY
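
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `normalize` and `gc_fraction` are hypothetical, and nothing here is claimed to be how the GC content field on this page was derived.

```python
def normalize(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten).
first_line = "ATGAAGTTTT TACTTGCCCA AATAAATCCA GTTGTTGGAG ATTTGGAGGG CAATGCAAAG "
seq = normalize(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence text through `normalize` and then `gc_fraction` would give a value to compare against the GC content field.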