Gene Information |
Locus tag | NATL1_21211 |
Symbol | dapA |
ID | 4780949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1778538 |
End bp | 1779485 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640085418 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001015941 |
Protein GI | 124026826 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
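The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1778538, 1779485)  # Start bp and End bp from the record
print(length, protein_length_aa(length))   # prints: 948 315
```

Under these assumptions the result agrees with the Gene Length (948 bp) and Protein Length (315 aa) fields of this record.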
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTGCA GATTGCTGAA CTTCTACTGC CAAAGAATTA TGAATAAGTC AGCTTTATTA TCACCAGCTC CTTTTGGAAG GCTCCTAACC GCAATGGTGA CCCCATTTGA TGATGAAGGG AAAGTTGATT ATGGTCTTGC TGCCGATTTG GCAAATTATT TGGTAGATCA AGGTTCAGAT GGCATCGTTG TATGTGGAAC TACTGGAGAG TCACCGACTC TAAGTTGGCA AGAACAACAA AAATTGCTGG AAATAGTAAG AAATTCCTTA GGCTCTAGGG CTAAAGTTTT AGCTGGAACA GGCAGTAATT CGACTTCTGA GGCAATTGAA GCTACAAAGG AAGCAGCTAA TTCAGGCGCT GATGGAGCAT TAGTTGTTGT TCCTTATTAC AACAAACCAC CGCAAGAGGG ATTAGAAGTT CATTTTCGCG CTATTGCAAA TGCCGCTCCA AAGTTGCCTT TAATGCTCTA TAACATCCCT GGGCGGACAG GGTGTTCAAT ATCGCCTAGT ATTGTTAGTA AGCTTATGGA TTGCAGTAAT GTAGTCAGTT TTAAAGCTGC AAGTGGAACA ACTGAGGAAG TGACTCAATT AAGAAACTAT TGTGGATCAG ATTTAGCTAT TTATAGCGGT GATGATGCTT TGGTTTTACC AATGCTTTCA GTAGGGGCAG TTGGTGTTGT TAGTGTTGCA AGTCATTTAG TTGCACCTAA TTTGAAGAAA ATTATAGAGA GTTTTTTAGA GGGTAAATAT TCTGAGGCAC TTTATTTGCA CGAGACATTA CAACCTCTTT TTAAATCCCT TTTTGCAACT ACAAATCCAA TTCCTGTTAA AGCGGCACTT CAACTCATCG GTTGGTCTGT TGGACCTCCT CGAAGTCCTC TAGTCTCTTT AAACAGTGAA ATGAAAGAAG AACTCGTGAA GATACTCTCT TCTCTGAGAT TGATTTGA
Protein sequence | MQCRLLNFYC QRIMNKSALL SPAPFGRLLT AMVTPFDDEG KVDYGLAADL ANYLVDQGSD GIVVCGTTGE SPTLSWQEQQ KLLEIVRNSL GSRAKVLAGT GSNSTSEAIE ATKEAANSGA DGALVVVPYY NKPPQEGLEV HFRAIANAAP KLPLMLYNIP GRTGCSISPS IVSKLMDCSN VVSFKAASGT TEEVTQLRNY CGSDLAIYSG DDALVLPMLS VGAVGVVSVA SHLVAPNLKK IIESFLEGKY SEALYLHETL QPLFKSLFAT TNPIPVKAAL QLIGWSVGPP RSPLVSLNSE MKEELVKILS SLRLI
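The sequences above are displayed in space-separated groups of 10 characters, so the spaces must be removed before any computation. The following is a hedged sketch of how the gene sequence could be cleaned and sanity-checked. All function names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG, which this sketch does not accept.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character groups."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends in a standard stop codon (simplified check, see lead-in)."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input in the same grouped layout (not the sequence of this record).
toy = clean_sequence("ATGGCAGCTT GA")
print(len(toy), gc_fraction(toy), looks_like_complete_cds(toy))  # prints: 12 0.5 True
```

Passing the Gene sequence field of this record through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.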