Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_18621 |
Symbol | dapA |
ID | 4718600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1595969 |
End bp | 1596871 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640079596 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001010252 |
Protein GI | 123969394 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.696996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACAG ACAAAACTGA GTGTAATAAT CCATTATTTG GAAGAATATT GACTGCAATG GTTACTCCAT TCACTGAGAA TGGAGATGTA GATTATGAAC TAGCTATAAA ACTTTCAAAT TATCTTTTTG AGAACGGTTC CGATGGAATT GTGTTGTGCG GTACTACTGG AGAATCTCCG ACTCTTTCAT GGGCGGAGCA GCATGATTTA TTTATTGCGG TAAAAGGATC TTTGGATGCA AGCTGTAAAG TAATAGTTGG CACTGGTAGC AATTGTACAA GCGAAGCTGT GGAAGCTACA AAAAAAGCTT ACGACTCTGG TGCCGACGGT GCTTTGGTCG TTGTTCCTTA TTACAATAAG CCGCCTCAAG AAGGTCTTTA TAAACATTTC AGTTCTATTG CTAAATCTGC AAAGGATTTG CCTCTTATGC TCTACAACAT TCCTGGCAGG ACTGGATGCA ATTTATTACC TGATACTGTG AAGAAACTTA TGGATTTCTC AAATATTCTC AGTATTAAAG CTGCAAGCGG TAGAATAGAA GAAGTAACAG AATTAAGAGC TATTTGTGGC TCCGAACTCT CTGTATATAG TGGCGACGAT TCATTGTTGC TTCCAATGTT ATCGGTAGGT GCTGTAGGAG TAGTAAGTGT TGCAAGTCAT TTAGTTGGAT TGCAATTGAA AGAGATGATT CATTCTTTTC AAAGTGGAAA GGTTTCCAAT GCTCTTGCTA TTCATGAAAA ACTTCAGCCT CTTTTCAAAG CACTCTTTAT GACTACTAAT CCAATCCCAA TTAAAGCTGC TTTGGAGCTC TCGGGATGGG ATGTAGGTAA TCCTAGAAGT CCTTTGTCAC CTTTAAACAA TGACATGAAA AAGCAACTAT CTTTTATCCT GAATTCCCTA TAA
|
Protein sequence | MITDKTECNN PLFGRILTAM VTPFTENGDV DYELAIKLSN YLFENGSDGI VLCGTTGESP TLSWAEQHDL FIAVKGSLDA SCKVIVGTGS NCTSEAVEAT KKAYDSGADG ALVVVPYYNK PPQEGLYKHF SSIAKSAKDL PLMLYNIPGR TGCNLLPDTV KKLMDFSNIL SIKAASGRIE EVTELRAICG SELSVYSGDD SLLLPMLSVG AVGVVSVASH LVGLQLKEMI HSFQSGKVSN ALAIHEKLQP LFKALFMTTN PIPIKAALEL SGWDVGNPRS PLSPLNNDMK KQLSFILNSL
|
| |