Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00721 |
Symbol | dapA |
ID | 4778731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 71142 |
End bp | 72050 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085572 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001016094 |
Protein GI | 124021787 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.28982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCTG CTGCTGAGTC GTCTCCAACT CCTTTTGGCC GTTTGCTGAC GGCCATGGTT ACACCTTTTG ACGCTGATGG ATGCGTTGAT CTGGCTTTGG CTGGTCGTCT TGCTCGCTAT CTCGTAGATG AGGGATCTGA TGGGCTGGTT GTCTGCGGCA CGACTGGGGA ATCGCCCACT TTGAGCTGGC AGGAGCAGCA TCAATTGCTT GGGGTGGTCC GTCAGGCAGT GGGGCCAGGT GTGAAGGTTC TTGCCGGTAC TGGTAGCAAT AGCACCGCTG AGGCGATAGA GGCCACCACT CAGGCAGCTG CGGTGGGTGC TGATGGGGCA TTGGTGGTTG TTCCTTATTA CAACAAGCCT CCGCAGGAAG GTCTTGAAGC CCATTTCAGG GCTATTGCCC AGGCTGCCCC TGAGTTGCCG CTGATGCTCT ACAACATTCC TGGGCGGACT GGCTGTTCGC TTGCTCCAGC AACAGTTGCA AGGTTGATGG AATGTCCGAA TGTGGTGAGT TTCAAAGCCG CCAGTGGCAC CACGGATGAG GTGACGCAGT TGAGGTTGCA GTGTGGTTCA AAACTGGCTG TTTACAGCGG CGACGATGGC TTGCTTTTGC CCATGATGTC GGTGGGGGCT GTTGGGGTGG TGAGTGTCGC AAGTCACCTT GTAGGTCGTC GGCTTAAGGC GATGATTGAG GCCTATCTCA ATGGTCAGGG TGCTCTTGCC CTCAGTTATC ACGAGCAGTT GCAACCTTTG TTCAAGGCTC TATTTGTCAC CACCAATCCG ATTCCTGTTA AAGCAGCCCT CGAGCTCAGC GGTTGGCCGG TCGGATCCCC CCGCCTCCCT TTGCTTCCAC TTGATCCCGT TATGCGAGAT GCTCTTTCAA ACACCTTGAC TGCCTTGTGT CAGACCTGA
|
Protein sequence | MSSAAESSPT PFGRLLTAMV TPFDADGCVD LALAGRLARY LVDEGSDGLV VCGTTGESPT LSWQEQHQLL GVVRQAVGPG VKVLAGTGSN STAEAIEATT QAAAVGADGA LVVVPYYNKP PQEGLEAHFR AIAQAAPELP LMLYNIPGRT GCSLAPATVA RLMECPNVVS FKAASGTTDE VTQLRLQCGS KLAVYSGDDG LLLPMMSVGA VGVVSVASHL VGRRLKAMIE AYLNGQGALA LSYHEQLQPL FKALFVTTNP IPVKAALELS GWPVGSPRLP LLPLDPVMRD ALSNTLTALC QT
|
| |