Gene P9303_00721 details

Gene Information

Locus tag: P9303_00721
Symbol: dapA
ID: 4778731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 71142
End bp: 72050
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 56%
IMG OID: 640085572
Product: dihydrodipicolinate synthase
Protein accession: YP_001016094
Protein GI: 124021787
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
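
The coordinates and lengths in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(71142, 72050)
print(length_bp)                  # 909
print(protein_length(length_bp))  # 302
```

Both results match the Gene Length and Protein Length fields listed for P9303_00721.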


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.28982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTCTG CTGCTGAGTC GTCTCCAACT CCTTTTGGCC GTTTGCTGAC GGCCATGGTT 
ACACCTTTTG ACGCTGATGG ATGCGTTGAT CTGGCTTTGG CTGGTCGTCT TGCTCGCTAT
CTCGTAGATG AGGGATCTGA TGGGCTGGTT GTCTGCGGCA CGACTGGGGA ATCGCCCACT
TTGAGCTGGC AGGAGCAGCA TCAATTGCTT GGGGTGGTCC GTCAGGCAGT GGGGCCAGGT
GTGAAGGTTC TTGCCGGTAC TGGTAGCAAT AGCACCGCTG AGGCGATAGA GGCCACCACT
CAGGCAGCTG CGGTGGGTGC TGATGGGGCA TTGGTGGTTG TTCCTTATTA CAACAAGCCT
CCGCAGGAAG GTCTTGAAGC CCATTTCAGG GCTATTGCCC AGGCTGCCCC TGAGTTGCCG
CTGATGCTCT ACAACATTCC TGGGCGGACT GGCTGTTCGC TTGCTCCAGC AACAGTTGCA
AGGTTGATGG AATGTCCGAA TGTGGTGAGT TTCAAAGCCG CCAGTGGCAC CACGGATGAG
GTGACGCAGT TGAGGTTGCA GTGTGGTTCA AAACTGGCTG TTTACAGCGG CGACGATGGC
TTGCTTTTGC CCATGATGTC GGTGGGGGCT GTTGGGGTGG TGAGTGTCGC AAGTCACCTT
GTAGGTCGTC GGCTTAAGGC GATGATTGAG GCCTATCTCA ATGGTCAGGG TGCTCTTGCC
CTCAGTTATC ACGAGCAGTT GCAACCTTTG TTCAAGGCTC TATTTGTCAC CACCAATCCG
ATTCCTGTTA AAGCAGCCCT CGAGCTCAGC GGTTGGCCGG TCGGATCCCC CCGCCTCCCT
TTGCTTCCAC TTGATCCCGT TATGCGAGAT GCTCTTTCAA ACACCTTGAC TGCCTTGTGT
CAGACCTGA
 
Protein sequence
MSSAAESSPT PFGRLLTAMV TPFDADGCVD LALAGRLARY LVDEGSDGLV VCGTTGESPT 
LSWQEQHQLL GVVRQAVGPG VKVLAGTGSN STAEAIEATT QAAAVGADGA LVVVPYYNKP
PQEGLEAHFR AIAQAAPELP LMLYNIPGRT GCSLAPATVA RLMECPNVVS FKAASGTTDE
VTQLRLQCGS KLAVYSGDDG LLLPMMSVGA VGVVSVASHL VGRRLKAMIE AYLNGQGALA
LSYHEQLQPL FKALFVTTNP IPVKAALELS GWPVGSPRLP LLPLDPVMRD ALSNTLTALC
QT
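
The sequences above are shown in blocks of ten characters, so they need to be joined before use. The following is a minimal sketch, not a reference implementation, of how the gene sequence could be cleaned, its GC fraction computed, and the first and last codons translated. It assumes plain Python with no external libraries; the helper names are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
first_blocks = clean_sequence("ATGAGCTCTG CTGCTGAGTC GTCTCCAACT")
print(translate(first_blocks))  # MSSAAESSPT
print(translate("CAGACCTGA"))   # QT
print(gc_fraction(first_blocks))
```

The two translated fragments agree with the start and end of the protein sequence listed above. Passing the full 909 bp sequence to `gc_fraction` would allow a comparison with the GC content field.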