Gene PMN2A_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1709 
Symbol 
ID3607117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp379332 
End bp380711 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content36% 
IMG OID637688598 
Productputative aldehyde dehydrogenase 
Protein accessionYP_292900 
Protein GI72383545 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0791372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTAG AAAATTTCGT ACTTCATCAA TTACAAGATC TAGTTCTATC AGGCAAAACA 
AGAAATGAGA AATGGAGAAG GGCACAGCTT AAATCCTTAT CAACTTTATT AGAAAATCAT
CAGCAAGAAA TATTAAAAGC CTTAAGTCAA GATTTAGGGA AGCCAGCTAC AGAAGCGTTC
TTCGAGATTA TTGCAGTAAA GCAAGAAATA AAACTGGCGC AGAAAAGTTT ATCTAATTGG
ATGAAGACGA GGCAAATCAA TGTGCCTGTC TCTCTTAAAC CAGCTCAAGC ATTGGTCCAG
CCGGATCCGT TGGGCTGCAT TTTGATAATT GGGCCATGGA ATTATCCTTT TTCGCTTACC
CTTCAACCAC TAGTAGGAGC ATTAGCCGCT GGAAACACTG CTGTTTTAAA GCCATCAGAG
CATGCTCCTA ACGTTTCAAA TCTGATAAAA AAACTTATAG AAAAATATTT CCCACCAGAG
ATCGTGCAAG TTTTTGAAGG AGATGGAAAT ATTGCTGCTG ATTTAATGAC TCGACAATTT
GATCACGTCT TTTTTACAGG TGGAGAAAAT ATAGGGAAAA AAGTAATGGA AGCCGCCTCA
AAAAACCTCA CTCCAGTAAC TTTAGAACTT GGTGGCAAAA GCCCAGCTGT TGTTATCGAT
GGTGCAAATC TAGAAGTAAC TTCAAAGAGA GTTATATGGG GAAAAAGTCT AAACGCTGGT
CAAACATGTA TTGCTCCGGA TCATTTACTG GTTGAGGATA AACTTTTTGA TTCATTAATT
TCTAATTTAA TAAATTCGAT CAATGATTTC TACGGAAATA CGCCTTTAGA TTCAAAACAT
CTGGGGAGCA TTATCAATGA AAAGCAATTT AATAGACTTA ATAATTTACT AACACAAGCT
AAAAAGAATA ACCAGATAAT CTATGGAGGA GATAGCAATG AAAAAGAGAA AAGAATTAGC
CCTACATTGA TCAAAATTGA CAATAGAAAT GATCCTCTTA TGAAGGAAGA ACTTTTTGGC
CCATTGCTGC CTATTTTAAG TATTAAAAAT CTCGACCAAG CTATTTCAGA TTTCAAGTTA
TTGCCTAAAC CCCTTGCTTT ATATCTTTTT GGAGGAGGTG AGAAAGAACA AGGTAAAGTA
CTCTCTATGA CCTCTTCAGG AGGTGTTTGT TTCAATGATG TTGTTCTACA GGCAGGGATA
CCTGAACTGC CTTTTGGAGG CGTCGGAACA AGTGGCATGG GGAAATACCA CGGCAAAGCA
GGTTTTGATA ACTTTACTCA TTACAAATCA GTCCTAAAAA GACCTTTTTG GTTAGATCTA
AACTTCAGAT ACCCTCCTTA TAAGTTAGAT TTGTCTTTAC TTAATAAATT AATAGGTTAA
 
Protein sequence
MSLENFVLHQ LQDLVLSGKT RNEKWRRAQL KSLSTLLENH QQEILKALSQ DLGKPATEAF 
FEIIAVKQEI KLAQKSLSNW MKTRQINVPV SLKPAQALVQ PDPLGCILII GPWNYPFSLT
LQPLVGALAA GNTAVLKPSE HAPNVSNLIK KLIEKYFPPE IVQVFEGDGN IAADLMTRQF
DHVFFTGGEN IGKKVMEAAS KNLTPVTLEL GGKSPAVVID GANLEVTSKR VIWGKSLNAG
QTCIAPDHLL VEDKLFDSLI SNLINSINDF YGNTPLDSKH LGSIINEKQF NRLNNLLTQA
KKNNQIIYGG DSNEKEKRIS PTLIKIDNRN DPLMKEELFG PLLPILSIKN LDQAISDFKL
LPKPLALYLF GGGEKEQGKV LSMTSSGGVC FNDVVLQAGI PELPFGGVGT SGMGKYHGKA
GFDNFTHYKS VLKRPFWLDL NFRYPPYKLD LSLLNKLIG