Gene P9303_21711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21711 
Symbol 
ID4776458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1926550 
End bp1927929 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content56% 
IMG OID640087681 
Productputative aldehyde dehydrogenase 
Protein accessionYP_001018171 
Protein GI124023864 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0450332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCCCTG AGAACTTTCT GCTGAGTGGG CTACGCAAGC CAGTCATTTC GGGCCTCACA 
AGACCAGAAG CCTGGCGCAG AAAGCAACTC AAGCAAATTG AGGTCCTCAT CGAAAAGCAC
CAAGACGAGG TACTTGATGC CCTGGCAACG GATCTAGGCA AACCACCAAC CGAGGCATTG
TTTGAGCTGA TCGCGCTGCG TGGAGAGCTC AAACTGGCTC AACGGCAGCT CAGCCGCTGG
ATGCAACCCA GGCACGTACA AGTGCCTTTA GCCCATCAGC CAGGTCAAGC CGAGGTGATC
CTCGACCCCC TGGGTTGCGT TCTGATCATC GGCCCCTGGA ATTACCCCTT CTCTCTAACC
CTCCAACCAC TGATTAGTGC TCTGGCAGCA GGCAATACAG CAGTGCTCAA ACCTTCCGAG
CATGCCCCTG CCACCTCTCG ACTGATCGCC CATGTGATCC CTCAACATTT CTCCAGCGAG
GTCGTACAGG TGATCGAGGG GGATGGGGCC ATTGCAGCGG CACTGATCAA GCAACCCTTT
GATCACATCT TCTTTACCGG CAGCGGCGCC ATCGGCCAGA AGGTCATGGC TGCTGCCGCC
GAACATCTCA CTCCGGTGAC CCTGGAGCTA GGAGGGAAAA GCCCAGCCAT CGTGATTGAT
GGCGCCGATC TCTCGGTCAC GGCTAGGCGA TTGGTATGGG GGAAGGGTCT CAATGCTGGT
CAAACCTGCA TCGCCCCAGA CCATCTACTT ATCCAAGAAC AACTCAAACA GCCTCTACTG
CAGGCGATGA AAGGAGCCAT TACTGAGCTC TATGGAGGCG ATCCGCTGCG ATCACCCCAC
CTCGCAAAAA TCATCAACGA TTGTCATTTC CAGCGACTAC AACACTTGCT TGATCAAGCA
AAGCAGCGCG GCAAGGTGCT CTCAGGCGGA CAAATTGACC CCGATCAAAG ACGCATCGCT
CCCACTCTGA TTGACGTAGA CAAGCGCGAC GATCCGCTGA TGGAGGAAGA GCTCTTCGGC
CCACTGCTGC CTGTCATCAG CGTGCACAGC CTTAATGAAG CTCTCGCTGA GGTCCGACAA
CAACCAAAGC CCCTGGCCCT CTATCTCTTT GGCGGAACAC ATGCCGACCA ACAACAGCTC
CTCAACACAA CCAGCTCAGG CGGAGTTTGT TTCAACGATG TGGTGATGCA TGTAGGCATT
CCTGAGCTGC CCTTTGGTGG AGTCGGGGCC AGTGGCATGG GGCGTTATCA CGGCCTGGCT
GGTTTCGAAA CCTTTTCCCA TCAAAAGTCT GTCTTACGTC GCCCCTTCTG GCTAGATCTC
AAATTGCGCT ACCCCCCCTA CAAGGCCAAT CTGGCGATGC TCAAAAAGCT GCTGGGATAA
 
Protein sequence
MLPENFLLSG LRKPVISGLT RPEAWRRKQL KQIEVLIEKH QDEVLDALAT DLGKPPTEAL 
FELIALRGEL KLAQRQLSRW MQPRHVQVPL AHQPGQAEVI LDPLGCVLII GPWNYPFSLT
LQPLISALAA GNTAVLKPSE HAPATSRLIA HVIPQHFSSE VVQVIEGDGA IAAALIKQPF
DHIFFTGSGA IGQKVMAAAA EHLTPVTLEL GGKSPAIVID GADLSVTARR LVWGKGLNAG
QTCIAPDHLL IQEQLKQPLL QAMKGAITEL YGGDPLRSPH LAKIINDCHF QRLQHLLDQA
KQRGKVLSGG QIDPDQRRIA PTLIDVDKRD DPLMEEELFG PLLPVISVHS LNEALAEVRQ
QPKPLALYLF GGTHADQQQL LNTTSSGGVC FNDVVMHVGI PELPFGGVGA SGMGRYHGLA
GFETFSHQKS VLRRPFWLDL KLRYPPYKAN LAMLKKLLG