Gene A9601_03571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_03571 
Symbol 
ID4717051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp330130 
End bp331521 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content29% 
IMG OID640078066 
Productputative aldehyde dehydrogenase 
Protein accessionYP_001008752 
Protein GI123967894 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT CTGATAAATT TCATTTAGAA GACATCTATA AATTAAAAAA TACAGTTCTC 
ACTGGTAAAA CTGAAGATAT AAAATGGCGG ATCCATCATA TCAATATAGT TTCTAAACTT
TTAGATGAAA ATAAAAAAGA GATAATTAAA TCACTTTTTG TTGATCTCGG CAAATCTGAA
ATTGAAGGGC TTTCAGAAAT CCTTTTAGTG AAAGAAGAAA TTTCACTTAT AAAAAAGAAA
CTCAATTCTT GGATGCGACC AAAAAAGATT GATACCCCTT TTTATCTTTT TCCATCATCC
TCCAAAGTTA TTTATGAACC TCTTGGGTGT GTCTTAATTC TTGGTCCTTA TAATTATCCA
TTACTTTATA TTTTAAAGCC ATTGGTAAAT ATTTTCTCAG CAGGAAATAC AGCAGTTATA
AAACCATCAG AGAAATGTCC TGCGACCTCA AAACTTATTA AAAAGCTTAC TTCCAAATAT
TTCAGTAAAG ATGTCCTAAT GACAGTAGAG GGTGATAATA AACAATCCAT AAAATTAATT
GAACAAAATT TTGACCACAT TTTTTTTACA GGAAGTACTA AAACTGGAAA ATCTATAATG
AAATTAGCTG CAAAAAACTT AACTCCATTA ACTCTTGAGT TAAGCGGAAC AAATCCTGTA
ATTGTTTTCA AGAATGCAAA TTTAGAAGTG GCTGCAAAAA GAATTGTTTG GGGTAAATTT
TTTAATTCTG GTCAATCGTG CATGGCTCCG AATCATATCT TTGTAGATAA AGAAATTGAA
AATATTTTTA TAGAAAAATT AAAGAAATAC ATAATAAGTT TTTACGGAGA TAATCCAATT
ATTTCCGAAA ACCTATCAAA ATTGGAGAAA AAACAATTTA CATCAACTGT AGAAATTCTC
AAACAATATG AAAAAGAAAA AAGAATTTTA TTTGGGGGGA CTTTTAGTAA AAAAAAGTTG
AAAATATCTC CTACAATTTT GAGAACTAAA TTAAATGAAA AAGATATTTT GCAGAAAGAA
TTATTCAGTT CACTACTTCC TGTAGTTGGA ATTAATGGTA TGGAATCAGC TTTAACACAG
ATTAGTCTAA CATCAAAACC CTTAGCAATC TACTTATTTG GAGGCAATAA AAAAATCCAT
AATCATATTT CAAAAGTAAC CAGCTCTGGA ACAATTTGTA TAAATGATGT GATGTTACCA
GTCCTTATTC CAAATTTACC TTTTGGAGGC GTTGGGCAAA GTGGTATTGG CAAATTTCAT
GGAGAAGAAG GCTTTCGAAA TTTTTCAAAT CAAAAATCTA TTACTTTTAA AGGTTTTTTA
TTTGATTCAA ATCTGCGATA TCCCCCCTAT GAAAGAGTAA AGAAATTTTT AAAGTTTATT
TTTCAGATTT AA
 
Protein sequence
MKISDKFHLE DIYKLKNTVL TGKTEDIKWR IHHINIVSKL LDENKKEIIK SLFVDLGKSE 
IEGLSEILLV KEEISLIKKK LNSWMRPKKI DTPFYLFPSS SKVIYEPLGC VLILGPYNYP
LLYILKPLVN IFSAGNTAVI KPSEKCPATS KLIKKLTSKY FSKDVLMTVE GDNKQSIKLI
EQNFDHIFFT GSTKTGKSIM KLAAKNLTPL TLELSGTNPV IVFKNANLEV AAKRIVWGKF
FNSGQSCMAP NHIFVDKEIE NIFIEKLKKY IISFYGDNPI ISENLSKLEK KQFTSTVEIL
KQYEKEKRIL FGGTFSKKKL KISPTILRTK LNEKDILQKE LFSSLLPVVG INGMESALTQ
ISLTSKPLAI YLFGGNKKIH NHISKVTSSG TICINDVMLP VLIPNLPFGG VGQSGIGKFH
GEEGFRNFSN QKSITFKGFL FDSNLRYPPY ERVKKFLKFI FQI