Gene P9303_25371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_25371 
Symbol 
ID4778176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2229664 
End bp2231811 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content54% 
IMG OID640088058 
Productshort chain dehydrogenase 
Protein accessionYP_001018533 
Protein GI124024226 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.639467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTGCC AAAACCGCTG GTCGGATGCC GAGGCACAAG CTGCCATCAA GTCTTACGCC 
GCGCAGGACG TCTCTGAAGA CCTGGCTCTT CGCACTTACA CAGCCCGTTT GCTCGGCTCC
GATCCCCAGC TGGTGCTGCA TGGCGGAGGC AACACCTCGG TTAAAACCAG CTGCATAGGA
TTGTTTGGCG ATCACATACC AGTGTTGTGC GTGAAGGGCT CAGGCTGGGA CCTATCAACC
ATCGAGCCGG CCGGCCATCC TGCTGTGCGA TTGGAGAACT TGCAGGCCTT AAGAGATCTA
TCTGCACTTA GCGACGAAGA CATGGTTGCA GCTCAACGCA GCAACCTGAT CGATCCATCG
TCACCCAACC CTTCGGTTGA AGCACTACTG CACGCGTTCT TACCAAGCAA ATTCGTCGAT
CACACCCACG CAGTAGCTGT TTTAGCCCTT GCAGATCAAC CAGATGCCGA ACAGATCTGC
CGTGAACTGT ACGGTCGACG TGTCGCGATT GTTCCCTATG TCATGCCGGG CTTCCAACTA
GCGTTGGCCG CCATCAAAGC CTACGAACAA GCAGAAGTAG AAGCAGCTCA AGCGGGAGTT
GAACTTGAAG GGATGGTGCT TCTCAAGCAC GGCTTATTTT CGTTTGCTGC CACAGCACAA
CAAAGCTACG AGCGAATGAT CAACTTGGTG CGTGAGGCGG AAGAACGTCT TGGGGAAACC
CCAACGCTTT GCCTACCACC ACCAACCAAT CCAGCTCCTA AAAAAAACAT CGCTGCACTT
CTGCTCCCAC TGTTACGTGG TGCCCTGGCT CAATCCGCAG CTGTGCATAA CGCTCCCCAA
CGTTGGCTTA TGGAGTTGCG CTCAACCCCA CTGGCACTCC AACTAGTAAA CGACATTCAC
CTCCAAGACT GGTCTCGCCG TGGAGTCGCT ACCCCCGACC ACGTGATCCG AACCAAACCC
TGGCCTCTCA TTCTCAAAAA GCCTCCACAA CTCCAAGGAG ATGAAGCGAT TGAATCCTGC
CCAGTGCTGG AGGAATGGCT CCACTCAGCC AAACTGGCAT TGGAGAAATA CATCAACTCA
TATCAGGATT ACTTCGAGCG TCAGAATGCT CGCGTTGGAA GCCATAAACA ACACCTCGAT
CCACTACCAA GGCTGATCGC AATCCCTGAA CTTGGCCTCG TTGGCCTAGG CCGTTCAACA
GCAGAAGCCA ATGTCACCGC AGATATTGGT GAAGCCTGGG CCGCCACACT GATGGCAGCT
GAATCAGTGG GACGTTTTCA ACCAGTCAAC GAGGCAGATA CCTTCGAGAT GGAGTACTGG
AGTCTCGAAC AAGCAAAGCT CGGCAAGGGC AAAGAAGCCC CACTGGCACG CCATGTTGTC
TTAGTCACTG GTGGTGGTGG TGGAATTGGT GCAGCGATCG CCCTTGCCTT CGCCAAGCAA
GGTGCACAAG TTGTCGTACT TGACAAGAAT GGTGAAGCAG CAACAACGAC TGCCAAAGAA
TGTGGCTCAA GCGCTCTCGG ACTGAAGTGC GACCTCACCA ATGCTGCTGA GGTTCATGAT
GCATTCACGA CAATTGCAGC TTGCTTCGGG GGTTTAGACA TCGTGGTATC CAATGCTGGG
GCGGCTTGGA GCGGAGACAT TGCCACTCTT CCAGAATCCA AGTTGCGAGC AAGCTTCGAG
CTCAACCTAT TTGCTCACCA GCACGTTGCA CAAGCTGCAG TTCGCCTGTT TCGAGCCCAG
GGCAATAGAA CGACAGAAAC CAGCAAATCC TTAGGCGGAC AGCTGCTCTT CAATATCAGC
AAGCAAGCGC TAAACCCAGG CCCTGGTTTT GGAGCCTACG GAATTGCAAA AGCGGCATTG
CTTGCACTGA TGAAGCAATA CGCCCTGGAA GAAGGGCCCT CAAGCATTCG CTGTAACGCC
ATCAATGCAG ATCGGATTCG CTCCGGCCTG CTCGATCAGG CAATGATTCG AGAACGAGCG
GAAGCGCGCG GCATCAGCGA AGCCAACTAC ATGGGTGGGA ACTTGCTCGG TGCAGAAGTC
CGAGCAAGTG ATGTGGCGAA TGCTTTTGTA GCATTAGCCT TAATGCCACG AACCACTGGT
GCATTACTGA CAGTAGACGG CGGAAATGTT GCGGCGATGG TGCGTTAA
 
Protein sequence
MTCQNRWSDA EAQAAIKSYA AQDVSEDLAL RTYTARLLGS DPQLVLHGGG NTSVKTSCIG 
LFGDHIPVLC VKGSGWDLST IEPAGHPAVR LENLQALRDL SALSDEDMVA AQRSNLIDPS
SPNPSVEALL HAFLPSKFVD HTHAVAVLAL ADQPDAEQIC RELYGRRVAI VPYVMPGFQL
ALAAIKAYEQ AEVEAAQAGV ELEGMVLLKH GLFSFAATAQ QSYERMINLV REAEERLGET
PTLCLPPPTN PAPKKNIAAL LLPLLRGALA QSAAVHNAPQ RWLMELRSTP LALQLVNDIH
LQDWSRRGVA TPDHVIRTKP WPLILKKPPQ LQGDEAIESC PVLEEWLHSA KLALEKYINS
YQDYFERQNA RVGSHKQHLD PLPRLIAIPE LGLVGLGRST AEANVTADIG EAWAATLMAA
ESVGRFQPVN EADTFEMEYW SLEQAKLGKG KEAPLARHVV LVTGGGGGIG AAIALAFAKQ
GAQVVVLDKN GEAATTTAKE CGSSALGLKC DLTNAAEVHD AFTTIAACFG GLDIVVSNAG
AAWSGDIATL PESKLRASFE LNLFAHQHVA QAAVRLFRAQ GNRTTETSKS LGGQLLFNIS
KQALNPGPGF GAYGIAKAAL LALMKQYALE EGPSSIRCNA INADRIRSGL LDQAMIRERA
EARGISEANY MGGNLLGAEV RASDVANAFV ALALMPRTTG ALLTVDGGNV AAMVR