Gene Information |
Locus tag | P9303_25371 |
Symbol | |
ID | 4778176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2229664 |
End bp | 2231811 |
Gene Length | 2148 bp |
Protein Length | 715 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640088058 |
Product | short chain dehydrogenase |
Protein accession | YP_001018533 |
Protein GI | 124024226 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only; [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
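The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2229664
end_bp = 2231811


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2148 715
```

Under these assumptions the computed values agree with the reported Gene Length (2148 bp) and Protein Length (715 aa).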
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.639467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTGCC AAAACCGCTG GTCGGATGCC GAGGCACAAG CTGCCATCAA GTCTTACGCC GCGCAGGACG TCTCTGAAGA CCTGGCTCTT CGCACTTACA CAGCCCGTTT GCTCGGCTCC GATCCCCAGC TGGTGCTGCA TGGCGGAGGC AACACCTCGG TTAAAACCAG CTGCATAGGA TTGTTTGGCG ATCACATACC AGTGTTGTGC GTGAAGGGCT CAGGCTGGGA CCTATCAACC ATCGAGCCGG CCGGCCATCC TGCTGTGCGA TTGGAGAACT TGCAGGCCTT AAGAGATCTA TCTGCACTTA GCGACGAAGA CATGGTTGCA GCTCAACGCA GCAACCTGAT CGATCCATCG TCACCCAACC CTTCGGTTGA AGCACTACTG CACGCGTTCT TACCAAGCAA ATTCGTCGAT CACACCCACG CAGTAGCTGT TTTAGCCCTT GCAGATCAAC CAGATGCCGA ACAGATCTGC CGTGAACTGT ACGGTCGACG TGTCGCGATT GTTCCCTATG TCATGCCGGG CTTCCAACTA GCGTTGGCCG CCATCAAAGC CTACGAACAA GCAGAAGTAG AAGCAGCTCA AGCGGGAGTT GAACTTGAAG GGATGGTGCT TCTCAAGCAC GGCTTATTTT CGTTTGCTGC CACAGCACAA CAAAGCTACG AGCGAATGAT CAACTTGGTG CGTGAGGCGG AAGAACGTCT TGGGGAAACC CCAACGCTTT GCCTACCACC ACCAACCAAT CCAGCTCCTA AAAAAAACAT CGCTGCACTT CTGCTCCCAC TGTTACGTGG TGCCCTGGCT CAATCCGCAG CTGTGCATAA CGCTCCCCAA CGTTGGCTTA TGGAGTTGCG CTCAACCCCA CTGGCACTCC AACTAGTAAA CGACATTCAC CTCCAAGACT GGTCTCGCCG TGGAGTCGCT ACCCCCGACC ACGTGATCCG AACCAAACCC TGGCCTCTCA TTCTCAAAAA GCCTCCACAA CTCCAAGGAG ATGAAGCGAT TGAATCCTGC CCAGTGCTGG AGGAATGGCT CCACTCAGCC AAACTGGCAT TGGAGAAATA CATCAACTCA TATCAGGATT ACTTCGAGCG TCAGAATGCT CGCGTTGGAA GCCATAAACA ACACCTCGAT CCACTACCAA GGCTGATCGC AATCCCTGAA CTTGGCCTCG TTGGCCTAGG CCGTTCAACA GCAGAAGCCA ATGTCACCGC AGATATTGGT GAAGCCTGGG CCGCCACACT GATGGCAGCT GAATCAGTGG GACGTTTTCA ACCAGTCAAC GAGGCAGATA CCTTCGAGAT GGAGTACTGG AGTCTCGAAC AAGCAAAGCT CGGCAAGGGC AAAGAAGCCC CACTGGCACG CCATGTTGTC TTAGTCACTG GTGGTGGTGG TGGAATTGGT GCAGCGATCG CCCTTGCCTT CGCCAAGCAA GGTGCACAAG TTGTCGTACT TGACAAGAAT GGTGAAGCAG CAACAACGAC TGCCAAAGAA TGTGGCTCAA GCGCTCTCGG ACTGAAGTGC GACCTCACCA ATGCTGCTGA GGTTCATGAT GCATTCACGA CAATTGCAGC TTGCTTCGGG GGTTTAGACA TCGTGGTATC CAATGCTGGG GCGGCTTGGA GCGGAGACAT TGCCACTCTT CCAGAATCCA AGTTGCGAGC AAGCTTCGAG CTCAACCTAT TTGCTCACCA GCACGTTGCA CAAGCTGCAG TTCGCCTGTT TCGAGCCCAG GGCAATAGAA CGACAGAAAC CAGCAAATCC TTAGGCGGAC AGCTGCTCTT CAATATCAGC 
AAGCAAGCGC TAAACCCAGG CCCTGGTTTT GGAGCCTACG GAATTGCAAA AGCGGCATTG CTTGCACTGA TGAAGCAATA CGCCCTGGAA GAAGGGCCCT CAAGCATTCG CTGTAACGCC ATCAATGCAG ATCGGATTCG CTCCGGCCTG CTCGATCAGG CAATGATTCG AGAACGAGCG GAAGCGCGCG GCATCAGCGA AGCCAACTAC ATGGGTGGGA ACTTGCTCGG TGCAGAAGTC CGAGCAAGTG ATGTGGCGAA TGCTTTTGTA GCATTAGCCT TAATGCCACG AACCACTGGT GCATTACTGA CAGTAGACGG CGGAAATGTT GCGGCGATGG TGCGTTAA
|
Protein sequence | MTCQNRWSDA EAQAAIKSYA AQDVSEDLAL RTYTARLLGS DPQLVLHGGG NTSVKTSCIG LFGDHIPVLC VKGSGWDLST IEPAGHPAVR LENLQALRDL SALSDEDMVA AQRSNLIDPS SPNPSVEALL HAFLPSKFVD HTHAVAVLAL ADQPDAEQIC RELYGRRVAI VPYVMPGFQL ALAAIKAYEQ AEVEAAQAGV ELEGMVLLKH GLFSFAATAQ QSYERMINLV REAEERLGET PTLCLPPPTN PAPKKNIAAL LLPLLRGALA QSAAVHNAPQ RWLMELRSTP LALQLVNDIH LQDWSRRGVA TPDHVIRTKP WPLILKKPPQ LQGDEAIESC PVLEEWLHSA KLALEKYINS YQDYFERQNA RVGSHKQHLD PLPRLIAIPE LGLVGLGRST AEANVTADIG EAWAATLMAA ESVGRFQPVN EADTFEMEYW SLEQAKLGKG KEAPLARHVV LVTGGGGGIG AAIALAFAKQ GAQVVVLDKN GEAATTTAKE CGSSALGLKC DLTNAAEVHD AFTTIAACFG GLDIVVSNAG AAWSGDIATL PESKLRASFE LNLFAHQHVA QAAVRLFRAQ GNRTTETSKS LGGQLLFNIS KQALNPGPGF GAYGIAKAAL LALMKQYALE EGPSSIRCNA INADRIRSGL LDQAMIRERA EARGISEANY MGGNLLGAEV RASDVANAFV ALALMPRTTG ALLTVDGGNV AAMVR
|
| |
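The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes translation table 11 (the value in the Translation table field) assigns amino acids as in the compact NCBI codon string used below, and it only demonstrates the approach on the first and last blocks of the gene sequence rather than on the full 2148 bp. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, listed in TCAG codon
# order (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence in this record.
head = "ATGACGTGCC AAAACCGCTG GTCGGATGCC"
tail = "GCGGCGATGG TGCGTTAA"

print(translate(head))  # MTCQNRWSDA
print(translate(tail))  # AAMVR
```

The two fragments translate to the first ten and last five residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the reported GC content of 54%.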