Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_06351 |
Symbol | |
ID | 4776866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 599926 |
End bp | 601062 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640086142 |
Product | aldo/keto reductase family protein |
Protein accession | YP_001016652 |
Protein GI | 124022345 |
COG category | [R] General function prediction only |
COG ID | [COG1453] Predicted oxidoreductases of the aldo/keto reductase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.162815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCCAA TACGACGCCC CTTTGGTCGG GGACCGGCAG TGAGCCTGTT CACCCTTGGG ACGATGAGGG CAATCGGCTC AGCCGAGCAA ATGTATGGGG TTGTTAAAGC TGCCCAGGCC GCTGGCATTA ACCACATCGA AACCTCCCCG GCCTATGGCC AGGCAGAAAG TTTTCTTGGC ACTGCTCTGC GACAGCTGCA ACAAAACCAA GCTGAACCTT CTGGAGGCTG GGTCATCACC AGCAAACTTC TGCCAGGGCT CAGCCTGAAA GAAGGGCAGT GCGAATTGCA CAACCTTTTA GCGCGACTTG GACGACCCAA GCTTGAAAAC CTTGCAGTTC ATGGCCTCAA TCGCCCTGAG CACCTGGAGT GGGCCCTAAG GGGAGACGGC GCAGCGCTAA TGCGTTGGGC TGAGGAAGAG GATCTTGTTG TCCAAGTGGG ATTCACTAGC CATGGCTCAT CTCCCCTCAT CAAGGAAGCC TTAGCAAGCA GTCGTTTTCA ATTCTGCAGC CTGCATCTGC ACCTACTAGA TCCCGAACGT ATTCCCCTGG CACAGGAGGC CCTTGCATCA GGCATGGGTG TGATGGCAAT CTCCCCTGCT GATAAAGGAG GACGGCTTCA AGACCCAAGC CCAACCCTGG TTGAGGACTG CAGCCCACTC TCGCCTCTCC AACTTGCCTA TCGCTTCCTG CTGGCTGCAA AGATCAGCAC CCTCAGCCTT GGTGCTGCGC AGCCAGAAGA CCTGACCCTT GCTGCGCAGT TGGCCAACGC CGATGGGCCA CTCAATCAAC GCGAGCAAAG AGCTCTCAAC CAACTTCGCC AACAAGGTGA GCGCCGCCTA GGTGAAAACC GATGCGGCCA GTGCAAAGCC TGCCTGCCCT GCCCAAATTC TGTACCCATA CCAGATCTAC TACGACTACG AAATCTGGCC GTAGGCCATA ACCTTCAAGC TTTTACAGAA GAGCGTTACA ACCTGATCGG ACGAGCAGGA CACTGGTGGG AAGGTATTGA TGGCAGCGCC TGCGAGCGCT GTGGCGAATG CCTACCCCGC TGCCCCCATC ACCTGCCGAT CCCGGATCTC CTCGCTGACA CGCACCAACG CCTAGCGGCA GCTCCCAGGC GCAGGCTATG GGGTTGA
|
Protein sequence | MMPIRRPFGR GPAVSLFTLG TMRAIGSAEQ MYGVVKAAQA AGINHIETSP AYGQAESFLG TALRQLQQNQ AEPSGGWVIT SKLLPGLSLK EGQCELHNLL ARLGRPKLEN LAVHGLNRPE HLEWALRGDG AALMRWAEEE DLVVQVGFTS HGSSPLIKEA LASSRFQFCS LHLHLLDPER IPLAQEALAS GMGVMAISPA DKGGRLQDPS PTLVEDCSPL SPLQLAYRFL LAAKISTLSL GAAQPEDLTL AAQLANADGP LNQREQRALN QLRQQGERRL GENRCGQCKA CLPCPNSVPI PDLLRLRNLA VGHNLQAFTE ERYNLIGRAG HWWEGIDGSA CERCGECLPR CPHHLPIPDL LADTHQRLAA APRRRLWG
|
| |