Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11261 |
Symbol | |
ID | 5730406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1031174 |
End bp | 1032397 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285494 |
Product | putative lycopene beta cyclase |
Protein accession | YP_001551011 |
Protein GI | 159903667 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.479015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATAGCCT GTTCTGATGT ATTGGTTATG GGGGCTGGCC CAGCAGCTCT TTGTATTGCC TCGGAGTTAG TTCAGAAAGG ATTGAAAGTT AGCGCTCTTG CCTCGAATTC TCCAGATAGA TTATGGACTA ATACTTATGG AATTTGGGCA GAAGAACTTG AAAGCCTTGG GATGGCATCT TTATTAGGAA GCCGATGGAC TAATACTGTC AGTTACTTTG GAGATGGCGT TAAGGAGGAA GGATTAAAGC CCACTCTTCA TAATTTTGAT TATGGACTTT TTGATCAATC ACTCTTTCAA AAGAATCTTT TAGATAAATG TGATGGTATT AATTGGATAA TTGAAACTGC CGAAGATATA CGCTATAGGG ATTCGATAAC GGAGGTCATT TGTACCTCAG GCAAAATATA TCGAGCAAGA GTAGTTATTG ATGCTAGTGG ACATAGAAGT CCTTTCGTTA AGAGACCAGA CCATGGACCA ATTGCTCAAC AAGCTGCCTA TGGAATAGTT GGAAGGTTTA GCTCTCCTCC TGTTGAGAAA GATCAATTTG TTCTAATGGA TTTTCGACCT GATCATTTAA CTAAAGATGA ATTAGAAGAA CCACCTTCAT TTTTGTATGC AATGGATTTT GGTGAAGGAC TTTATTTTGT TGAAGAAACA TCTTTGGCTT GTGCTCCTCC ACTTACATGG AGCAAGTTGA AAGAAAGATT GCTTTTAAGG TTGTCTCATA GAGGAATCGA AATTCAAGAA GTTGTTCATG AAGAGCATTG CTTGTTCCCT ATGAATTTGC CACTGCCCTT TTTGAATCAA CCCCTCCTTG CCTTTGGAGG TGCTGCAAGT ATGGTTCATC CCGCTTCTGG CTATATGGTT GGTGCTCTTT TGAGGCGAGC TCCAGCATTG GCCGATGAGC TATCGAAAGC TATAACTAGC GATCCAAGTT TGGATTCGGC TCGCTTGGCT AAGCGAGGTT GGCAGGTTCT TTGGACTCCC GACCTCGTCT TAAGGCATCG CTTATATCAA TTTGGATTAA AAAGGTTAAT GAGCTTTGAT GAGACTCTTT TAAGAAGCTT CTTTACTTCT TTTTTTAAGC TCCCACAAGA TGAATGGTTT GGGTTTCTAG CTAATACGCT TCGCTTGCCT AGGTTGTTAA TAGTAATGAT TAGGTTATAT TTCCTTTCTC CAATCGGAGT AAAATTAGGA ATGGTTGGAT TGGTAAAAAA GTAG
|
Protein sequence | MIACSDVLVM GAGPAALCIA SELVQKGLKV SALASNSPDR LWTNTYGIWA EELESLGMAS LLGSRWTNTV SYFGDGVKEE GLKPTLHNFD YGLFDQSLFQ KNLLDKCDGI NWIIETAEDI RYRDSITEVI CTSGKIYRAR VVIDASGHRS PFVKRPDHGP IAQQAAYGIV GRFSSPPVEK DQFVLMDFRP DHLTKDELEE PPSFLYAMDF GEGLYFVEET SLACAPPLTW SKLKERLLLR LSHRGIEIQE VVHEEHCLFP MNLPLPFLNQ PLLAFGGAAS MVHPASGYMV GALLRRAPAL ADELSKAITS DPSLDSARLA KRGWQVLWTP DLVLRHRLYQ FGLKRLMSFD ETLLRSFFTS FFKLPQDEWF GFLANTLRLP RLLIVMIRLY FLSPIGVKLG MVGLVKK
|
| |