Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01891 |
Symbol | crtQ |
ID | 4779287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 176722 |
End bp | 178182 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640083453 |
Product | zeta-carotene desaturase |
Protein accession | YP_001014018 |
Protein GI | 124024902 |
COG category | [S] Function unknown |
COG ID | [COG3349] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02732] carotene 7,8-desaturase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.905208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.646889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAATCG CAATAATAGG TGCAGGTTTA GCAGGATTGA CTGCTGCAGT TGACCTTGTT GATGAAGGAC ATGACGTCGA CCTATACGAA GCAAAACCTT TTATAGGAGG TAAAGTTGGA AGCTGGGAAG ATTCCGATGG CAATCATATA GAGATGGGTT TGCATGTCTT CTTTTTCAAC TATGCCAATC TGTTTTCATT AATGAGGAAA GTAGGCGCGT TTGAAAATCT TTTACCAAAA GACCACACTC ACCTTTTTGT TAACAAGGGC GGTGACATTA AATCACTTGA TTTTCGATTT TTTGCAGGAG CTCCTTTTAA TGGCTTAAAA GCCTTCTTCA CTACACCTCA ATTAAATTTG ATTGACAAGC TAAGAAATGC TCTTGCTCTT GGCACAAGTC CGATAGTAAG AGGGCTGATT GACTACGAGG GTGCGATGAA AACAATACGG TCACTAGATT CTATAAGCTT TCAAAAATGG TTTCTTAACC ATGGAGGAAG CCTAAATAGT ATCGAAAGGA TGTGGAATCC GATTGCATAT GCGCTGGGAT TCATTGATTG CGAAGCCATT TCTGCAAGAT GCATGCTTAC CATCTTTATG ATGTTCGCTT CTAAGACAGA GGCCTCTAAG CTCAACCTAT TGAAGGGATC ACCCCACAAA TGGCTCACTA AGCCAATACT CGATTACATT GAACAAAGAG GCGGAAAACT TCACTTAGAA AATATTGTTA AAGAAATTCA TTCAGAGGAT TCTGATCATC CATCTGTGAC CGGAATAACC CTTCAAACTC CCGAGGGTGA ACAAACAATT AAAGCCGACA AATATCTAGC TGCTTGCGAT GTGTCTGGTA TAAAAAGAAT AATTCCTAGA TCATGGAGAC GTTTTAAAGA GTTCGACTCA CTTTTTAAGC TTGATGCTGT TCCAGTAGCG ACAGTACAAC TTAGATACGA TGGCTGGGTA ACGGAGATCA ATAATCAACA AGCTCAAAAA AACCTAGAAA GTCCATCTGG TTTAGACAAC TTGTTATATA CAGCCGATGC TGATTTTAGT TGTTTTGCAG ATTTAGCTTT ATCAAGCCCA GAAGACTATA AAAAAGAAGG TCAGGGCTCT CTACTTCAAT GTGTGTTAAC GCCTGGAGAT CCATGGATTA CCAAGTCCTC TGATGAAATC GTAAAACATA CTGATTTACA GGTCAGGACA CTTTTCCCTT CTTCAAGAGG TTTAAAGCTT TTGTGGAGCA ATGTGGTCAA GGTTTCTCAC TCTCTCTACA GGGAGGCTCC TGGAATGGAG CCATATAGAC CAGATCAAAA AACTTCATTT AGTAATTTTT TCTTAGCAGG CAGTTACACA AAACAGGACT ACATTGACTC TATGGAAGGA GCAACAATGA GTGGACATCT TGCTGCTTCA GCAATGCTCT CAAAGTCTGT TTCACTAGCA AAAAATTCTT CGGTTGCTTA A
|
Protein sequence | MKIAIIGAGL AGLTAAVDLV DEGHDVDLYE AKPFIGGKVG SWEDSDGNHI EMGLHVFFFN YANLFSLMRK VGAFENLLPK DHTHLFVNKG GDIKSLDFRF FAGAPFNGLK AFFTTPQLNL IDKLRNALAL GTSPIVRGLI DYEGAMKTIR SLDSISFQKW FLNHGGSLNS IERMWNPIAY ALGFIDCEAI SARCMLTIFM MFASKTEASK LNLLKGSPHK WLTKPILDYI EQRGGKLHLE NIVKEIHSED SDHPSVTGIT LQTPEGEQTI KADKYLAACD VSGIKRIIPR SWRRFKEFDS LFKLDAVPVA TVQLRYDGWV TEINNQQAQK NLESPSGLDN LLYTADADFS CFADLALSSP EDYKKEGQGS LLQCVLTPGD PWITKSSDEI VKHTDLQVRT LFPSSRGLKL LWSNVVKVSH SLYREAPGME PYRPDQKTSF SNFFLAGSYT KQDYIDSMEG ATMSGHLAAS AMLSKSVSLA KNSSVA
|
| |