Gene Information |
Locus tag | NATL1_20111 |
Symbol | |
ID | 4779537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1654438 |
End bp | 1655955 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085303 |
Product | phytoene dehydrogenase and related proteins |
Protein accession | YP_001015831 |
Protein GI | 124026716 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02733] C-3',4' desaturase CrtD |
|
|
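The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in Protein Length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 1654438
end_bp = 1655955

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the unspliced CDS is a whole number of codons,
# and the final codon is a stop codon that adds no amino acid.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
# prints: 1518 506 0 505
```

Under these assumptions the computed values agree with the reported Gene Length (1518 bp) and Protein Length (505 aa).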
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.428487 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG AATCTATCAT TGTTGTTGGG GGAGGGATAG CTGGCTTAAC AGGTGCTGCT CTTCTCTCTA AAGAGGGTTA TCAGGTAACT TTGGTCGAAG CACATAGTCA ATTAGGTGGT TGTGCAGGAA CTTTTAAAAG AGGTTCTTAT ACCTTTGATG TTGGCGCTAC CCAAGTTGCA GGTCTTGAGA GAGGAGGAAT TCATCATCGT TTATTTAATT ATTTAGATAT TCCTTTACCT GATGCAAAAA TTTTGGATCC TGGCTGTTCA GTCACTCTCG GTGATGGGAG TAGGCCAATC AATCTTTGGC ATGATCCATT GAGATGGCAG AAAGAAAGAC AAGAACAGTT TCCTGGGAGT GAAATCTTCT GGTCATTATG TTCTAAAATT CATGAAAGTA ACTGGGAATT TGTCGAAAGA GATCCAATAC TTCCGGTAAG AAATTTTTGG GATTTAAGTC AATTAATTAG AGCCATACGT CCTTCAAATC TTTTTACTGG TTTTCTGAGT AAGTTAACTA TTACAGACTT GCTTAAAATA ACTGGTTGTC ATAAAGATAG ACGTCTACGA AGTTTTTTAG ACCTTCAATT AAAACTTTAC TCTCAAGAGC CAGCAAGTAG AACGGCAGCT TTATACGGTG CAACTGTTCT TCAAATGGCC CAATCTCCTA GAGGTCTATG GCATCTTCAT GGATCAATGC AAATTCTTAG CGATTTGTTG AAAGATAGTT TCTTGAGAGA TGGTGGAAGC CTTTTGATTG GGCATAGAGT GACCAAAATA ATAAGGAAAG AAAATTCAAA TATTTTTGAT GTCAATGTGA TTGATAGAAG AAAGAATTTG ATACGGATGA AAGCATCAGA TATTGTTTTT AGCTTGCCTC CTCAATCACT TTTAGATTTA ATTCCTATTG ATGGAGGTTT ATCTACAACA TACCGTGAAA GTATAAAAAA TTTACCTAAA CCCAGTGGTG CTATTGTATT TTACGGAGCT CTCCGTCGTG TAGATTTGCC CGTTGATTGT CCAGGTCATA TTCAAGTATT TGATGAGCAT TTTGGTTCTT TATTTATTTC TATCAGTATG GAAAATGATC AACGTGCACC AGTTGGGATG GCCACTTTAA TAGCAAGTGT ATTTGTAGAT ATTGATCAAT GGTCTAATCT AGATAGTCAA TCATATATCA GAAAAAAAAA TGTTGTATCG AAACAGATTA GAGCTATTTT AGATCACAAG TTTGATTTGC TAGAGACGAG TTGGGATCAT CAAGAACTTT CTACCCCAAG AAGTTTTGAA AGGTGGACTG GACGTCCTAG TGGAATAGTC GGAGGGCTTG GTCAACATCC AGATCAATTT GGTCCTTTTG GCCTGTCAAG TAGAACCCCT CTCAGAGGTT TATGGCTTTG TGGAGATTCT ATATACCCTG GTGAAGGTAC CGCTGGCGTA AGTCAATCAG CTTTGATGGT TGTAAGACAA TTATTGGAAT CTAAAGGTAG ACATCTAAAC ATCCCTGTCT TTAATTAG
Protein sequence | MSEESIIVVG GGIAGLTGAA LLSKEGYQVT LVEAHSQLGG CAGTFKRGSY TFDVGATQVA GLERGGIHHR LFNYLDIPLP DAKILDPGCS VTLGDGSRPI NLWHDPLRWQ KERQEQFPGS EIFWSLCSKI HESNWEFVER DPILPVRNFW DLSQLIRAIR PSNLFTGFLS KLTITDLLKI TGCHKDRRLR SFLDLQLKLY SQEPASRTAA LYGATVLQMA QSPRGLWHLH GSMQILSDLL KDSFLRDGGS LLIGHRVTKI IRKENSNIFD VNVIDRRKNL IRMKASDIVF SLPPQSLLDL IPIDGGLSTT YRESIKNLPK PSGAIVFYGA LRRVDLPVDC PGHIQVFDEH FGSLFISISM ENDQRAPVGM ATLIASVFVD IDQWSNLDSQ SYIRKKNVVS KQIRAILDHK FDLLETSWDH QELSTPRSFE RWTGRPSGIV GGLGQHPDQF GPFGLSSRTP LRGLWLCGDS IYPGEGTAGV SQSALMVVRQ LLESKGRHLN IPVFN
|
| |
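The sequence fields above are split into space-separated groups of 10 characters. The sketch below shows one way to rejoin such a field, compute a GC percentage, and translate DNA with the amino acid assignments of translation table 11 (which match the standard code). The helper names `ungroup`, `gc_percent` and `translate` are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits or ambiguous bases such as N. Only the first three and last two groups of the Gene sequence field are used here, to keep the example short.

```python
def ungroup(seq_field):
    """Remove the whitespace that splits a sequence field into groups."""
    return "".join(seq_field.split())

def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = dna.upper()
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First three and last two groups of the Gene sequence field above.
first_groups = "ATGTCTGAAG AATCTATCAT TGTTGTTGGG"
last_groups = "ATCCCTGTCT TTAATTAG"

print(translate(ungroup(first_groups)))  # prints: MSEESIIVVG
print(translate(ungroup(last_groups)))   # prints: IPVFN*
```

The two outputs match the start and end of the Protein sequence field, with the trailing `*` marking the TAG stop codon. Passing the complete Gene sequence field through `ungroup` and `gc_percent` gives a value that can be compared with the reported GC content.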