Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50101 |
Symbol | LUT1-1 |
ID | 7198820 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 41103 |
End bp | 43229 |
Gene Length | 2127 bp |
Protein Length | 644 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | lutein deficient 1-like protein |
Protein accession | XP_002185034 |
Protein GI | 219129729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACAGTCAT ATAGATACTT CTAGAGTGCT TCCTCTTGTG TTAGTCAATT TCGTTTGCAT CCATCGCTAC GGTGAGAATG TATTCCATAA CTTTCTTGTC CTTAACAGCG GTGGTTGTCG TTGCTTGCTT CTTTCCGAAA CGCACACAAT CGCTAGTCGT CCCATTCCCG TCTTCAGCTG TCCGTTGTTG TTTCGGCATT CACCGTCAAT CCCGCGTTCT GACATCTGCC TTGTTCTCCA CAACCGAGGA CAAGACCGAC GAACAGACGG ACAATAAAGC TGGCGAGCCG GCAACCATGA AAGGCGAAGT CGCATCATCC GTCATAACAA CCGAAAAGAA TCCAGACGCC GAGGGTCTCC CATGGTGGTG GGAAGTGGTA TGGGATCTTG ACATAATGCA AGTTGGCAAG TCAGGTGAGG AGATTTCCTT TGGTGATTCG GCAAATGTCT TGCGGACGAA CATTGAGCAG ATCTACGGAG GATTTCCCAG TCTCGACGGC TGTCCCCTAG CCGAAGGAGA GCTTGCAGAC ATTGGCGATG GTACGATGTT TATTGGGCTA CAGAACTACT ACCGGAACTA CGGAAGGTAT GACGAAAACA AATTTCGTGT GCGTAGAGGC ATTCTCATTT GTATTGTGAT ATCGCCTTCG CATCTCACAC TCCGCTTTCT CTTCCTCAGT CCATACAAGC TGTGCTTCGG TCCAAAATCG TTTCTCGTAA TATCTGATCC TGTACAAGCG AAACACATTC TCAAGGATGC CAACACCAAC TACGATAAGG GCGTTCTGGC CGAAATTCTC GAACCGATCA TGGGAAAAGG CCTGATTCCG GCTGATCCCG AAACATGGAG CATCCGCCGC CGACAGATTG TTCCAGCCTT TCACAAAGCA TGGCTGGAAC ACATTGTCGG CCTTTTCGGC TATTGCAACC AACCTTTAAT TGATACACTC AACAAGCGCG TTGACGGAGA TGGCAAGGTA GAAATGGAAT CCCTCTTTTG TTCTGTCGCA TTGGATATTA TTGGCCTGTC TGTTTTCAAT TACGAGTTTG GTTCCGTAAC TCAAGAGTCC CCTGTAATCA AAGCGGTGTA TTCGGCGCTC GTGGAGGCGG AACATCGCTC CATGACCCCA GCCCCGTACT GGAATTTACC TCTGGCCAAT CAACTTGTGC CCCGCCTCCG TAAATTCAAC AGTGATCTCA AACTTTTGAA CGATGTTTTG GACGACCTCA TTACCCGGGC GAAACAAACG CGGACAGTGG AAGATATCGA AGAGCTCGAA AATCGAAACT ATAATGAGGT CCAAGATCCC TCCTTGCTAC GATTTTTAGT CGACATGCGT GGTGCTGATA TTGACAACAA ACAATTGCGG GACGACCTGA TGACTATGTT GATTGCCGGG CATGAAACAA CCGCTGCTGT CTTAACTTGG GCGCTCTTCG AACTTACCAA GAACCCCGAA ATTATGAAAG AGCTGCAAGA CGAAATCGAT GAAGTCGTTG GAGATCGCAT GCCCAACTAC GAAGACATCA AGAAAATGAA ATTCTTACGC CTCGTGGTCG CAGAGACTTT GCGAATGTAC CCCGAGCCTC CTTTGTTAAT CCGCCGATGC CGTACTCCCG ATGAGCTTCC ACAGGGGGCC GGCCGTGAAG CTAAAGTAAT TCGAGGCATG GACATATTTA TGGCCGTGTA CAACATTCAC CGCGACGAAC GGTTCTGGCC TAGTCCCGAT ACCTTCGACC CATTACGTTT CACACGGTCG CATTCCAACC CGGACGTTCC GGGTTGGGCA GGTTTTGACC CGAAAAAATG GGAGGGAAAA TTGTACCCGA ATGAGGTCGC GTCGGACTTT GCCTTTTTGC CCTTTGGTGG TGGCGCCCGA AAATGTGTCG GAGACGAGTT TGCGATACTT GAGGCTACAG TGACGCTTGC CATGGTGCTA CGACGATTCG AGTTTTCTTT CGACGAGTCC AAGTTCGAAG GCAAGGATGA CATTTTGAGC TCGGCCCAAG GACTGAACCA TCCTGTTGGT ATGCGGACGG GGGCAACCAT ACACACTCGA AATGGGCTTC ATCTAGTGGT CGAGAAGCGT GGGGTACCGA AATAATCATA AAAGATTGTT GGAAGTC
|
Protein sequence | MYSITFLSLT AVVVVACFFP KRTQSLVVPF PSSAVRCCFG IHRQSRVLTS ALFSTTEDKT DEQTDNKAGE PATMKGEVAS SVITTEKNPD AEGLPWWWEV VWDLDIMQVG KSGEEISFGD SANVLRTNIE QIYGGFPSLD GCPLAEGELA DIGDGTMFIG LQNYYRNYGS PYKLCFGPKS FLVISDPVQA KHILKDANTN YDKGVLAEIL EPIMGKGLIP ADPETWSIRR RQIVPAFHKA WLEHIVGLFG YCNQPLIDTL NKRVDGDGKV EMESLFCSVA LDIIGLSVFN YEFGSVTQES PVIKAVYSAL VEAEHRSMTP APYWNLPLAN QLVPRLRKFN SDLKLLNDVL DDLITRAKQT RTVEDIEELE NRNYNEVQDP SLLRFLVDMR GADIDNKQLR DDLMTMLIAG HETTAAVLTW ALFELTKNPE IMKELQDEID EVVGDRMPNY EDIKKMKFLR LVVAETLRMY PEPPLLIRRC RTPDELPQGA GREAKVIRGM DIFMAVYNIH RDERFWPSPD TFDPLRFTRS HSNPDVPGWA GFDPKKWEGK LYPNEVASDF AFLPFGGGAR KCVGDEFAIL EATVTLAMVL RRFEFSFDES KFEGKDDILS SAQGLNHPVG MRTGATIHTR NGLHLVVEKR GVPK
|
| |