Gene Information |
Locus tag | PHATRDRAFT_50052 |
Symbol | |
ID | 7198744 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 254801 |
End bp | 258318 |
Gene Length | 3518 bp |
Protein Length | 932 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184930 |
Protein GI | 219129509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
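The coordinate and length fields above are consistent with each other if Start bp and End bp are read as 1-based, inclusive positions on the replicon: 258318 - 254801 + 1 = 3518 bp. The sketch below shows that arithmetic, plus one common way to compute a GC percentage from a sequence shown in space-separated blocks. The function names `gene_length` and `gc_percent` are hypothetical helpers for illustration, not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the reported values.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record above.
    print(gene_length(254801, 258318))  # 3518, matching the Gene Length field
```

Because the gene lies on the minus strand, Start bp and End bp still describe the same interval; only the reading direction of the sequence changes, so the length calculation is unaffected.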
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0332007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAAACAGTT CACAAATCCT TGTAGCTTCT CAATCGATAG TTTTGCTCTA CCGTACGTGA CATGAGACGA ATACCAACCT ACGTTCTCCC TTGGCTGTGG CTTTGTTCCG TTCTCCAGTC GCCTATTCCC ACAGTATCAT TCGTTCCGCC CTTCAATCAT GCTTCGTCAT CATCGGCATC ATCATCCCAC CTGCATGCAC GCTATACTCC ACCAGTTTCA CAAGTGGTAA TGCCGCCACC ACCACCTCCA CCACCGCCCA TTCCAGAACC ACCCCAACTC GATGGAATCG ACCGGATCTT ACAGAATCTT GCGCAGAAGT TGGATTCTTC GCTCCCCGAT TGGAAGAACT TGCATTCGCC GGTTGGAAAG GATGTACAGG GGCCTTTGAT TGAGTCTGTT GTCTCAATCC AACGTCAGCT GGGACAACTG GAGTCCGATG CGGCCTCACA GATTCATCTC GTTTCCATAA AGTTCCAGAG CACCGTGCTC CAACAGATTC CCCAATTAGA ACCCGTTCTT CAGAAAGTTA CAGCTTTTTT AGCACCGTTA GTGCAAGATC CAACAACCCA ATTGATCGTA TCTGCCTTGG TGTCCTACAC GATTGTGTCC AAGTTGCTAG CCATGACCTT ACCACCTCCA CCATTGAAGC CGTATCCCAG TGGACGCTAC GACCCCGTGG CCTCCCGGGC CTACTTTGAC CAACGATTGC CGCTGGTGAT TGCCCGGAGT TTTTCGATTC TAGTGCAAAG TCTGCAATTC GCTGCTGCGC TACTCCAGGA CAAAATGCAG TAAGTGAATT ACGCATCGTT GTTGTGATTG AGTGCCGTCC TTTCTGTTTA TGTGTTTGTG TGTGGAGCCG TTATCGATCC TGTATTCGAA TGTTTCTTTT TCCCTTGGAC AATTGTGTCA AACTCATACG GAACGATAAT TTTGCTACCG TATAGGAACA AGCTGGTACA AAATGAATTT CAACGCGGTG AGCAACTGGC GGTTCTACTC TCACGCCTTG GACCAACCTT CATCAAAGTC GGCCAGTCTT TATCGATTCG TACCGATCTA CTCTCTCCGG CGTACGTCCG CGGCCTGGCA TCCTTGCAGG ACCAGGTGCC GGCCTTTGAC ACAGCTATTG CCAAACAAAT TTTGGAAGTG GAATGGCAAC GTCCCGTATC TGACGTGATT GTTGGAGAGT TAACGTCGCA ACCTATTGCC GCCGCTTCGC TTGGTCAAGT GTACAAGGCC ACACTGAAAT CGACAGGACT GGATGTAGCT ATCAAAGTAC AGCGACCCAA CATAAATGAA CAAATTGCGT TGGATATGCA CTTGCTGCGC GAAGCAGCGC CCGTTCTGAA GAGACTGTTT AATTTGAACT CCGATACAGT AGGAACAGTG GACGCCTGGG GCGCGGGGTT TGTTGACGAA TTGGATTACA TTCAAGAAGC TCGCAACGGT GCGTTTTTCT CGGAACGCAT TCGTCAGACG CCTCTGAGAG ATGTGGTGCT GGCCCCCGCT ATCGTGGAAG ATTTTACAAC CGGATCCATC CTCGTGACGG AATGGATTGA CGGGGAACGA CTGGACAAGA GCGAAAAGGG AGACGTGACG GTATTGTGCA GCATTGCCAT GAACACGTAT TTAACCATGT TGCTAGAGCT AGGACTACTC CATTGTGATC CGCACCCAGG CAATCTCTTG CGCACTCCGG ACGGAAAACT ATGGTACGTG GCTGGTCTTT TCGTCGTTTG CATGCACTCA TTGCGAGCAG GTCTCTCACT AACTTTATCC TTCATTGCAG TGTTTTGGAT 
TGGGGCATGG TAACAGCTAT CGACAAAGAC TTGCAGTTGA CTTTGATTGA ACACATGGCG CATTTGACGT CGGCAGATTA TGGGGAGATT CCTCGTGACT TGCTGCTACT GGGATTTATT CCGTCCGACA AGGCACATTT GATCGACGAC AGCGGTGTTG TTGACGTCCT GGCGGACATT TACGGAGCCT GGACCAAGGG TGGTGGCGCA GCAGCGATCA ATGTGAATGA TGTTGTCAAC CAGCTACAGG ATTTGACGTC CAAGAAAGGA AACCTCTTCC AGATCCCGCC CTACTTCGCG TACATTGCTA AGAGTTTTTC TGTATTGGAA GGCATCGGAT TAAGCAACGA GCCGAACTAC TCTATCATCA ACGAATGCGT TCCTTACGTA TCCAAAAGGC TTTTGACAGA CAAAGAAAAG ATGGGGCCGG CACTTTCGAC ATTTATTTTT GGACCAGCGA AATCAAATGC AGATCGCATT GTAGATTATC GTCGTGTGGA ACAGCTCGTA GAAGGCTTTG GTGAATACAC AACCTCAGCT TCTGGTTCTC TTTTGGGGAA GCAGAACATG TCCAATACAG AAATACTGGA AGATGTCGCG GACGAGGTGC TCGATTTGGT GTTGACAGAA GAAGAGACCC CTTTGCAAGA AATCTTGTTG GAGCAACTAG CAAAGATCAT CACAGCTAGC AGTCGATCTA TCTGGACACA AATCCGGGAG CGCTCTGGTT ATCTACCTTC TGGCCGTACT GTGCTTGGTA CAATCGTCGA CCCTTTTGGC CTTTTCCGAA CTAGTCCTCT CGTGCGGATG AACGAGCTTG ACGAACGCAC TGTAGAGACA ACTCAGAAAC TCATCGCTTT GGCGCAAAAG CAGATTCAAA AGTCCGACAA CCCGGCTTTT GATCTTTCTA AGCTTTCTCG AGAAGAGGCA CTACAATTCT CCTCTATTCT GGTGCGAAAA GTTTGGGTCC GGAGGGGTGG TGTCGTGCAA ACTAGCAATC GCTTCGCTCG AAAGCTGCTG CAATTAACAG CCGAGAAGCT TGAAGCAGGC GAGCGTGATA CTCGTACCTT GCCGATCCGC ACAAACTTGA CAAGGACGGA ACCCCTTATT GAAGCTGAGA AGTCATTCAC AGAGCGATCA TCAATCGAAT TAGCTGATCA TCATCCCACG CCAGTAAAGG CAGAGAATCC TCGTCTCATA GCAGCACGAC GGCGCCTTGA CACTCTCAAA GCTGAAGATG GCAATGGTGT CATTGTAACT GAAGCTGCGG AGATTTCTAA TGTAGATATC TAGTGTAAGG ATACTAACGT TTTATGAACA GACTTACCGT TAATGCGCCA AAATTTGCTG AGTCGCAACG CTCTTGCGCC AATAAAGTAA GGCCAACAAA GAGCGAGCCT TCCGACAGAC ACTTGACGCC TAGATTTGAC GACCAATTCA GATTATCGGA TTCACAGTCA AGACAAATTT TGGCATTGTT GATGGTGCGG CAATCAGTCA GTGGTTCAGA CAAAGCCGTT CGGAATGATT TATGCTCATT TTATGATCCA TAGAAGCCCG TGTCAACAAA AACCGAATTC TTCTGATTGA AAACTTGCTA CTCAAGGGTT TGAGGAGTCG TAGCAATGTA CGTAGCAACA TGCGGGCATC GACATTCAAT CAAATACGTC ACTGTCACCT TGAATAGGAA CAAACTTTGA CACACTTCCC AGATGTTA
|
Protein sequence | MRRIPTYVLP WLWLCSVLQS PIPTVSFVPP FNHASSSSAS SSHLHARYTP PVSQVVMPPP PPPPPPIPEP PQLDGIDRIL QNLAQKLDSS LPDWKNLHSP VGKDVQGPLI ESVVSIQRQL GQLESDAASQ IHLVSIKFQS TVLQQIPQLE PVLQKVTAFL APLVQDPTTQ LIVSALVSYT IVSKLLAMTL PPPPLKPYPS GRYDPVASRA YFDQRLPLVI ARSFSILVQS LQFAAALLQD KMQNKLVQNE FQRGEQLAVL LSRLGPTFIK VGQSLSIRTD LLSPAYVRGL ASLQDQVPAF DTAIAKQILE VEWQRPVSDV IVGELTSQPI AAASLGQVYK ATLKSTGLDV AIKVQRPNIN EQIALDMHLL REAAPVLKRL FNLNSDTVGT VDAWGAGFVD ELDYIQEARN GAFFSERIRQ TPLRDVVLAP AIVEDFTTGS ILVTEWIDGE RLDKSEKGDV TVLCSIAMNT YLTMLLELGL LHCDPHPGNL LRTPDGKLCV LDWGMVTAID KDLQLTLIEH MAHLTSADYG EIPRDLLLLG FIPSDKAHLI DDSGVVDVLA DIYGAWTKGG GAAAINVNDV VNQLQDLTSK KGNLFQIPPY FAYIAKSFSV LEGIGLSNEP NYSIINECVP YVSKRLLTDK EKMGPALSTF IFGPAKSNAD RIVDYRRVEQ LVEGFGEYTT SASGSLLGKQ NMSNTEILED VADEVLDLVL TEEETPLQEI LLEQLAKIIT ASSRSIWTQI RERSGYLPSG RTVLGTIVDP FGLFRTSPLV RMNELDERTV ETTQKLIALA QKQIQKSDNP AFDLSKLSRE EALQFSSILV RKVWVRRGGV VQTSNRFARK LLQLTAEKLE AGERDTRTLP IRTNLTRTEP LIEAEKSFTE RSSIELADHH PTPVKAENPR LIAARRRLDT LKAEDGNGVI VTEAAEISNV DI
|