Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44391 |
Symbol | |
ID | 7198055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 408354 |
End bp | 410105 |
Gene Length | 1752 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178502 |
Protein GI | 219115413 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0416192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAGCCTAAG AGCACACAGG TCAACCCGAA CGAAGAATGA ATGAATAGGT CGTAGTCACG TGAAAGTGAG TCCTACCTTG TGAGACCCCA CCAAACTTGA CTTCACGAAT ACAAGCTTGC AACTATCCGT AGTACCACTT ACGAGCATTC AGCATGATTT CTGTCACCGA AGATATCGAA GGCACCGTCA CGATCCGGCA TGGCGAAGGT ATCGACACAG ATGCAATAGA AGATCTGAAA ACCCTAGATG CGAACGAAGA CGGCGGCTAC CATGACGATG ATCTAGAGAT TGATGACGAT GATGATACTA GTACCGACGG TTTAACACTC AAGCAAGTCG CTATGCAATG TTGCAAGCGC ATTGATGTCG GAGCAGCCGT CCTACCCAAA CCGCTCTTTC ATCCTGACAT CCAAGACTAC GATGGCAGTG AAGAACTTCA CGACGCCGCG GCGTCCATTC CGACACCGTA CGCGCGTGTG TTAGGCAACG AGTTTAACGC AATAGTGGTA GAGCATCACC TCAAGTGGAA TCCATTGTGT GTCTCCGAGC CGGGACCACT GCCTAGCGAA AAACCCTCTG ATCGCATCGG TCGGTATGAC TTCCATCATG CAGATTCAGT CGTGGACTGG GCCTTGAAAC GCAGTATGAA AGTCAAAGGA CATGTTTTGG TATGGCATGT TACGTCTCCG AAAATTCTGG AAGATATGGA GCCGGAGGAA GTACGCAAAG AGCTCAGACG TCACATTTTT ACGGTCATGG GTCATTTCCG TGGTCGCATT CAAGTCTGGG ATGTGGTGAA TGAAGCTCTT GCACCCGACG GAACACTCGC GGATAACGTT TTTCTCCGAA AGCTCGGACC CTCCTACATC GAGGATTCTT TTCGCTGGGC CCACGAAGCC GACGCTAGCG CTGTTCTTTT GTACAACGAT AACAAGGTCG AAGGGATTGA TTCGAGAAAG TCGGATGCAT TTTACGAATT GCTTGCGGAT CTCAAAGCCA AAAATGTCCC CGTCCACGGT TGCGGCGTGC AGGCTCACTG GAATGCAGCT GGCGTGGGCT GGAATCGTCC GCCAACGCCA AGAAGCGTCA AGGCACAAGT TCGGCGCTTA GGACAACTCG GGCTGACGGT AAATTTTTCG GAAATGGATG TACGTGTTAG TCAACTTCCA GCAAATCTAC GTCAAATAGC CCAGCGTCAA ATATTCCATG ATCTCTTGGC TGCAGCGCTG TCTGAGCCGG CCTTTGATGG CGTATGGTTG TGGGGATTTA CGGATCGTCA CACTTGGGTG ACCCATTTTT ACCACGATGA CGAGCCCCTT ATCTTCGACG AAGCCTACGC AAGGAAGGAA TCTTACTACG GGTTTAGAGA TGCTTTGAAA ACGATAGCTC CAGGCGGTTG TGTTGGAGGT GGAACTCTCT TGAGCTCGGA CATCGATGTG GATGGAAATC CATGGGGACA TCTTTGGATG CAGCCCGACC TGGTATCCCA CAATACAGAC GAGGACTTTC ATGGCGACTC ACGTCCTGAC TGGGAACAAA GCATCGAGGT CGATGACAAA GAAGAGGACC CTAGGAATGC AACAGACAGT ATGGAGCTCC CCCCTATTTC TTAAGAGATG CATATATCTG TTGGCTCAGC TCGGAAAGTG AATCGTGCAG AACTGGCATT ATCGCGGCAA ACACATGAAC ACCTAAAGTG ATCGGGGCTC AAGAAACACT CTACATCTAT TTAGAAAAGC GACTCTAGTT CT
|
Protein sequence | MISVTEDIEG TVTIRHGEGI DTDAIEDLKT LDANEDGGYH DDDLEIDDDD DTSTDGLTLK QVAMQCCKRI DVGAAVLPKP LFHPDIQDYD GSEELHDAAA SIPTPYARVL GNEFNAIVVE HHLKWNPLCV SEPGPLPSEK PSDRIGRYDF HHADSVVDWA LKRSMKVKGH VLVWHVTSPK ILEDMEPEEV RKELRRHIFT VMGHFRGRIQ VWDVVNEALA PDGTLADNVF LRKLGPSYIE DSFRWAHEAD ASAVLLYNDN KVEGIDSRKS DAFYELLADL KAKNVPVHGC GVQAHWNAAG VGWNRPPTPR SVKAQVRRLG QLGLTVNFSE MDVRVSQLPA NLRQIAQRQI FHDLLAAALS EPAFDGVWLW GFTDRHTWVT HFYHDDEPLI FDEAYARKES YYGFRDALKT IAPGGCVGGG TLLSSDIDVD GNPWGHLWMQ PDLVSHNTDE DFHGDSRPDW EQSIEVDDKE EDPRNATDSM ELPPIS
|
| |