Gene Information |
Locus tag | PHATRDRAFT_17974 |
Symbol | |
ID | 7196969 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2458046 |
End bp | 2459706 |
Gene Length | 1661 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177522 |
Protein GI | 219111541 |
COG category | |
COG ID | |
TIGRFAM ID | |
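The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which agrees with the listed 1661 bp). The function names `span_length` and `gc_percent` are illustrative only and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G/C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used in this record)
    and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
gene_len = span_length(2458046, 2459706)

# Nucleotides needed to encode 475 residues plus one stop codon.
coding_nt = 475 * 3 + 3

# Bases in the gene span that are not accounted for by the coding sequence.
non_coding_nt = gene_len - coding_nt
```

Under these assumptions, 475 codons plus a stop codon account for 1428 nt, which is shorter than the 1661 bp span. That is consistent with the gene being flagged as spliced, although the record does not say how the remaining bases divide between introns and other non-coding sequence. `gc_percent` can be applied to the gene sequence string to compare against the listed GC content.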
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0964806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCGCTTC ATCGTTCCAA GTGGATTGGT CAATCAGATA TAGCGACAGA GAGAGTCACG GATGATAGAT TGGCTATCGC TGCTTCCATC ACCTTGCGTT TCGAAGCAAC AGTAAGGAAA TGGCGGGACC GAGATCCGGT TCCGGCCGAA CCCGTAAATC CACGCTGGCA TCCCGACGGA ACAATTCCCA ACCCCACACT ACGCACAAAA GAGCCCGCCC AACGCTCAAT CTTGCCGCAT CCGTCGATGA ACACATTGAT TCTGACGAAG AGGAGCACTT CGAAGATCGC TTGGAAGGTC GGGAGCCGTC CTTAGAAGAC GAACAATCCG AAGAAGAGGA AAATGTTGAA GCCAAACGAG TTCGGATGGC ACGGGAATAT CTCGAACGCA TGGATCAACA GTCCGACAGT GACGACACTT CCGATGATGA TGATAACGAC AACGATAGCG ACCAGGACGA TGTACAGGAT GATCGCGTGG CCCGAAAACT GCAACGCGTG CGACTCAAGC GCGAAGGTAC CTTCGAACGC GCAATTGCGG ACAAGCTCGC GACTCGCTTG TCTGCCCTCC AACCTCCATC CTACGATACC ACTACAGCAG TAGTATCAGT AGCAGTAGGA ACAACAGCAA CAGTAGCGGC AATTACGACC CCGCGGCAGC AAGCCCAGGC ATGGATTGCG GCGGGCCATC AAAGACTACT GCGGGGACAC GATCTGACCC CCACTTGCGT CGCCCTTCAA TCGGACGGTG CCACCGCCAT TTCGGGATCC AAGGATCATT CCGTCATTCT TTGGGATATC GAAACCGCTT CACGGAAAAC GCATTTATGC CCCGTGTGGA AGAAAGAAAC CGCCGACAGC AGTAAACCAC GTACCGTTGG GGAAGTTTTG TCCGTCGCCT GCTCGGACGA TGGTCGCTAC GCCGCCGTTG GATCCCGGGA CGCTACTGTC CGTGTTTTTG ACATTCGTTG TCGCTCACAG GCACTTGTCC AAACCTTTAC CGGGCATAAA GGCGCCGTCA CGTCCCTGGC CTTTCGGACT AATTCACTGC AGCTCTTCTC CGGAAGCGAC GATCGCTGTA TACGACACTA CAATCTCTCG GAAATGCTCT ACATCGAAAC ACTCTACGGG CACCAGTTTG GGGTCACCGA CTTGGATTGT CATCGGGATG AACGTCCCGT ATCGGTGGGT AAGGACCGGA CCGCACGGGC TTGGAAACTG GCGGAAGACA CACATTTGAT CTTTCGGGGC GGCAGTAAGC TACCCTCCGC TTCCAGCGTT ACGGTCGTGA AGGACGATTG GTTCGTGACT GGACACGAAG ACGGACACCT GGCTATGTGG AAAACAGACA AGAAAAAGGC GGTGGCACAA ATTGCCAACG CACACGGCGA CGACACGGAA ATTGTAGCGG TCCAAGCTTT GCCGGGCAGT GATCTGGTAG CGTCGGGATC GTACGACGGA TACGTTCGTT TCTGGAAGGC ATCAACCGGA CGGACTTTGG CGGAACGGGG TCTGCAGGCT GTGGGCGAGA TTCCGTTGTT CGGGTACGTG AACGATATTG CCTTTGGTCC CAAGGCCCGC TTTTGTGTGG CTGCGGTGGG ACAGGAACAC CGTCTCGGTC GTTGGAATCG AGTGGCCCAG GCCAAGAACC GCTTGGCTAT C
|
Protein sequence | MAGPRSGSGR TRKSTLASRR NNSQPHTTHK RARPTLNLAA SVDEHIDSDE EEHFEDRLEG REPSLEDEQS EEEENVEAKR VRMAREYLER MDQQSDSDDT SDDDDNDNDS DQDDVQDDRV ARKLQRVRLK REVAAITTPR QQAQAWIAAG HQRLLRGHDL TPTCVALQSD GATAISGSKD HSVILWDIET ASRKTHLCPV WKKETADSSK PRTVGEVLSV ACSDDGRYAA VGSRDATVRV FDIRCRSQAL VQTFTGHKGA VTSLAFRTNS LQLFSGSDDR CIRHYNLSEM LYIETLYGHQ FGVTDLDCHR DERPVSVGKD RTARAWKLAE DTHLIFRGGS KLPSASSVTV VKDDWFVTGH EDGHLAMWKT DKKKAVAQIA NAHGDDTEIV AVQALPGSDL VASGSYDGYV RFWKASTGRT LAERGLQAVG EIPLFGYVND IAFGPKARFC VAAVGQEHRL GRWNRVAQAK NRLAI