Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47592 |
Symbol | |
ID | 7202647 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 195191 |
End bp | 196390 |
Gene Length | 1200 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182023 |
Protein GI | 219123420 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.823619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATCA ATCAGAAATA CTTCAACGCC TCACTGTTGC TGGTATGTTC AACTGTATGT ACCCTGTTCC AGTTTCGTAT TCTCTATCAA ATGGGCAGAA GGGCAGTCTT CCCCGGATCC AACTCAGGCG ATGAAGGACA GCAAGATCTG GAGTCGTGGC AGTTGCGTCC AAAATCACTT CGCACCGTGA TTGTGCCAGA GTCACCGACT AGGACGTGCG CCATCAATCT CTACGGGTTG CCAAGAGCCT TTGAATCCCT TGCTTTACCG ACACTCATAA AGAACGTGAT TCGTCCCAAT GCTATCAATG GTTGCGACTA CTTTGTGCAC TACTATTACT TGACAGAAGA AATGCCGGGG CGGTCCGGCG AAGGAGGTCG CATAAATCCA AATGAGGTAG TGAAGTTGAA GCAAGCAGTT CAAGAGTTCT CTCCAAAGTC AGTTATTCAA TTTCGCTACG ATAAGGAACA GGCCTTTTGG GATCAATATC AACCGCTTAT TGACAAAATC CGAGCAAGCA ACGATACGGA TGGAAAATTC TTATACTTTC CGTGGCGCAG CCCATCGTAT GTATACCCAA TCACTACTGA TAACATTGTT AAGATGTGGC ACAGCATTCA GTCAGCGTGG AGTTTAATGA CGCTGTACGA GACTCTGACA ACCCGAAAAT TTGACAGAGT TGCGATGCTA CGCTTGGATG TCGTCTACAT TACACCAATC AACGTATTTC AAGTGAATCG ACGAGAGGTT GGCAAGAATG AAAAAGTGGC CTTAATACCT GGCTTTGGAC GTCATCCCGT CAGCGATCGT TTGATTATTG GACCACGAGA GGCGATCGAA ATTTGGGCGT CGCAGCGATT TGATCGTCTC GAGGAGCACG TGAAGTTCGT ACACGAGAAG CATCCGGGCT GGGGTATGCA TTCCGAAATG TTTCTCCATT GGACAATCTT CCCGGCCATT CGAGACACTG GGACGAGCAT CGTGGAAGAC GACAGTTTGT GCTTTTTTCG GGCGCGCGCC GACGAGTCGG TATGGGTTAG TGATTGTGGT GGGAAACCGG AGTATGCCAA AACCTCCATT TTGAAGAATC TTGGTGGGGA CAAAGTTGAA GTATTGGAAT CGGTACTCGG ACGGAAATGC CGAGGCGAAG CTCAGAATCT TTCTTGGTCC TTTGTAGCCC TGGGCTGTCC AGCAGGGTAG
|
Protein sequence | MPINQKYFNA SLLLFRILYQ MGRRAVFPGS NSGDEGQQDL ESWQLRPKSL RTVIVPESPT RTCAINLYGL PRAFESLALP TLIKNVIRPN AINGCDYFVH YYYLTEEMPG RSGEGGRINP NEVVKLKQAV QEFSPKSVIQ FRYDKEQAFW DQYQPLIDKI RASNDTDGKF LYFPWRSPSY VYPITTDNIV KMWHSIQSAW SLMTLYETLT TRKFDRVAML RLDVVYITPI NVFQVNRREV GKNEKVALIP GFGRHPVSDR LIIGPREAIE IWASQRFDRL EEHVKFVHEK HPGWGMHSEM FLHWTIFPAI RDTGTSIVED DSLCFFRARA DESVWVSDCG GKPEYAKTSI LKNLGGDKVE VLESVLGRKC RGEAQNLSWS FVALGCPAG
|
| |