Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42887 |
Symbol | |
ID | 7196528 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1419979 |
End bp | 1421835 |
Gene Length | 1857 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176785 |
Protein GI | 219110066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0365267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGCTGTA CATGTAAATA CCAAATTCGT GAACCTAGTT TCGGAAAGTG GCTTAAAAGT TATATGTGAC CATAAAATTG AAATCGAGCT AAAAAAGTTT ATTTCGACGA GGAGATATGT GCTCACTGTA AAAGACACCT GAGCTTGATA CCACTGCCGG ATTAATTGTG TGCCAGAAAT TCCCATGTAT ACCAGTGTTC GCATTACTTT GTTGAGCTGA AACAAAGCTT CCGCGAAACC TGTACTGATT GAGAATTGGA ATTGTTTCCA CAAGATGATG ATGAGCTTGA GAAGTCTGGT AAGTGTAATT GCAATCGCGT TCACATTGGG AATCGGGATG TACAATGGCG TGCACGATAT GAGTCATGTG GTCGAGACTT TGTCGTCGAT TGAGAAAGAG CTAAATCCCT TGCAGCCCAT GACGGGATCG GAGCAAAGCA AGATCGTTAG CATGCCTTAC AAAAACTGGA GGAAAGACAT TATACGGGAA CCAATTCCTC GACCATCAGC TAAAGCTCTT TTCGACGGAT CAAATGTAAC GGGCGACGTA TCGTGGCTCC TAAATCTGGC TATTATAGGA TTCGGCAAAT GTGGCACCTC GTCTATGGTA GGCCATTTCA GCCAACACAA ACAAATATCC ATGATGCACA GAGAGCACTG CGAGTTTACA TGGCGTGACG ACGACACCAT TCTTCTTAAA GCCCTGGAGT CGGAGCTTCC ACATGGCAAC TATATGCGTG GGCTGAAGTG CCCCAGTCTC GTGAGAAGCC CTTTGGGAAT GCAACGGTTA GCCAAGTATT TTCCGAACGT TCGACTCATC GTTGGAGTTC GGCATCCAAT CCTTTGGTAA GTCATTGTTG ATGGAACAAA TTGATTCTTA CTTGTCAGCT AACCTGGGTT TTGGTATACC TGGACAGGTT CGAATCTTTG TACAACTGCA AGTATATTTT TTTCAGTGTC TAGTAAACTG TCTTTCCCAA TCGCTGTACC TTTCTGCTCA CCATCAACAA TACTTGTTAC TCTACAGTTC GCCAGAGACA GTTTGGATAC AGCCTCTTCC CAGCCCACAA ATTGATTGGC AAATGCCAGG ATTTGGGACC CAATAAAAAG GTGCACGGTG TATGTACAGA AGAAGCGAGA TTTCAAGAAG CATTAATTGG CTTGGGAAAG ACTAGCATGA GTACAACGGA CGAAATGCAG TACTTTTTAT CCTCTGAAAA GAAACCTGAG AATCTTACCG TTATCTCTCA AATGAAAGTA TTTGTCTACG ACATAGCGCA GGTAGAGGAT AAAGACGAGG AACGCTCCCA ACTGTTTATG GACGACATGC AAACATTCCT CCAGATGACG GAGCCTTTCA AGCCGATGGG TCAAAAAAGC GGCGGGAAAA CGAAGCAGTC GCGCATTGAC ATTTGCGAGC AAAAGTACGA CCATTTGCGC GAGGCTCTCT TGGACATTGG GGTGAATGCA TCGAGGTGGA TCCGTCGCTT TTTCGTACCT GCCGAGGGTG TGACCGTCTC ATCACCCAAA TTTTTCGAGC AGTCGTTGGC GAAGTGGGAA ATCGATCCGT GCGAAGAACG CCGAGCAAAT AACACATTCC CTCCCAAATG ATTTGATTGT GTCATTATCC CGACAATTTG GTTGTAAACT CTAGGCTGCT TGTCAAAGCA ACATGGAAAA TAGCCAAGAA GCCATTGTCG ACAAATTGGG ATGCCCAGGA TCACGCACCT ATGCAGTGCA CAGATAAGGT TTCTGAAAGG GGGAAAACGG ACAGCACCAT CACTCTCGAC ATTCGTGTGT ATGGGTTGGT ATTTACAGTT CAATAATTTA CACCGCAATC TTAAAGTTAA TTTTTTT
|
Protein sequence | MMMSLRSLVS VIAIAFTLGI GMYNGVHDMS HVVETLSSIE KELNPLQPMT GSEQSKIVSM PYKNWRKDII REPIPRPSAK ALFDGSNVTG DVSWLLNLAI IGFGKCGTSS MVGHFSQHKQ ISMMHREHCE FTWRDDDTIL LKALESELPH GNYMRGLKCP SLPSIFRTFD SSLEFGIQSF VRQRQFGYSL FPAHKLIGKC QDLGPNKKVH GVCTEEARFQ EALIGLGKTS MSTTDEMQYF LSSEKKPENL TVISQMKVFV YDIAQVEDKD EERSQLFMDD MQTFLQMTEP FKPMGQKSGG KTKQSRIDIC EQKYDHLREA LLDIGVNASR WIRRFFVPAE GVTVSSPKFF EQSLAKWEID PCEERRANNT FPPK
|
| |