Gene Information |
Locus tag | PHATRDRAFT_42245 |
Symbol | |
ID | 7195096 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 218397 |
End bp | 220210 |
Gene Length | 1814 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183320 |
Protein GI | 219126136 |
COG category | |
COG ID | |
TIGRFAM ID | |
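
The coordinates, gene length, and protein length above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence consists of one codon per amino acid plus a single stop codon. Under those assumptions, the bases left over would be non-coding, which is consistent with the record marking the gene as spliced. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 218397
end_bp = 220210
protein_length_aa = 575

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per amino acid plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Under the assumptions above, the remainder is non-coding (e.g. intronic) sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```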
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTTG ATGTCCTCAT TGTGGGTGGA GGGCCAGCAG GATTGGCCGC AGCGATACGT CTCAAGCAGC TGAGTCTAGA AAAGCAGAAA GATCTTTCAG TCTGTGTGAT TGACAAGGGA AGGTACGCAT GCATAAACAT TCTAGCGCCG CTGCGATTAA AGAGTACATC CCTTTGTATA CTCACTTTGT TCTGTGGGTC ACGTCTAGTG AAATTGGTGC CCACATTCTA TCTGGGAACG TCTTTGACCC CAAAGCCATG CATGAACTCT TTCCCGACCA GGCAGATCCG TCTTCCGATA CACACTGGAC GAAAGAACTG GAAGCGACCC AGAATTCCGT CGCCACGCCG GTGACTGACG ACGAATTTCT AGTCCTGACC GAGACTGGGA GCACCAAAAT TCCGAACTTT TTGTTGCCAC CCCAGCTCGA CAATCACGGT AATTACATTG TCTCCCTTAG TCAAATTTGT CGCTGGATGG CGGGTAAAGC GGAAGAACTA GGTGTGGAAA TCTATCCCGG CTTTGCAGCC TCCGAAGTGT TGGTGGACCA AGAAACCAAC GCTGTCAAGG GAATTGCTAC GCGAGATGTG GGCATCGCCA AGAACGGAAC CCACAAACCC ACATTCGAAC GAGGAGTAGA ACTACACGCC CGACAAACTC TCTTGGCGGA AGGGGCCCGC GGATCGTGCT CCGAATACGT CATGGAAGCC TTTGATCTGC GCAGGGATTG TCAACCACAA ACGTACGGTC TGGGACTAAA GGAAGTATGG CAGGTTCCAC CTGAAAGCTT CCAAAAAGGA TTAGTACAGC ATACACTTGG GTACCCTCTT CAGTCCGGCC CTTTGGATAA AAATTTTGGT GGAAGCTTTT TGTATCACCA AGAACCAGAT TTGGTGTTGA TTGGTTTGGT GGTTGGTCTC GACTACGCCA ACCCGTATTT GAATCCTTAT CAGGAGTTCC AAAGATGGAA ATCCCATCCG GATATTCGTA AGCATTTGGA CGGTGGAACG TGTGTCTCGT ATGGCGCCCG AGTTTTGAAT GAAGGCGGAT GGCACGCTGT TCCAAAACTC AGTTTTCCAG GCGGGGCACT TTTGGGGTGT GGCGCGGGAT TTTTAAACGC AGTCAAAATC AAAGGTTCAC ACACGGCTAT CAAATCTGGT ATTTTGGCGG CAGAAGCCGC CTTTGATGCA TTAAAAGATG GAGACTCCGT AGCTGAAATT GGGGAATTAC CAGAGACTGG TCCTATTGAA TTGACGACGT ACGAAACTGC AGTTAGATCG TCCTGGATTA AGGATGAGCT GTATCAAGTC CGAAACACTC ACGAGGCATT TTCGCGCTGG GGTGTTGGTG GTGGGCTTAT CTACACCGGA TTGACAACTC ACGTGTTGAA AGGCCAGGAA CCGTGGACAT TGAAACACTT GACAAAAGAC TGTGAAAAAA CGGAGGCGGC GGCCAATCAT AAGCCCATCG AATATCCCGC ACCAGATGGA AAGCTAACGT TTGATTTATT AACGAATCTA CAACGAGCTG GCACCTTCCA CGAAGACGAC CAACCAAGTC ATCTCCGAAT TAAACCTGAG CAAGCCGAGA TTCCGAAAAA GACATCGCTA CAGGTATATG CTGGTCCTGA ACAGCGCTTC TGTCCAGCGG CTGTGTATGA ATACGTCGAC GTCGTAAACA CAAAAGGAAA AGAGCTGGTA ATCAATGCGC AGAACTGTAT TCATTGCAAA TGCTGTTCAA TTAAAACGCC GAAAGAATAT ATTCGATGGT CTGTCCCAGA AGGGGGCGGA GGTCCGCAAT 
ATCAGATTAT GTGA
|
Protein sequence | MPFDVLIVGG GPAGLAAAIR LKQLSLEKQK DLSVCVIDKG SEIGAHILSG NVFDPKAMHE LFPDQADPSS DTHWTKELEA TQNSVATPVT DDEFLVLTET GSTKIPNFLL PPQLDNHGNY IVSLSQICRW MAGKAEELGV EIYPGFAASE VLVDQETNAV KGIATRDVGI AKNGTHKPTF ERGVELHARQ TLLAEGARGS CSEYVMEAFD LRRDCQPQTY GLGLKEVWQV PPESFQKGLV QHTLGYPLQS GPLDKNFGGS FLYHQEPDLV LIGLVVGLDY ANPYLNPYQE FQRWKSHPDI RKHLDGGTCV SYGARVLNEG GWHAVPKLSF PGGALLGCGA GFLNAVKIKG SHTAIKSGIL AAEAAFDALK DGDSVAEIGE LPETGPIELT TYETAVRSSW IKDELYQVRN THEAFSRWGV GGGLIYTGLT THVLKGQEPW TLKHLTKDCE KTEAAANHKP IEYPAPDGKL TFDLLTNLQR AGTFHEDDQP SHLRIKPEQA EIPKKTSLQV YAGPEQRFCP AAVYEYVDVV NTKGKELVIN AQNCIHCKCC SIKTPKEYIR WSVPEGGGGP QYQIM
|
| |
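
The GC content field in the gene information is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming this definition and that the spaces in the sequence are only formatting that separates 10-base blocks. The function name `gc_content` is hypothetical, and the demo string is just the first three blocks of the gene sequence, so its result is not expected to match the value reported for the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Characters other than A, C, G, T (such as the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base blocks of the gene sequence above, used only as a short demo.
demo = "ATGCCGTTTG ATGTCCTCAT TGTGGGTGGA"
print(round(gc_content(demo), 1))
```

To check the reported figure, the full gene sequence string would be passed to the function in place of `demo`.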