Gene Information |
Locus tag | PHATRDRAFT_29309 |
Symbol | |
ID | 7203265 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 195067 |
End bp | 197912 |
Gene Length | 2846 bp |
Protein Length | 842 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182486 |
Protein GI | 219124387 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
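The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein is followed by a single stop codon; the function names `span_length` and `coding_bases` are illustrative and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, stop_codons: int = 1) -> int:
    """Bases needed to encode a protein, assuming one trailing stop codon."""
    return 3 * (protein_aa + stop_codons)


gene_len = span_length(195067, 197912)  # Start bp, End bp from the record
cds_len = coding_bases(842)             # Protein Length from the record

print(gene_len)            # 2846, matches the Gene Length field
print(cds_len)             # 2529
print(gene_len - cds_len)  # 317 bases in the span that do not encode protein
```

Under these assumptions the span is longer than the coding requirement, which is consistent with the record marking the gene as spliced.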
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCACGTCTC TACACTCTGC CGTCGGAACG GACGCGGAAT CCGTCACGAG CACGCCGGTG GAGTACTTTC GTCGCGATTA CCAACCCTTG CCCTTTACCG TCGATCAAGT AGCCCTGGAT TTCGTCATTC GCGACGGCCG TACCACGGTC ACCACGAAAA TGACGCTCGT TGCCAATCCG CGGTACGTCG GCGAACCCAT CCCGCTCGTG CTGGATGGTG ACGAAACCTG TCTCCAGTTG CGCTCCGTCC GCATGAACGG TCAGGAATTG CGGGAGGGGG TGGATTACGA ACTCGCACCG GGACGGTTGC TCCTCAAAAC GCCCGTGGAT GGGGCCGTCG TGCAAACCGT CGTGGATCTG GTGCCGGAAG AGAATACGCA ATTGTCCGGA TTATACAAGG ACGAGGCGTC CGCCATGTAC TGTACCCAGT GCGAGGCTAT GGGATTTCGG CGCATCACGT ACTATCCCGA TCGACCCGAC AACATGGCCA CCTTTACCAG CGTACGACTC GAAGCCGATA AGGAGCGTTA TCCTATTCTT CTCTCCAACG GCAATCGCCT GGATCAAGGG ACTGTGCCCG AAGACGACTC TCGCCACTAC GCCATATGGA GTGACCCATT TCCCAAACCG TCCTACCTGT TTGCCGCGGT GGTCGGAAAA CTCGGGAGTA TTCGAGACTC CTACACCACC GTGTCGGGCA AAAAGGTCCA ACTGGAGGTA TTCTCGGAAC CTCGCAACGT CCACAAACTT TCCTACGCCA TGGAAAGTTT ACAACGCAGC ATGCAGTGGG ACGAACAGCG TTTCGGACTC GAGTATGATC TAGAACTTTA CAACGTTGTG GCCGTGGACA GCTTCAACAT GGGCGCCATG GAGAACAAGG GACTTAACGT CTTTAACACG GCCTACGTGC TCGCCGATGA AGGTACCGCC ACCGACACCG ACTTTGAACG CGTCGAAGGA GTCATTGGAC ATGAAGTACG TATCCGACGG TCTTTTTCCG GGCATTTGGT GGACGACGTG TAAATGTACT TTTTCCTCAC TGTACCCCCT TGATGTTCTT CTCTTATGTG TAGTATTTTC ACAACTGGAC GGGCAACCGT GTGACGTGCC GTGATTGGTT CCAGCTCACC CTCAAGGAAG GGTTGACCGT CTTCCGCGAC CAAGAATTTT CCGGTGACAT GAACAGCAAG CCCGTAAAAC GAATTGAGGA CGTTCGGGTC CTGCGGGCCC GCCAATTCAG CGAAGACGCT GGTCCCATGT CGCATCCTAT CCGCCCCGAA TCCTACATTA GTATGGACAA CTTTTACACC GGGACGGTGT ATATCAAAGG TGCCGAAGTC ATTCGAATGT ACCAGACCAT TCTAACTCCG GAAGGTTTCA ACAAGGGGAT GAAGCTGTAC TTTGAGTGCC ATGATGGTTC GGCCGTGACG TGCGACGACT TTTTGGCTGC CATGGCCGAC GCAAACAACG TGGATTTGAC GCAGTTCGCC TTGTGGTATA GCACACCCGG AACTCCTACG GTGCAGTACG AAACTTCCTA CGCGGACGGG ACCTTTACGC TGAAACTCTC TCAGTCTAGT CGTAGTGTAA CGCCCATGCA CATTCCCGTG GCCTTTGGGC TGCTCGATAA GGCCACAGGT CAGGAAGTGG TCCCCACCAC GGTATTGGAA TTGAAGGAAG CCTCACAAGT CTTTACTTTT GACGGACTCG AGGGGGACGT GGTTCCGTCG CTACTGAGGG ACTTTTCCGC TCCGGTCAAG CTCGAGCCAG TTTCCGGCGA AGTGGACGAA AGTGATCTGG 
CCTTTTTGGC CGCGCGGGAT ACAGACGGCT TCAACCGTTG GGATGCCGGT CAACGCTTGT ACACGTCCTT GATTTTTCAA ACGCTTAACG ACCAGATTTC TTCCACTACA CAGGCGTTTG TGGACGAGGC GTTCCGAATG GCGCTGGAGC AAAAGACTAC GGATTACTCG ATTCAAGCGT ACGCACTCAC GTTGCCTTCT GAATCGACCC TGTCGGAAGA AATGAAAATC GTTGATCCTG TTGGTCTCCA CGAGGCCCGT GGTAAAGTGA AAAAGGCTCT GGCTCGAAAG TACGAATCAG AGATCCGAAC CACGTATGAT GGTTTGACGG AGACTATGCA AGCTGAGACA GAGTTCAAGG TGGATGCTGA AGCAATTGGT CGTCGTCGGT TGCGCAATAC CCTATTGGAG TATTTGTGCT CTATCCGCGA AACAGACCAG GAGCAAATTG CTGCTGCGGA ACTTGCGATG AAGCACTTTC AAAATGCCAA AGGCATGACG GACAAAATCG CTGGTCTAGG AGCCTTGGCT TCGATGGATG GAGAGGGTGC TGATGCCCGT GATGAAGCCA TGCAGACCTT TTACGACGAT GCCGAGGGTG ACGCTCTCGT GCTTAACAAG TGGTTCATGA CCCAAGCAGT TGCGGACCTG CCGGATGTTT TGAACCGCGT CAAAAAACTC AAGGAACACC CGGACTTTAC ACTAAAGAAT CCAAATCGCT GTCGCTCGCT GATCAGTGCC TTTGCAATGA ACTCGGCTGC CTTCCACGAC GAGAGTGGCG AAGGTTACAA GTTTTTGGGA AGCACGATTG CCGAACTCGA CAAACTCAAC CCACAGATCA GTAGTCGTAT GGCTAGCAGC TTGATCCAGT GGCGTCGGTA CGATGAGGAA AGAGGCCAGC TCATGAAGGC GGAGCTCGAA AAATTGAACG CCATGAAGCT GAGCGAAGAT TTGTTTGAAA TCGTAAGCCG TGGGCTTAAA GATTGAAGCG TCTCACGCAG CGGCTTTAAT TACAGCATCT AATTTACAAA ATTTCACTAG TCAAAC
|
Protein sequence | MTLVANPRYV GEPIPLVLDG DETCLQLRSV RMNGQELREG VDYELAPGRL LLKTPVDGAV VQTVVDLVPE ENTQLSGLYK DEASAMYCTQ CEAMGFRRIT YYPDRPDNMA TFTSVRLEAD KERYPILLSN GNRLDQGTVP EDDSRHYAIW SDPFPKPSYL FAAVVGKLGS IRDSYTTVSG KKVQLEVFSE PRNVHKLSYA MESLQRSMQW DEQRFGLEYD LELYNVVAVD SFNMGAMENK GLNVFNTAYV LADEGTATDT DFERVEGVIG HEYFHNWTGN RVTCRDWFQL TLKEGLTVFR DQEFSGDMNS KPVKRIEDVR VLRARQFSED AGPMSHPIRP ESYISMDNFY TGTVYIKGAE VIRMYQTILT PEGFNKGMKL YFECHDGSAV TCDDFLAAMA DANNVDLTQF ALWYSTPGTP TVQYETSYAD GTFTLKLSQS SRSVTPMHIP VAFGLLDKAT GQEVVPTTVL ELKEASQVFT FDGLEGDVVP SLLRDFSAPV KLEPVSGEVD ESDLAFLAAR DTDGFNRWDA GQRLYTSLIF QTLNDQISST TQAFVDEAFR MALEQKTTDY SIQAYALTLP SESTLSEEMK IVDPVGLHEA RGKVKKALAR KYESEIRTTY DGLTETMQAE TEFKVDAEAI GRRRLRNTLL EYLCSIRETD QEQIAAAELA MKHFQNAKGM TDKIAGLGAL ASMDGEGADA RDEAMQTFYD DAEGDALVLN KWFMTQAVAD LPDVLNRVKK LKEHPDFTLK NPNRCRSLIS AFAMNSAAFH DESGEGYKFL GSTIAELDKL NPQISSRMAS SLIQWRRYDE ERGQLMKAEL EKLNAMKLSE DLFEIVSRGL KD
|
| |
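The GC content field can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is pasted as text in space-separated blocks of ten bases as displayed here; `gc_fraction` is a hypothetical helper, and how the database rounds or which region it measures is not stated in the record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small illustration.
example = "GCCACGTCTC TACACTCTGC"
print(round(100 * gc_fraction(example)))  # 60 (12 of 20 bases are G or C)
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.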