Gene Information |
Locus tag | PHATRDRAFT_49084 |
Symbol | |
ID | 7195440 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 541825 |
End bp | 544732 |
Gene Length | 2908 bp |
Protein Length | 818 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183626 |
Protein GI | 219126777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
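The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive (the listed values are consistent with this) and assuming the coding sequence ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the Start bp, End bp and Protein Length fields of this record.
gene_len = span_length(541825, 544732)
cds_len = coding_length(818)
noncoding = gene_len - cds_len
print(gene_len, cds_len, noncoding)
```

Under these assumptions the span comes to 2908 bp, matching the Gene Length field. About 451 bp of it is not needed to encode the 818 aa product, which is consistent with the gene being flagged as spliced; the remainder may be intronic or untranslated sequence.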
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0231566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGTTTCATC GTTTCTCTCT CCTGCCAAGT ACGAAAAGTT TATTCCACAC TGATGCGCGA CGAATGTGGC ATCGCTTCTA TACCTATTGT GAAAGCGTGT TGCCATCCGA TTTGAAAGCG TCTACGGATT TTCGCAAGTG ATCGAATCCG GTCTTTTTGC GAACCCCGTG TGCGGAAAGG CGACCATGTC GGACCCGACT TCATCATTTC AAAATCCCTT CGGAAACATG CCGAACTCCC CCATCTTTAC AGCTGACCAA CTGAAAGCCT TACAGCAACA GTTTCAAATG AATCTTAACA ACAATACTGG CTTACCGCAA AACCAAACCA ACACTGATTC TGAAATGGAA GCTCAGCCAT CATCTTCTGT TGCTGCGAAG CACAGTACTG CGAAGCACAG TACCGGTCCA TCGTCAACAT CTAGCAGTAG TGACGCGAAC CATTCAAATC ACCACGGACA AGCGTCGTCA ACACAGCAAC AGTCACATCA ACAACCGTTC TTTGTTATGA CACAGATACC TATTCAAGCA ACGCCATTTG CTTACTCAAG TAGCATGTCT TCCGATACGC ACGTGGTGCA GCCGTCTCCT CAGGTGCAGC CGTCTCCTCA GCTGCAAGCC TTACAGGAAC AGCAAAGGCA ATTTCTGCAA AATTACCAAA GCAATCCGTC CGCCTCTCCT ATGCCGCAAC CAACTCCCCA ATTTCAAGCT CTGCAACAGC AAGTACGCCA ACAGCAAGAA CAACAAAAGA AAGAATTTCA ACTCTACCAA TCTCAGCAGG CTTGGGAACC TCACTTTGAG AACCGCAGCG GCAACAGCGA CAACCACCAA CATCAGCAGC AACACACGGA GACCAACCAT GCGAAGCAAT CGCAGGCATC ACACAATAAT AGGCAGCAGC GTCCAAAAGA GATGCAGCAG CAGTACCAAC ACCAAAATTC CGGCATTCAG CCTGAGGCGG AAACTTCGAG CAATGTGGTG CCCAGCACAA GCTCCAATAA TACCAATTTT ATGCTGCAAC AGCTCATGAA GAATGCCTAT GCAGCACATC AACAACAGCA GCAGCAGAAT CCGAACCAAA TGTACCCAAC GCAATCCCAG CAGCAGACAA TACAGCAAAC TTCGGACAAC ACATTAGGTA CCATCTTGTC GAACAGCCAT ACCAAAAATG TGGGATCAGA AAGTGATTCC TCGAACAATC GGCGGCTCTT ACTGGGATCC ACCGTTAAAC CAGTGCAGCG GGAGGAAAGC AATATGAACG TGGATTTCTG GAAACAGTTC TGGGATGACG AAGAAAAAAC CTCGGCGACA TCTTCCGATC TTAACAACAC CCTGTCCATG CAAAACTCCA AGCGCACGTT CAGCGACGCA ATGGTGAGTT CTTTGTTATG TCTAATGTTG CATATAATAT TACGCATGCG CTCACAATCT GCTTCTTTTT TATGTAGGGT GACGGCTCTG GAACATCGAT CGGTTCTGAT GACGGCACGA ATGCCAATCC TCTTCTCCGG CAGCGCTGCG AACCACAGGA ACAGAAGTCT ACGTTCCAGT CACAAAGCGA GCAACGACGG TGCCCGCCAT TTCAGCAGCA GAATTCCACG GAATCACGAA ACGTCGAATC TGACGAAGAC GATGCGATTG CACCCACCCC GTTGAGCGAA ATTCGTGCCA AACATTTTCG CATGCAGACT TCACAAGCAT CCACTGAACC GCCTTCCCCT CAGGTTCCGC ACGCTCAGTT GCCTAAATCA AGCAACACTG GGTCATCTTC TTCCTCTATG CAATCATTTC ATCACCAACC 
CATTCAGCCA CAGTTACAAC ATCAGTACCA GCAGCCAAGG CAGCAACCTA TTCTTCCGTC TCCATCGCTG TCGGCGCCAT TTCTGCTTCC GTCAACTATG ATGCCGCCCT CATTATTAGC TAGCGTTTAC ACGTCCTCGA CGAAAGCAAC TTCGGAAACT GGGCAACCAA AAAAGAAGCC GGTGTGTGCT GTCTCCTCCA AGTCGGCTAG TGCTGCGCGT ACCGAAACAC CCCAGCAAGT ATTGGAGCGC ATTTTGACGA GTCGAGGATA CGGAAGCGAT ATTCGAATCA AAGCAGAGCA ATCCAACTAC GATGCAATGC CATCGCCTTT GCAATTGGCG TCGTTTGGTA CGGAGCTGGT CAAGGCCATT CACACGTCGG ATGTGGACAA GCTATCCTCC TTGCTAGCTT GTGGTCTGTC TCCGAATCCT TGCAACCAGT TCCGAGACTC CATTGTGGAT CTGGTATGCA AACGTGGTAG CGCGGACATA TTTCGTTGCC TGGTCGATTA CGGTTGCGAT TTGCGCGTTT GCGACGGGTT CGGGCGTACG CCGTTGCACC ATGCTTGCTG GGGCAGCGCT TTTCACCCCG AAATTGCCAA CAGTATTCTC CGCAACGACG CGCAGCAAAT CCTGATGGAA GACAAGCGTG GTCAAACGCC ACTGGAGTAC GTCCGGGAAG CGCAAGCCGG CGACTGGATA GACTTTCTGG AGAGTCACAA GGACGAGTAT TTTCCGGCTG GCGGTGCGTT GCCGGCGGCA CGAGACATTC GGGACTCTCG ACCGAACGGA TCGTTACCGA ATCCACTCAA CGCCTTGCCT TTGGCTTTGG CGGGGGCGTT GTCGTCGGGG CAAATCACGC CAGAGAAAGT CAGCGCCATG AGTGCCGAAG AGCGGGGTCG CTACAATCAG CGTTAATTGT ATAAGGCGGC TCAACACGTG TGCCTCTGGT GGTGCGGAGA ACCAACCTGT ATGCGGAAGA ATGTGGCATT GTCAAACTTT CTTTTATCAG CAGACACTTC TTTTTTCTCT TTGCTAGAAC TAGCCGATAG TAGTATACGC ATGGCTAATA TGGATAAAAA TGTGTGTAAT AGTATAGTTA ACTGTAAG
|
Protein sequence | MSDPTSSFQN PFGNMPNSPI FTADQLKALQ QQFQMNLNNN TGLPQNQTNT DSEMEAQPSS SVAAKHSTAK HSTGPSSTSS SSDANHSNHH GQASSTQQQS HQQPFFVMTQ IPIQATPFAY SSSMSSDTHV VQPSPQVQPS PQLQALQEQQ RQFLQNYQSN PSASPMPQPT PQFQALQQQV RQQQEQQKKE FQLYQSQQAW EPHFENRSGN SDNHQHQQQH TETNHAKQSQ ASHNNRQQRP KEMQQQYQHQ NSGIQPEAET SSNVVPSTSS NNTNFMLQQL MKNAYAAHQQ QQQQNPNQMY PTQSQQQTIQ QTSDNTLGTI LSNSHTKNVG SESDSSNNRR LLLGSTVKPV QREESNMNVD FWKQFWDDEE KTSATSSDLN NTLSMQNSKR TFSDAMGDGS GTSIGSDDGT NANPLLRQRC EPQEQKSTFQ SQSEQRRCPP FQQQNSTESR NVESDEDDAI APTPLSEIRA KHFRMQTSQA STEPPSPQVP HAQLPKSSNT GSSSSSMQSF HHQPIQPQLQ HQYQQPRQQP ILPSPSLSAP FLLPSTMMPP SLLASVYTSS TKATSETGQP KKKPVCAVSS KSASAARTET PQQVLERILT SRGYGSDIRI KAEQSNYDAM PSPLQLASFG TELVKAIHTS DVDKLSSLLA CGLSPNPCNQ FRDSIVDLVC KRGSADIFRC LVDYGCDLRV CDGFGRTPLH HACWGSAFHP EIANSILRND AQQILMEDKR GQTPLEYVRE AQAGDWIDFL ESHKDEYFPA GGALPAARDI RDSRPNGSLP NPLNALPLAL AGALSSGQIT PEKVSAMSAE ERGRYNQR
|
| |
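The sequences in this record are printed in space-separated blocks of ten residues, so the blocks need to be joined before any computation. The following is a minimal sketch in Python that strips the block spacing and computes a GC fraction. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above, used as a small sample.
sample = join_blocks("CTGTTTCATC GTTTCTCTCT")
print(len(sample), gc_fraction(sample))
```

Applying `join_blocks` to the full protein sequence and taking its length is one way to cross-check the Protein Length field. Applying `gc_fraction` to the full gene sequence can likewise be compared against the rounded GC content value.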