Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49056 |
Symbol | |
ID | 7195426 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 448003 |
End bp | 449764 |
Gene Length | 1762 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183750 |
Protein GI | 219127036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.3635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCCCGCAC ACACTCACTC TCGCTGGATT CTTTCTGTGG TCATTGCGTT ACTTTGCCTT CGTCTTCTCG ACGACAGACT GTTACAACCA CATCCTTGTC GCTCTCACCA ATTCCAATCA TGCCTTCGGT GACGTACAGT AGTACCCGCG GCGGTCAAAC GAACCTCGCC TTTCGGGATG TGGTAATGCA GGGTTTGGCG CACGATCGGG GCTTGTTCGT CCCGGACCGT CTACCGACGG TGAGCACGAC GGAACTGGAA TCCTGGCGAT CTCTGTCGTA CGCCGAATTA GCGGTCAACG TCATTGCCAA GTTCGTCGGC GACGATCAGG TGCCCTTGCC CAATTTACGG GATATTGTCA CCCGGTCGTG TGCAGCCTTT CGCGACGCAC AAGTCACGCC CCTCGTTCAC GTCGGGGGAC ATTACGTGTT GGTACGTACG CCATCTGTTA CGGAGAGTCT CGGATCATTC TCGTGTCGAC GCTTCTTTTT GGAGAGACTC TCACACCACT GAACTACTCA CTCTCTGGGT CGTTTCCGGG GACGGTATCA ATTTCACAGG AGCTATTTCA CGGACCCACC TTTGCTTTCA AAGACGTGGC TTTGCAAATG CTCGGCAACT TTTTCGAATA CTTTTTGAGT ACGGGCAGTA ACGGGGGTCG CCTCGCCGTG TTGGGCGCCA CGTCCGGGGA TACCGGTTCG GCCGCCATTG CCGGATTGCG CGGCAAAAAG GGCATCACTT GCGTCATTTT GTTCCCCAAC GGACGCGTCT CGGCCATACA GGAACGACAA ATGACCACCG TGCCGGACGA GAACGTGCAC TGTGTCGCTA TCGACGGCAC CTTTGATGAT TGTCAAGATA TCGTCAAGGC GAGTTTCAAC ACACCGGCCT TTCGGGACAA GGTGCACCTG GGAGCGGTCA ATTCCATTAA TTGGTGTCGT GTTTTGGCGC AGACGACCTA TTACTACTGG AGTTACTTGC GTGTGACGGA CGCCCACAAG GACATCCCGG AAGTCCATTT TTCCGTCCCG ACGGGAAACT TTGGGGACGT TCTGGCCGGT TACTACGCCA AGCAAATGGG TCTGCCCGTA GGCAAATTGA TCGTCGCCAC CAACGAAAAC GACATTCTGC ATCGCTTCTT TACCGCCGGC GAGTACCACC GCGAATCGAT TGCGGAGACG ATCTCACCCA GCATGGATAT TTGCGTCAGC AGCAACTTTG AACGCTACCT GTTCCACTTG GCCGGCAACG ACGCGAGCAT GCTGAGCGCG TGGATGCAAG CCTTTGAAAA GACGGGCCAA CTGACCATTC AAGGTGACCT GTTGCGGCAG GCCCAGGCCG ACTTTGATTC TTGTCGTGGC GATACGTCGC AGACTCTCGC CACGATTGAA ACTTACCACC AGAAGCATCA GTACGTCTTG TGTCCGCATT CCGCGGTGGG TGTGTACGCT ATCCACCAGT TGTCCCTGGT TTCGTCGGCC ACGGTCTGTT TGGCCACGGC CCACGAAGCC AAATTCCCCG CCGCCGTGGC TCAGGTCGTG GAACCAATGC CACCCCCACC GACCGAACTC GCTGTTCTCC GGGACTTGCC CACCCGTCGC GTCGAATTAC CGAACGACCT GGCCACGGTA CAAGCCTTTG TGGAAGAGTG CGTCTTTTCT CGGCCACGCA AATGGCAAGC GGTGGCGAAA CATTTGGCCG CGGTGACCGT GGCGGTGGGT GTTGCCGCCG TGGCGTTGCA AGCCATGACC CGGCGACGAT AA
|
Protein sequence | MPSVTYSSTR GGQTNLAFRD VVMQGLAHDR GLFVPDRLPT VSTTELESWR SLSYAELAVN VIAKFVGDDQ VPLPNLRDIV TRSCAAFRDA QVTPLVHVGG HYVLELFHGP TFAFKDVALQ MLGNFFEYFL STGSNGGRLA VLGATSGDTG SAAIAGLRGK KGITCVILFP NGRVSAIQER QMTTVPDENV HCVAIDGTFD DCQDIVKASF NTPAFRDKVH LGAVNSINWC RVLAQTTYYY WSYLRVTDAH KDIPEVHFSV PTGNFGDVLA GYYAKQMGLP VGKLIVATNE NDILHRFFTA GEYHRESIAE TISPSMDICV SSNFERYLFH LAGNDASMLS AWMQAFEKTG QLTIQGDLLR QAQADFDSCR GDTSQTLATI ETYHQKHQYV LCPHSAVGVY AIHQLSLVSS ATVCLATAHE AKFPAAVAQV VEPMPPPPTE LAVLRDLPTR RVELPNDLAT VQAFVEECVF SRPRKWQAVA KHLAAVTVAV GVAAVALQAM TRRR
|
| |