Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39760 |
Symbol | |
ID | 7195338 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 636166 |
End bp | 637667 |
Gene Length | 1502 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183785 |
Protein GI | 219127108 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0199514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGCC GAACACTTGA ACAAACGCAA GCTCAGCAGC CTCAGCCCCA GCCCCAGCCC CCACGGCCTC GTCAGGGTCG CATTCCGGCG GCCTTACAGT CCGTTTTCTT GGCGTCTAAC CGAGGAAAGA CGGTGGATCG CCCTCCTCCG GGCAAAGTTG CCTACAAGGT ATTCTTTCGC AAGATATTCT TCCAAAAGAA GCCGGAAGCG GAAAAGAATA GCGGCTACAA GAATGGCTAC AAGAAGTGGA ACAAGAAAGG CAACAAAAAC AGCGGCAAAT TCGTCAAGAA CAACTTTTAC AAAAATAACA ACAACAACAA CAAGCAATCA CGCTTCTCGG CACCTCCGCA GAGCGTCGAC AAGTCGAGTA AGATTTTGTC CAGCACGTCG ATTAGTTTTG GAAAGGCTAG TAACGCCAAA AGCAAACCAA CTCCGTCAAC TCGTGCAGCA GCTGTGGTAC GTGAAGTAGT CAATCATTGG TTGATCGAGG ACACTCGTGT TCACGTTTCC ATTGTATACG TAACTCACGC TCTGAGCTTT CTTCTCTTTC GCAGATTGAA GAAAAGCAGG CTATTTCGCC CCAATCCTAC TTGGATGATA TGATCGCGGC ACGCGGTTAC TCCACGGAAA AGTTTAAAAC CTTGCAAACG GCCTACTATA ACAAGCCCAC TGCGTTGCAG CAAGCTTCCT ACGACGTCTA CCTCATTGAT CTCGTCAAGA AAAACGGAGT GGAAACCCTT CGGAATATCT TCAAGTCCGG TGTTTCGCCC AACCCCTGTA ACACTTTTGG GGAAAGTCTC TTGCACATGA TCTGCCGTCG CGGTGACGTC GATTTGTTAA AGGTACTTTT GGAGTGCGGT ACCAACCTGC AGGTGGCCGA TGATTACGGC CGAACGCCGC TACACGACGC TTGCTGGGCG GCGAAACCGG CCTTTGCGGT TGTCGACTTG ATCCTCGAAC GCGATCCTCG TTTGCTGTAC ATGTCCGACT GTCGAGGCGC CTTGCCGCTC TCTTACGTGC GCAAGGAACA CTGGTGTGAG TGGGTCCCGT ACCTTGAGGC GAGGAAGCAC ACGTATTGGC CGGTGCTGAC CAACAACACC GACACGGATA GTCAAGTTAA GGCAGAAGCA CCCCCGCTGC TGTGCACACA AGGAGCCAAT ACCCGACCGT TGCGCGACCC CAAGGACGCT TTGACTTGCG AAATGGCCAA AATGGTAGTT TCCGGTAAGA TGCAACCGGA CGAAGCTCAG TTTTTGCAAT ACGATGTGAC CGATGAAGAC GACGAGTCTC GTTCTAGTAG CGAGGCGGAG GAAGACGCCG AAGAATCTGG TGACGAGAGC GGTAGTGATG AAGGAATTGG CAGTGGTACT GAGAGCGACA GCGATGATGA CAGCGAGTAC GATAGTGAAG ACGATGCGAG CGATTTCAGC TTGGACGAAG ACGAGATGGC AAGTATTCTG AATACATTGG CACCCCGGGC AGCGTCAGTC GAGAAACAGT AG
|
Protein sequence | MVRRTLEQTQ AQQPQPQPQP PRPRQGRIPA ALQSVFLASN RGKTVDRPPP GKVAYKVFFR KIFFQKKPEA EKNSGYKNGY KKWNKKGNKN SGKFVKNNFY KNNNNNNKQS RFSAPPQSVD KSSKILSSTS ISFGKASNAK SKPTPSTRAA AVIEEKQAIS PQSYLDDMIA ARGYSTEKFK TLQTAYYNKP TALQQASYDV YLIDLVKKNG VETLRNIFKS GVSPNPCNTF GESLLHMICR RGDVDLLKVL LECGTNLQVA DDYGRTPLHD ACWAAKPAFA VVDLILERDP RLLYMSDCRG ALPLSYVRKE HWCEWVPYLE ARKHTYWPVL TNNTDTDSQV KAEAPPLLCT QGANTRPLRD PKDALTCEMA KMVVSGKMQP DEAQFLQYDV TDEDDESRSS SEAEEDAEES GDESGSDEGI GSGTESDSDD DSEYDSEDDA SDFSLDEDEM ASILNTLAPR AASVEKQ
|
| |