Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42307 |
Symbol | |
ID | 7198187 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 45215 |
End bp | 46930 |
Gene Length | 1716 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184291 |
Protein GI | 219128169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000154334 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCGG ACGCTACGAC GGAGGCCCAC GGCCAAGGTC CGCAACAACC CCATCTGGCC AAACAAGACG CGGCAACGGT AGATCCCGCA CAACTCACAG CCCTGTCTCC CGAAGTGGTA CGTACCGGTA CGGTATAACG CAGCGTAAGG TTGTTTGCGT TCGAGGGATG CTGTTGGTAG GGTTGGATCA GACTGGGTAG GGTTGGGTAG GAAACTATGT TCGTCTCGTC TCTAGCGACG TATTGTTACT GTCTGTTATT GCGAAATGAA TCGAGGCATA GCGACTGCAC ACAGTCTCAC TTTAATATAT CTCTCTATCT GCTTTGTTAG ATCTCTCGCC AGGCCACCAT TAATGTGGGA ACCATTGGAC ACGTGGCGCA CGGCAAGTCC ACCGTTGTCA AGGCCATTTC CGGAGTGCAA ACGGTCCGCT TCAAGAACGA ACTCGAGCGG AACATTACCA TCAAACTCGG TTACGCCAAC GCCAAGATAT ACCAAGGACA GCCCGTGGTA TCGGAGGAGA ACCTCCACGA TAACGAAGAC GCCAGCACCA CAACCGCGGA GACTCCTCTG GACGCGGACG GCACCGTCCA CAACACCACC ATTGCCACAG GGCCCCTCTA CACTTCCCGA GGCTCTTCAC ACGCCGATAT CTTCACCGAA GGAGGTCGAA CCTACCATCT GCGCCGTCAC GTGTCCTTCG TCGATTGCCC CGGACACGAC ATTCTCATGG CGACCATGCT GAACGGGGCC GCCGTCATGG ACGCCGCCCT CCTGCTCATT GCCGGAAACG AGACTTGCCC CCAACCGCAG ACTTCCGAAC ATTTGGCCGC CGTAGAAATC ATGCGCCTCG AACATATCCT CATTCTGCAA AACAAGGTCG ATCTCGTCAA ACCCGATGCC GCCGTCGCGC AACAGGAACA GATTCGCAAG TTCGTCGCTG GAACCGTGGC CGACGCCGCA CCCATTCTCC CCATTTCCGC CGTACTCCGA TACAATATGG ACGTACTCTG TGAATACCTC ATTCGACGAA TCCCTCTACC GGTACGGGAC TTTACCTCCC AACCCCGCCT CATTGTTATT CGATCATTCG ACGTCAACCG CCCCGGACAA GACGTTTCCA AACTACAAGG CGGCGTCGCT GGAGGAAGTA TTCTACAGGG TGTCCTCCGT GTTGGTGACG AAATCGAAGT CCGACCCGGA ATCGTACACA AGCAGGACGA TAAAATTGTT TGCGTACCCA TTTTCAGCAA GATTTCCTCC CTTTACGCCG AACAGAACGA TCTCCAGTTT GCCGTCCCCG GAGGCTTGAT CGGCGTCGGG ACCAAAATCG ATCCCACCCT TACCCGCGCT GATCGTCTCG TCGGACAAGT CCTCGGTCTC AAGGGACAGC TGCCGGATGT CTTTTCAGAA ATCGAAATCT CCTACTATCT GCTTCGCCGA TTACTCGGAG TCAAAACATC CGACGGTGGC AAACAGGCCA AGGTACAGAA ACTCACCAAG AACGAAATTC TCATGGTGAA CATTGGTTCC ACCGCCACCG GTGGACGAGT GTCCGCCGTC AAGGGAGAAT TGGCCAAAAT AACCCTCACA CAACCCGTGT GTACCGAAGA AGGTGAGAAG ATTGCCCTCT CCCGACGCGT CGACAAGCAC TGGCGGTTGA TTGGTTGGGG ACAAATTCGC AAGGGAAACG TTGTGGAAAT AGCCGAGTCG GCGTAA
|
Protein sequence | MRADATTEAH GQGPQQPHLA KQDAATVDPA QLTALSPEVI SRQATINVGT IGHVAHGKST VVKAISGVQT VRFKNELERN ITIKLGPLYT SRGSSHADIF TEGGRTYHLR RHVSFVDCPG HDILMATMLN GAAVMDAALL LIAGNETCPQ PQTSEHLAAV EIMRLEHILI LQNKVDLVKP DAAVAQQEQI RKFVAGTVAD AAPILPISAV LRYNMDVLCE YLIRRIPLPV RDFTSQPRLI VIRSFDVNRP GQDVSKLQGG VAGGSILQGV LRVGDEIEVR PGIVHKQDDK IVCVPIFSKI SSLYAEQNDL QFAVPGGLIG VGTKIDPTLT RADRLVGQVL GLKGQLPDVF SEIEISYYLL RRLLGVKTSD GGKQAKVQKL TKNEILMVNI GSTATGGRVS AVKGELAKIT LTQPVCTEEG EKIALSRRVD KHWRLIGWGQ IRKGNVVEIA ESA
|
| |