Gene Information |
Locus tag | PHATRDRAFT_45844 |
Symbol | |
ID | 7201092 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 448506 |
End bp | 450319 |
Gene Length | 1814 bp |
Protein Length | 560 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180050 |
Protein GI | 219118560 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTATGGTGCA TATTGAAGAA GTGAAAAACG TTCCTACTGG CATTCAACCC GAAGCGAAGG AAATCTCTAA GAAAGAAGCG TCCAATGTTG AAAAGTACAA CGATCTCTTG TCGACCATGC AGAAAACAAA GGAGAAATGG GACGAAGAAG ATGGTGTAGA AGACGAACAA GATGCGCTAT TATCACAGGC TATTGAGTTC GCCATCGAGC AAGGAAGAGG ATGGGCTCCT GGCGAAAAAG ATGCTTATCT GGAAAAGATA TTGGACGACG ATTTCATTCC CCCCATGTTT TGCTCGACGC CGGAAGAGCT GGAGAAAACG GGCTTACAAG AAGCCTTTAC CAGCCTAATA TATGACGGAG AGGTACGTGG GAATCGCCTA GAAAACATGA ACAATTGCTT TACAGCTCTA ATGCATGCTA CTTTCCTTGA ATCAGAGCCC AACAAGCTTG ATGCTGAGTT TCAAAAAGAA AGGTAACGAA GCATTCACCA ACGGCAAACG GAACGAGGCC AAAAACATGC AGTACTATCG GGACGCAATC AATCATTACT ACGAAGCCTT TGCCTGGGCG CAGAAAATAG AGCCTATGAT GGCCGGAGAT TTGGCGCAGG CGGATACAGA CGAGCCAACA TACACGGAAG ATGAATTGGA TGAGCTGCGA TCAAATATTT GCAACAACGT GGCTCTGGCG CACACTCAGC TCAAGAACTG GGGCTTTGTG CGCGATGAAT GTCAGAAGGC ATTAACTTTC AACAACAACA ATGTCAAAGC ATGGTACAGA TTGGCTAAGG CTTACCAAAT GTTGCAACGC TGGGAAGAAG CAGGTGACGC CATTGAATCT GGATTGGCGG TCGACGGCGA AGAAAACAAC AAGGATTTGA GGAAGCTTCA AAAGCTACTA TCTGATCGCA TCCAAAAAGC TCGTAAATTT CGACAACAAC GTGAACGAAA GAGAGCGGAA CGCGTAATGA AAATCAAGAA AGTTTGGAAG CACTGCCAAG AAACCGGTGG CATTAAACTC GGCCGAATTC CACTCGTGGC CACCGTGACA GATGCGGAAG AGGATGACGA CGATCGCGAC GAGTCTCGCT GGCATTTTCA TCTACCACAT ACCGGACAGC TGCCCAGCGA AGAGCACGGT GTATGGGCGT GGCCTTGTAT GTTTTTGTAT CCGTCTCACA ATCAGTCCGA CTACGTCAAG CATTTTGCCG AAAGCGAAAT GTTAGCATTA CGCATGGCGG AGATGTTTCC AGAATTAGAA GACTTAGGCG GTGAGACTCC CATGCCTTGG GACTACAACA ACGAATTTAG TTGTAGCCAA CTAGCTGTCT ATTTTGAGAT TCAAGTGCCG GATACGGAAG AACGTGTGAT ACACCCAGAA CACGTTGAAT TGCTGCGCGA TCAGGCTACC ACGATGCGGT TTTACGAGTC CTGCCGTGCG CTACAGGGAG ACGAAGGCAC GGCAATGGCG GAAGTAGTAC GGGCGGTGGA GCGCAAACAT TTGTACCAGC AACGAAAGGC TTGGACAAAG CGTCACGGCA GTCTGTGGGC GAAGCCCGAT CCATGTTCGG TCGTGCGCGT GCATCCAGCC ATGACCTTGC GAGGGGTGTT GACCGATCAT CGGATGGTTG TGCCGAACGT AAGTAGTGGA AGCGTCTGCG ATCTTGGAGT TTTGGCGACA TTTCTGACTC ACAGGCTTGG TTGATCGTTT TTCAGTTTCT GGTAACTTTT GTGATTTTTC CAGAAAGTCA TCCAGCTCAT GCAGCCTACC TCAAAGAACA CGAATGTGTT GGTCTTTTGG 
AACCGACAGA ATGA
|
Protein sequence | MVHIEEVKNV PTGIQPEAKE ISKKEASNVE KYNDLLSTMQ KTKEKWDEED GVEDEQDALL SQAIEFAIEQ GRGWAPGEKD AYLEKILDDD FIPPMFCSTP EELEKTGLQE AFTSLIYDGE SPTSLMLSFK KKGNEAFTNG KRNEAKNMQY YRDAINHYYE AFAWAQKIEP MMAGDLAQAD TDEPTYTEDE LDELRSNICN NVALAHTQLK NWGFVRDECQ KALTFNNNNV KAWYRLAKAY QMLQRWEEAG DAIESGLAVD GEENNKDLRK LQKLLSDRIQ KARKFRQQRE RKRAERVMKI KKVWKHCQET GGIKLGRIPL VATVTDAEED DDDRDESRWH FHLPHTGQLP SEEHGVWAWP CMFLYPSHNQ SDYVKHFAES EMLALRMAEM FPELEDLGGE TPMPWDYNNE FSCSQLAVYF EIQVPDTEER VIHPEHVELL RDQATTMRFY ESCRALQGDE GTAMAEVVRA VERKHLYQQR KAWTKRHGSL WAKPDPCSVV RVHPAMTLRG VLTDHRMVVP NAWLIVFQFL VTFVIFPESH PAHAAYLKEH ECVGLLEPTE
|
| |
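The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in one stop codon. Neither convention is stated in the record, although the first one reproduces the listed Gene Length. The variable names are illustrative and are not fields of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 448506
end_bp = 450319
protein_length_aa = 560

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus a
# single stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the genomic span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(f"Gene length: {gene_length_bp} bp")
print(f"Coding bases (incl. stop codon): {coding_bp} bp")
print(f"Non-coding bases within the span: {non_coding_bp} bp")
```

Under these assumptions the span is longer than the coding sequence alone, which is consistent with the record marking the gene as spliced.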