Gene Information |
Locus tag | PHATRDRAFT_47797 |
Symbol | |
ID | 7203040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 69049 |
End bp | 70604 |
Gene Length | 1556 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182315 |
Protein GI | 219124028 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.631051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGCCGTCTT CTCACTATTC CGTCCAGCAC ATATATTTGC AATGAATTTG GTCAAGTCTA CTAGCACTCT GCTTGCGCTC TTTCTGGTCG GCACCGTGGT GGCGTTTACG CCGACTCGCC ATCCGTTCCA AGCCGCCGTC GGTCGTATCG CTATTCCCGG CGCGACTCGC ATTCGTCCGG ACGGCTTGCC CTTGCGATAC CGTGCCCTCG ATGATCAGGA CGATGATTCC GCCATGCTCC AAGTACAAAC CCGCACACCC CCAGGATACG ATCCTCGAAA AGCCGTGGGG GAGTCGGAAT TTCCCACGGC TCCGGGGGGC GACGATGCAC CGGGCCGCCG CCGTCTGCCC CGGACTGCCA TGAACACACG TCTCATTCTC GCCTTGTTGT TGAATCAGGG TTTGGTCCTG GCGGTGGCGA GTACCTTGGC TGGGGCTGCC TTGTTCGCCA CCAGTGGAGG CTTGGCCGCC TTTGGGAATC TCAACGAACT CTTACGCTGG ACGGGAACAG GTCCCGACGT GTGGGATTTG ACCATTACCG GCGAACGCTT GTTGTGGGGT ATTGGTGGTG CCTTGCCTCT CCTAGTACTC AACGCTGTGT GGGAATCTTC GGACGATCGG CGGCTGGCCA ACATCAACTT TTCCACTATC GCCATGGTCA TGACCTTGTT TGGGCGTCGG ACGGCATTGC CGGACGAGTT TCGGCCCGCG TCGTGGAAAG GCCAGACTCT ACCCACCACG ACGGGTGGTC AAGTAGCCCT ACAGAGTTTG GTTTTATCCC TGGTCACTGG AATTTGCGAA GAAACCATTT TCCGTCGAGA ACTTCCGGCC GTTTTGCGAT CGTTCGGGGC GGACCCGGTG TCCGCCATGA TTGGACAAGC CGCCCTGTTT GGTCTCGGAC ACGTGCAACC GACAGCCAAA GCAGCCGACA ACACCGTCGT GGCCTCGTTG CAAACCGTGA ATGGTCTCGG TTTTGGCTTG ATCTACGCAT TGTCCGGTGG AGATTTGGTC CCCTGCATGA TCGCGCACGC GTGTTACGAC TTTGTCGTCT TCTTCCAGAC GTGGACTCGG GCCAACGATC AACTCGAATA CGCGGAACGT TGGTACAAGG AGCCCTTACC GGCCGGGACG GAACAAAAGG TCCGCCGGAT TCTTCAAGCC GCTCAAAAGG ATCCCAGCGT TCTTTTGCCC CGCATCAAAC GGCTGTTTTA CGTGTTTGAT TTTGATCAGA ACCAGTCGTT GTCTCTGTCG GAAGTGCGCA AGGGAGTGGC GTACCTCGCG ATGGAACGCG CGGGGACACC GCCTCCCCAA CCTGAGGTGG ACCAGGCCTT TTGGGAAGTC GTGGCCCAGC ACGAGTCCGA CGCCCGCGAC GATCGCCTCG ATTTGGTCGA CTTTCTGCGT GTGTACAGCA AGCTGGCCGA CGGAAATCGG CAACGCCTCG CCAGCGCACA GTCGTAAAGC ACAATTTCAT TTTTCGGCGT GTGTACAGCC AACATCGGCA TCGCCTCGCC AGCGCACAGT CGTAAAGCAC AATTTCATTT TCGGCGTGTG TACAGC
Protein sequence | MNLVKSTSTL LALFLVGTVV AFTPTRHPFQ AAVGRIAIPG ATRIRPDGLP LRYRALDDQD DDSAMLQVQT RTPPGYDPRK AVGESEFPTA PGGDDAPGRR RLPRTAMNTR LILALLLNQG LVLAVASTLA GAALFATSGG LAAFGNLNEL LRWTGTGPDV WDLTITGERL LWGIGGALPL LVLNAVWESS DDRRLANINF STIAMVMTLF GRRTALPDEF RPASWKGQTL PTTTGGQVAL QSLVLSLVTG ICEETIFRRE LPAVLRSFGA DPVSAMIGQA ALFGLGHVQP TAKAADNTVV ASLQTVNGLG FGLIYALSGG DLVPCMIAHA CYDFVVFFQT WTRANDQLEY AERWYKEPLP AGTEQKVRRI LQAAQKDPSV LLPRIKRLFY VFDFDQNQSL SLSEVRKGVA YLAMERAGTP PPQPEVDQAF WEVVAQHESD ARDDRLDLVD FLRVYSKLAD GNRQRLASAQ S