Gene Information |
Locus tag | PHATRDRAFT_47036 |
Symbol | |
ID | 7202137 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 244755 |
End bp | 246292 |
Gene Length | 1538 bp |
Protein Length | 430 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181336 |
Protein GI | 219121985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAC CTGGACCTCA AGCATTTCAA TCATTCGAAG AAACCGAGAG CTCCGGCGAG GCTGTCCGTC CTCTGGATTT CCCACCACCT GCAGTTCGTG CCGGCGTCCG CGTAAACGCT TTGGTGGTGC ACCCCGAGTC GGGAACGCGA CAGGTATGTT CCGGCGTAAT TCATCGTGAA GACTTGTCGG AAGCGTCGGA ACCTATATTG CAAGGCAGCG GGCCTGCGGG AGTATCGCCT ATGACGACGC ACGAACGAGA CCCCTCGGTA CAGGAAGAAG TGTTAGCCTA TTGGCCGCAA CGCCGCTTAC AGGATGCCAT TTACGGATCC GTATGGGCCT GCCTGGTTCT GCGACGGCAC CACGGGATTG CAGCCGATGA CGCCGCACGG GCAGCTGGTG TGGAACCAGG GTCTGCCAGT GCTCCAATTG TATGGGAAAT ATCAGGCCAG CATGTTGCGA TCAAAATGGT AGAATGGGCA CGCGTTCATC ACATGCGAGG ACGGCTTCTG GAAGATCCGG TGAAAGAAGT TGCTGCTATG CAGCTACTGG GTGCGCGTCA TCCAAATGTG CTGGGAAGTA CTGAAGTATT ACAAGATGGT GACTTCTTAT ACTCGATAAT GCCCTACTGT CGAGACGGTG ATTTGTTCGG CGTTGTTGTC CAGTACGCTG AGGACAGCGG TGGCGAATCT GGCATGCCCG AGCCGGTCGC ACGCTTTTGG TTTCGTCAAA TTTTATGGGT ACGTAAAATG TGATGCACTG AAAAATCAGC GGGCTTACAT TGAGTCCGAA GGTCTAACAG AATTTGCTCT TGGCATTTTG GTCGTAGGGT CTTCATCATC TTCAAACGCA AGGCGTATGT CACCGAGACC TTTCTCTCGA AAATATTTTA GTTGATGGTG ATCGCTGCAT GATTATCGAC ATGGGCATGT GTCTACGAGT TCCCTACAAT GATCCTCACA AGCCCGGAGC AGTTACCGAT GTCACGCGTG GAAGTACTCG GCGATTGATG CGACCACAGG GAGTTTGTGG AAAGCACAAC TATATGTCCC CAGAAGTATT TGCCAACACC GACAGCTTTG ATGGTTTTGC TATTGATCTG TGGGCAGCAG GTGTCATTCT TTATATCATG CTGACAGGAT TCCCGCCTTA CGACCAAGCT AGTCGAACCG ACCAGCGATT CGAGCTGATT GCCACTGGTC GCCTAATGGA GCAACTTCGA AACTGGAACA TCCAACTTTC TGAGGAAGCA GGAAATCTGT TACAGCGAAT GCTGACATTA GATCCTCGTG AACGGCCGAC GCTTGCGGAA ATTCTTGCCG ATCCATGGGT AACGAGCGAC GACGTACATG TTCCTCCTCC GCCGGAGCCG CTTCCGTTCT AACGATGGGT ACGCCTTGTT GGTTTCATTT ATCTTTCCCC TTCGAAAATT GTTTTCAGGG ATTTCGTGGA CACAACCGTC CTCTACGTTA CAATCTTGCT ACTAGGATGG TCAGAAGGCA CGAATTACGG CTAACGCAAA AGAAAAACTC ACAGCTTG
|
Protein sequence | MEEPGPQAFQ SFEETESSGE AVRPLDFPPP AVRAGVRVNA LVVHPESGTR QVCSGVIHRE DLSEASEPIL QGSGPAGVSP MTTHERDPSV QEEVLAYWPQ RRLQDAIYGS VWACLVLRRH HGIAADDAAR AAGVEPGSAS APIVWEISGQ HVAIKMVEWA RVHHMRGRLL EDPVKEVAAM QLLGARHPNV LGSTEVLQDG DFLYSIMPYC RDGDLFGVVV QYAEDSGGES GMPEPVARFW FRQILWGLHH LQTQGVCHRD LSLENILVDG DRCMIIDMGM CLRVPYNDPH KPGAVTDVTR GSTRRLMRPQ GVCGKHNYMS PEVFANTDSF DGFAIDLWAA GVILYIMLTG FPPYDQASRT DQRFELIATG RLMEQLRNWN IQLSEEAGNL LQRMLTLDPR ERPTLAEILA DPWVTSDDVH VPPPPEPLPF
|
| |