Gene Information |
Locus tag | PHATRDRAFT_22117 |
Symbol | |
ID | 7203014 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 739291 |
End bp | 740609 |
Gene Length | 1319 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182448 |
Protein GI | 219124306 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.701766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTGCCATT GGCAAGCCGA ATAGGTTTTT TTTTCATGAG TGATCATGGT TGTCCTGGAA AAGCTGTTTC TGTATCGGAG TATCTGCGTC GGATCCCGAA AGTCGAACTA CACGCGCATT TGAACGGATG CATTCGGCAC GAAACCTTGA TGGATCTAGC TCACGAGAGA GGCGCGACGC TGAGTAACAG GCACTTTTCT GCGGAACCGC TCCACGAGAA CCTCGCTTCA CCCCCAAACA ATGGCGAGCA CCACAGCATG TACAATATCA TGCCACGATC TCTGCAGAAC TGCTTCGATA TATTTGCCGA AATTCCGGCT TGCGTTAACG ACTTGTCGGC ACTGCGAAGA ATAACGCAGG AAGCTCTGGA AGATTTCGCA GCACATCACG TTGCCTATCT CGAATTGCGT TCTACACCGA AGCGCTTACT GCGGTCACAT CAAGATGATC AATCGCAAAA GGTTGACAAA CAGGTGTACA TTGAAACAGT GTTGGAGGGT ATACGCGACT TCCAGAGCAA AGAAAAGGAA CGCTTCAGTC ACGATCCAGT ATTGTCATCG TCTCGGTTAC CTATCGTGTG TAACTTCATT GTCGCTATCG ACCGATCGCA GTCCCTGGAA GAAGCAACGG ATACTGTACA TATTGCAATC GACATGTTCC AACGCCAGCA GAGTCGGCCT TCCAATCTCT CGCCGTCAAT TGTCGGAATC GACTTGGGGG GCAATCCGAC CAAAAATGAT TTTCGGACTT TTCAGACCCT CTTTCAAAAG GCGAGACAGG CCGGACTCAA GGTGACGATC CATTGTGGTG AAATCCCATG TGCAGAAGAT GATAACAGCA AACACGAGCG TCGCGTTGCC ACCGAATCGA AACGGAAAGC CCGGGACGAA GCCGTGGCCA TTTTGGCTTT CCGACCGGAC CGTTTGGGAC ACGCCTTGTT GCTCCCATCC TCGCTTCAAA AAGTGCTGGA AGACACCAAG ATCCCCGTGG AAACCTGCCC CACAAGCAAT GTCATGACGT TGGAACTCGC CAGATCCTCG AACGGGAATC TCGTGCACGG ACTATCCCAG CATCCCTGTT TGGCACAATG GCTCCAGAAC AATCATCCAT TGTCTATTGG TACAGATGAC CCGGGTGTCT TCCATACCAA CGCAACTAAA GAACTGGTGT TACTGGTCAA TACCTTTTCT TTGGATCCTT GTGCAATGGC AGAAAAGGTT GCTGATTCTG TCAACTACGC GTTTTGCAAT GAGACTCTCA GGCAAGAGAT AAACGCCAAG ATGCGTGAAA TCATGAAAGA GATTCATCAT TCTTCCTGA
|
Protein sequence | MSDHGCPGKA VSVSEYLRRI PKVELHAHLN GCIRHETLMD LAHERGATLS NRHFSAEPLH ENLASPPNNG EHHSMYNIMP RSLQNCFDIF AEIPACVNDL SALRRITQEA LEDFAAHHVA YLELRSTPKR LLRSHQDDQS QKVDKQVYIE TVLEGIRDFQ SKEKERFSHD PVLSSSRLPI VCNFIVAIDR SQSLEEATDT VHIAIDMFQR QQSRPSNLSP SIVGIDLGGN PTKNDFRTFQ TLFQKARQAG LKVTIHCGEI PSRDEAVAIL AFRPDRLGHA LLLPSSLQKV LEDTKIPVET CPTSNVMTLE LARSSNGNLV HGLSQHPCLA QWLQNNHPLS IGTDDPGVFH TNATKELVLL VNTFSLDPCA MAEKVADSVN YAFCNETLRQ EINAKMREIM KEIHHSS