Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37463 |
Symbol | |
ID | 7202372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 201712 |
End bp | 203508 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181506 |
Protein GI | 219122343 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CCGTAGAAAA TCGTAGGAAG ATTCTTTTGA CAAAGACTGG CATTGCAGTC GCTGGAGAAC GTCTGATGGT GTCTCTGAGT AAGCTTTTGG TTGGCATCGA GCATCGAACA GTCGAGAAAA TAAAAATAGA GCGAATTAAT CGATGCTTTG CTCGGGGCAA CCCCGAATCG GAATTCGACC GTGAGGTCGG CGGCTCCCAA ACCTTTCGGC ACTTGGAAAT TATCCGAGAC GCTCTCAAAC ACGATCCCAC CCTTCGTTCC TTCGTTGTCG ATAACAATAT GTTGGCAGCG ACGAATCTTC CGACTAGAAC CGAGGATAGC ATTTGGCTGA ATCTCCCTAC TTGGCAAGCT CAACTAGGAA TCCAGGATAT TCAATTTTCG TGCTGTACTG TTCAAACAAT CGAACCTACT GTATTTCTGC AAGGCTTTCC GGAGTGGAGG ATCCATCACG GAAGCGGTCC TCGCGGATTC GCGTTGGTGT CCAACATAAC AAAAGGAGTG TCCGCTTTCT TTTCTGATCA GCCTCGTCAC ATTAATACGG AAGGATCGAG ACAGGGAGAG GCCTGTCCCG ATGCGATGAT TCTGCGAGTT TTTCGATGTG CGCCTCGACC TTCGTTACGG GCCCCGATTG AACATGTGCT CAATCAATCC CGTCCACCTA ACGTCTTTGA CCTGTGGATT CGTCACCACG AAACGCCAAC CGACCTGCCT GGACTAGGAT GTTTCAAAAT TGACGCAAAT GCAAAAGCAA CGCTTTTAGA ACTATTGACG GATGTCGACT TTGTCGAAGA GACCAATCGA GCAGAGATAG AAGTAGAGCC GCATCAGTGC ATGCCATCAC CCCTGGTTCC AAATGATGTT ACGCCCGGCT CTCCTTTAAT TCGCACGGCC CCTCCAAATA AATTTTCGAC ATCACTGGTA TTACAAACTG TTTCCAATCC TGGATCTTTG TTAGCGCCTT CCATGCCCTC GCCAGTATCT ACTCAGCAAG CCATTGTCAA TTTGTTGTCA AGCAGCGATA GCACGGAGTC TAACATTTCT CTTGAGGATG ATCATAACGT CGGTCATCGC TGCAATGCTT TTGAAGTCAT AATTTCCCCC GCAGTTAGCT TTGCGGAGAT GGAAGCACTG GTTGCGAAAG ATAGCGAAGA AAATAGTCAA TTCGGTTCCA TTTTTGTAAA GGGTGAATCC GAAACACTGG GGACAGCGCC CTGCAATCTT GCTGAGAAAA CAAATTGCGA TCGACACAAG TGGGCTTTTT GTTTTCCTGA AGAGCTTCAG TTTTCTGACG TACTTAGTCG AGATTGGCTG CCAAGTGATT TGTCTAACAA GGATTCTCCA CACGACGAAA ACCAACCTGA AGATTTACTG TTGGACTCTT TTTCAGGTCT CGCAACGTTT AACTCAATAT TCGGAGACCC ATGTACGGTA GAGGATGCAG AGGATCCTTA CAGAGAGACC TTGAAGATTG ATACAGCTCT CGCCACTCCA ACAGGGCAGT CTCAATTTGA CGATTATTCT CCGCTTCGGT CGTTACTTGC TGCACTTGCT TTTGAGGGCG TCTATTGCGA CGAATGCAGC TCAGCTATCA CGGATACACG TCCCATACAA ATTTATGAGA CTCGGAACCT ACCGGGTCCG GCAGAGACGT CTAGGATGTT GACAATGCAG CGAGAGAGTT TTAGTATCAG CAACACAAAG GAACAAATAG GAGACGTATC ATACACAGCA GACGGTAGAG AAAACGTAGA GCCCTGCATT GAGGAAAACA TGCGGTGCCA TTGGTAA
|
Protein sequence | MKKSVENRRK ILLTKTGIAV AGERLMVSLS KLLVGIEHRT VEKIKIERIN RCFARGNPES EFDREVGGSQ TFRHLEIIRD ALKHDPTLRS FVVDNNMLAA TNLPTRTEDS IWLNLPTWQA QLGIQDIQFS CCTVQTIEPT VFLQGFPEWR IHHGSGPRGF ALVSNITKGV SAFFSDQPRH INTEGSRQGE ACPDAMILRV FRCAPRPSLR APIEHVLNQS RPPNVFDLWI RHHETPTDLP GLGCFKIDAN AKATLLELLT DVDFVEETNR AEIEVEPHQC MPSPLVPNDV TPGSPLIRTA PPNKFSTSLV LQTVSNPGSL LAPSMPSPVS TQQAIVNLLS SSDSTESNIS LEDDHNVGHR CNAFEVIISP AVSFAEMEAL VAKDSEENSQ FGSIFVKGES ETLGTAPCNL AEKTNCDRHK WAFCFPEELQ FSDVLSRDWL PSDLSNKDSP HDENQPEDLL LDSFSGLATF NSIFGDPCTV EDAEDPYRET LKIDTALATP TGQSQFDDYS PLRSLLAALA FEGVYCDECS SAITDTRPIQ IYETRNLPGP AETSRMLTMQ RESFSISNTK EQIGDVSYTA DGRENVEPCI EENMRCHW
|
| |