Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25133 |
Symbol | |
ID | 7197060 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 14354 |
End bp | 16444 |
Gene Length | 2091 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177845 |
Protein GI | 219112187 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAGGAAGC CGCGTGCCCA AGCGAACGAA GGAACCGCTT TACCAGTACA CTATCAAACG CGAGCACTGT TACCATGGCC AAGACAGGAT GGAAGTCGGG GATCTCCAAT ACTTCCTCAG CGGGAAAAAC CAATCGGACT CCCAAGGGAA ATGCGAGTTC CAAACTCACG CAAGGCCGAC GCCGCGACAA GCAACCCGGG TCGGAGCGTT CCGCGGAAAC AATCCAGCGT TTGAAAATGT ATTCCAATGG AAAAGCCATT CGTAACAAGG CGGGGAAGAT TGTGGCGGGA ACCTTTATGA TGCAAGACCG CGCCGGTGAT CGAAAAATCG AAGCCTCCAC TGGACGCATA CAACCCGATC GCCGATGGTT CGGGAATACG CGCGTGGTTG GTGCCACCGA ACTCGATCGT TTTCGTCAAG AAATGACGGA GAAAGTGGCT GATCCATACT CCGTAGTCCT TAAACGCAAA AAGCTGCCAA TGGGCCTCTT ACAGGATGCC GCCGAGTACA AGGCCGGAAA CCAGAAAGCC GCGCTTTTAG AACAAGAGCC GTTCGATCAA GCATTTGGAA AGAATTCTCG TCGAAAGAGA GTCAAACTTG ATCAGCTTTT CGTACAACGC GCGGCTGAAT CGGCAGACAG CAAAGAAGAC ATCAAGACTG GCGAAGGAGA GGCTATAGTA GTGTTGTCCA ATACGGAACA TGAGCCTTCC GCCTACTCGT CGCTCTTAGA AACGGCACAG AAGAGCCAAG GAACGTATCA ACAAGTCAAC ACTCGTGAGG GTATCGTTCC CTGGGGCCGA GACTCACACT TGGAGCGCTC CGAAGGCGAG GGCATTGACT GGCGGCATGA GAAGAAAGAC GATCTCTTCC TTAAGGGACA ATCAAAACGA ATTTGGGGAG AGTTCTTTAA AGTCGTGGAT TGTTCAGATG TTGTTTTGCA TATCATTGAT GCTCGAAATG TACCGGGTAC CCGCTGTACC ATGATTGAAC GTCATATCGC CAAAAATGCT TCTCACAAGC ACTTGGTCTT TGTTTTGAAC AAGATCGATC TCGTGCCGAA CTGGGTTGCC AAGCGATGGA TGGGCGAGTT GGCTGCCGTT CGTCCAACCA TCGCCTTTCA TGCATCACTA ACAAATGCGT TTGGAAAAGG GGCCTTAATA AGCTTACTAC GACAGTTTGG AAAGCTTCAC GAAGACAAAA AGCAAATCAG TGTTGGTGTT ATTGGCTATC CCAATGTTGG CAAATCGTCG GTCATCAATA CACTCATTTC AAAAAAATCT TGCAAAGTGG CACCCATTCC TGGTGAAACC AAAATCTGGC AGTATGTTAC CCTCTTCAAA CGAATATCCC TTATCGACTG TCCCGGTGTT GTTGTCGATA CAGCTGGTGA TACGGAAGAG GATTCCGTGC TCAAGGGAGT GGTTCGAGCG GAGCGCTTGG AAAATCCAGA GGACTTTATC GATGCGATTA TGGGCAAGGT GAAACGTGAG CACATTGCGG CACAATACAA GCTACCAAAG GATGGAGAGG AGACATGGAG TAGTAGCGGA GAGCTTATGG AAATGATCGC GAGGCGATCA GGTCGTTTGC TAAAGGGTGG TGACCCCTGT ATTCGGACTG CGGCACTTAT GATTATCAAC GATTTCCAAC GGGGCCGTCT CCCCCATTTT ATTCCTCCAC CTGAGCTTAA GGTAGAGGAA GATCAGAAGA CAGTTGCTAC TGAGCAGGCA ATCCAAATCG AGGAGCAGAA TCTGGATGAT GTGGTTGTTG TCGAGAAAGA AATGGAACAA GACGAAGAAG GTGTCGAAAC TAAAACGAAT GATCATGCAG CCATTGCTGA CACGGAGACA GTAATTGACA CTTCGAGTAT CTTGATCGGA GACGGTAAAT GGGACGAATA GTGTGGTTTT AAAGATACTC GCTTGAGAGA TAGGATTGGT CTGAAAACAT AATTTTGCTT GAAAACTCAA CGTAACAAAT TCATTGCGTT CGAACGGTCC ACACATGTTT GAGACTTTGA ACAAAAAGGG GCTTAGTATT AACTTTCTTA ATGGAGGTAC GCCAAGACGT TGCTGGAAAT TTTTCTCTCT C
|
Protein sequence | MAKTGWKSGI SNTSSAGKTN RTPKGNASSK LTQGRRRDKQ PGSERSAETI QRLKMYSNGK AIRNKAGKIV AGTFMMQDRA GDRKIEASTG RIQPDRRWFG NTRVVGATEL DRFRQEMTEK VADPYSVVLK RKKLPMGLLQ DAAEYKAGNQ KAALLEQEPF DQAFGKNSRR KRVKLDQLFV QRAAESADSK EDIKTGEGEA IRSEGEGIDW RHEKKDDLFL KGQSKRIWGE FFKVVDCSDV VLHIIDARNV PGTRCTMIER HIAKNASHKH LVFVLNKIDL VPNWVAKRWM GELAAVRPTI AFHASLTNAF GKGALISLLR QFGKLHEDKK QISVGVIGYP NVGKSSVINT LISKKSCKVA PIPGETKIWQ YVTLFKRISL IDCPGVVVDT AGDTEEDSVL KGVVRAERLE NPEDFIDAIM GKVKREHIAA QYKLPKDGEE TWSSSGELME MIARRSGRLL KGGDPCIRTA ALMIINDFQR GRLPHFIPPP ELKVEEDQKT VATEQAIQIE EQNLDDVVVV EKEMEQDEEG VETKTNDHAA IADTETVIDT SSILIGDGKW DE
|
| |