Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44799 |
Symbol | |
ID | 7199754 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 274640 |
End bp | 275943 |
Gene Length | 1304 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178966 |
Protein GI | 219116342 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCAG GCCGCAACGA CGACAGCCAT TCCGATCACC GTTCCGCTTC CACCAACACA TCTTCGAGCA AGCTCCAAAC AGATTGTGCC ACCGAACACG GAAACTCCTT GCAGTGCATT CAGGATAATT TACACGACAA GAATGTATGT CAGCCCTTTT TCAAGGCTTA CAAGGAATGT CGCGCCGAAG AAAACAGACG TCGGCTCGAA GCTAACGCGA AGAGGTTTTT CTGGCAGTAA CATTCAGTCA CAACCCTTCT CGGCAATATT ACTTTGGAGT TGCGCTCTCG TGATTTGGTG TTCTACGACG CGCACAGTAC AAGGCTTTGT TTCAATGAGG CAATCTATTG CCCCGTCTTT TCCCTCTGTA CGACTGCATG ATATTGACAC CGGGCACGAC GACAGTATTG TAGGAGCACG TTTCTTGCTT TCGTTTGATC TCGACGACAC TCTTTTTCCA ACGAGTCAGG TTGTGAACGA AGCCAACACA GTCATGATTG CAGCCATGAA TGCCTTGGGG TACGCAATCT CGGTGGACGG GTTTTTGGAA ACGACGCGGC GGATTCGAAA AAGACTGACC CAGCCCGTGA CTTATACCGT CTTGCGCAAA ATGGCGATCC GGGAGGTTAT ACAGTGTCAA CTTTCTCAAG GCGATACGCG AGGGCCAATA CTGATTGAGG AAAAAGCGAT AGACGAATTA TTCAACGTGT GGCTGGAGGA ACGACACGCG GCAGCGGAAC GCTATCTATT TCCAGACGTA ATTCCTATGT TACAATCGGT TCGGGCACGT TTTCCAGGTG TCTGCATTGC CGCCATTACA AATGGTCGGG GCGATCCACT AGCAATGAAG GATACACTGG CCCCCTATTT TGAGTTTTGC GTTAGTGGAG AAGATAGCAA CGTGTTTCCT GATCGCAAAC CCCATTTAGG AATTTACGAA GCGGCGTTGG CACTCTACAA TGCAACATTC CGGGATCAAT CGTCAGAGGA GCTCTTGTGG TGTCACGTGG GCGATTGCTT GGCTAATGAT GTGGGAGCCA GCGCCGGGTG TGGCGCCTTT GCAGTATGGT TTTGTCCGGA CGATCAAACC ATTGAATCTG CCGCTTCTCG TTTGAAGGAC ATACGGAGCA TGCCGTCCTG GTCCACGGCA TCCGCAGCTG ATATAGTTGA AAGAGCGAAA ATGGCCGAAC ATGCCAAGGA AAAGGTTGCC ACTCGAATTC GTTCGCTATC GGAGCTCGAC GAAACGTTGC TCAAATTCGT AGAACGTTCC CACTCTCTTG CTTCCGCTAC GTAAAGAAAA AAAG
|
Protein sequence | MPPGRNDDSH SDHRSASTNT SSSKLQTDCA TEHGNSLQCI QDNLHDKNVL QGFVSMRQSI APSFPSVRLH DIDTGHDDSI VGARFLLSFD LDDTLFPTSQ VVNEANTVMI AAMNALGYAI SVDGFLETTR RIRKRLTQPV TYTVLRKMAI REVIQCQLSQ GDTRGPILIE EKAIDELFNV WLEERHAAAE RYLFPDVIPM LQSVRARFPG VCIAAITNGR GDPLAMKDTL APYFEFCVSG EDSNVFPDRK PHLGIYEAAL ALYNATFRDQ SSEELLWCHV GDCLANDVGA SAGCGAFAVW FCPDDQTIES AASRLKDIRS MPSWSTASAA DIVERAKMAE HAKEKVATRI RSLSELDETL LKFVERSHSL ASAT
|
| |