Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37873 |
Symbol | |
ID | 7202665 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 279890 |
End bp | 281761 |
Gene Length | 1872 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182041 |
Protein GI | 219123458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.126531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCCG CCGAAGCGGA GCAGTATGTC AAGCGACATG ACAAAGACGA ATGGTATTAC AGAAAGAATT CAATAATGAT AGGTAGTGCG CGGGCTCTTG GCCGTCGTCA AGCAACCGTG CCAAAAACTG CGTATACCCA CTTTTTGCAA AGTAACACGC GACGTCAAAA GGAACACGCA GACAATTTCT TAATTCTTCC GATTTTTCGT GAAGGATGGA GCTAAAAGCA ACACAACAAG GGGAAAAAAC AATACTGGAG CTCGTGTTCA TTCGACTGTT GAATGTAGAT GGAACGTTTG ATAGAACGAT GGCGTGAAAT GTTGACGAAA TTCAACGGTG CGCGAAAAAA CGTGAATTCG CGCCTTGGTG ACAATGACGG CTGTTTTACA GTTGGTAGGA TTGTCGATTC ACATTCGACC CACGGTCAAG AATTTTTTAG AGGTTTTGCT GTCTTAAGGA TGAGACAACG CCTCTCTGCG ATTATTGTAG TAGCAGTACT GCCTCTTCGA ATTTTGTTGC ATGTGAGCGG CGTCGAAGCC ATAAACATTG GGAACGTTGC ATCAGCTCGT CGAGGACCCT CCCCAGTGCG CGGAAGTCGT CGGGACCTTG TCAGAATCTT GACAAATTCA TCCACCCCGT ATAGCAATGC TGAAAAGGAT AGACCGGAAA AGGGAAATCC AGAGTGTGAC GACTATTGGT GGAGTTACCA AGGGAACAAC GGAACAAGTT TGCCCTGTAG AGATCCAGTA GCGGACGCTG AACAACTCCC GGGCGAGGGA GGCGTCGGTG GCAGCGACGG CGATGGTATC GAAGGCGAGG ATTCTGGCGA GAATGGGGAA GGCATAGTTA CACAAGCACC CTCAGACACA GATTTCGGCG GCTCAGACGG TCCAACCGGA GTATCTACTG TTTCGAATGA CGTTCAAATA ACGCTATACG CTGCTTTACG TCCTAGGCAA CCAAGTGTTG GCCATACGAT CATTTGGGTG ACTACCCATT TTCTGAATAC CACCATGGCA AAAGACAATT ACATGTTTGC GATACCGCAG CGTCAAGACA ATGACGGCAA TCTGGATGAT CGAAGCTTAT TGAGCACAAA TAATACACGT GCTCAAGTCG CGGGTTTCTT ATTGATCCAT GACTACACCA GTTCAAAAGT GGTCCATAAG CGTGAGTCTC GTTGGTGGTG GCAGTATAAC ATAACCTATC ATTGCTACTG GCCAGAAGGA ACAGAGGCAG TTACTGACGC GTCCATTCTA TCTTCGGTGG ACCAACAAAT CAAAGATGTA CTGTTCTCAG CAATCAATAG TAGGTTGTTT CAACAGTGGA TGGACGACGA AACGCACGAC GATTTCATCC AGGTATGGTA TTCGTTTCAC CAGATTGAAG GTGCTACTGC TGAGCCGCCC GTTGACGAGA ATCCAATTCC ATCACCTCCC CAGACTCCAA TCGGAAAGCC CACTCAGGCA CCAAGTATTT CGGATGGTAA ACTACGGGAT GTAGGCTCCT TTACGACTCC ATTGGATCAA AAGGACTGGG ACTGGCGCAG GTATCTAGGG CTGGGACTTT TCGTCGGAAC ATTATTTGGA ACGCTCGTTT TGACTCAATT GGCCGCTTAC CGACATCGAC TGATTACTCG AAAGGAATAC TGGGGTAACA TTGGGACTGA AAGGGGTGTG AATGAAATTT TAAATCTAGG CTGGAAAGTG CGAGGCGGAA ACTTGGAGGT GTATGACAAG GCTGGGGTAG GGTATCGAGA TGATGATTCA ATTTTGATCG GTGGCTACGA GCAGAAACAA GTTGTGGGCG CCGAGATTAC AGTAACACTC CCATCATCGG AGACGACACC CGACAAAACG CATGGAAGCT AA
|
Protein sequence | MLSAEAEQYV KRHDKDEWYY RKNSIMIGSA RALGRRQATV PKTAYTHFLQ KRWREMLTKF NGARKNVNSR LGDNDGCFTV GRIVDSHSTH GQEFFRGFAV LRMRQRLSAI IVVAVLPLRI LLHVSGVEAI NIGNVASARR GPSPVRGSRR DLVRILTNSS TPYSNAEKDR PEKGNPECDD YWWSYQGNNG TSLPCRDPVA DAEQLPGEGG VGGSDGDGIE GEDSGENGEG IVTQAPSDTD FGGSDGPTGV STVSNDVQIT LYAALRPRQP SVGHTIIWVT THFLNTTMAK DNYMFAIPQR QDNDGNLDDR SLLSTNNTRA QVAGFLLIHD YTSSKVVHKR ESRWWWQYNI TYHCYWPEGT EAVTDASILS SVDQQIKDVL FSAINSRLFQ QWMDDETHDD FIQVWYSFHQ IEGATAEPPV DENPIPSPPQ TPIGKPTQAP SISDGKLRDV GSFTTPLDQK DWDWRRYLGL GLFVGTLFGT LVLTQLAAYR HRLITRKEYW GNIGTERGVN EILNLGWKVR GGNLEVYDKA GVGYRDDDSI LIGGYEQKQV VGAEITVTLP SSETTPDKTH GS
|
| |