Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50310 |
Symbol | |
ID | 7199056 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 260839 |
End bp | 263325 |
Gene Length | 2487 bp |
Protein Length | 699 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185160 |
Protein GI | 219129993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.049035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGAATTCAT GTGACCACAA TGCAAAACTC AATTCGTCGA TTTGCAGTTG GTCTCTAACA AATACAGTCG GGTCTGCACA GCTGTGACGT AAAAGCTCAA AGCGTTTGTT GCTTGCAGCT TCGTGAATAG CGAGATGAGC AGGCATCGTG CGGGCGCCTC GGCTCCAATC GATTTATCGT CATCAGACGA AGAAGATGCC TTTTCGGCTT TGAATACGAA ACACAAGATC CCACCTGGAA TTTCGACAAG GAAGGCTGAC ACCTCGCAAT CGCAAGCCAC CGCTAAATTA CCCGCTTCTG TAACCTCTTC CATGAAGCGA CACCACGTCG AACTAAGCGA TTCACGAAAA CGAAAAATGG ACGCCTTGCT TTCCGAGTTG GAATACGAAA AGAACAACAT ATCCATCAAA CCGCATCGCT TTGTCCCAGA GAAAAAGGGA TCATTCGTCG AGCCTGGTGA AGAGCTTTTG ACGACGAATA TATTCGTAGG AAACTTATCG CCTACCTTGA CGGAAGAACA GGTTGCGGAG GTTTTTCGGC AATTCGGTGA GCCTGGCTTT GCGCTGGTAT TTGTTTGATA CTGATTATGT TGATGTGTGT ATGTGTACAT ATGCTATGGT CTATATCTCT CCTCGTGACT CCAGAAGATT GTTCGTTTCG CATATTTACG TGTGCACTCT TCTTTGCCGT TGTCCCGAGT ACTTGTGATT TTCTCACGTG TTTCTTCGAG ATCAGGTGCG CTATACTCGG TCAAGATTAT GTGGCCGCGA ACGCCGGAAG AAAAGATGCG GAACCGGCAC ACGGGGTTTG TGTGCTTTAT GAATCGCCGA GATGCCGAAG ACGCTATGGA CGCCTGCAGC GAAGCGGACC CGTTCAACGT GGGACGCCCG CTGATGATGA GATGGGGCAA GAACGTAAAA CGAACTGGTC AGCGGCCTCC GTTGGAATCT GATCTAGCTT ACAGGAAGAA GGTGCCCAAT ATTGCTGATA CACCCGCGAG GCAAGTCAAT AATGATAACC ACATCGATCG TGATACTATT GTTCATGAAT CGAACATAAT TCGTGTGATT GCACCTTCCG ATCGGCAACG AGCACAATTT ATATCCACCG TGGCTTCGTT TGTGTCTAAA GACGGGCTCG CTTTTGAAAA GAATCTTATT GATCGAGAGA GAAACAATGT ACAGTTCAAT TTCTTGAGGT GGCAAAGTAA CGGAGATACG ATCGAAAAGG ATGAACACAT ATTTTACCGC TGGCGAGTGT ATTCCTTTTG TCAAGGAGAT GGCTTCTACA GCTGGAAGAC GATTCCGTTT CGTGTGTATG AACCAGGCGG TTGTCACTGG ATTCCTCCTG TGATTGACCC CGATGCTGCA CGGTTCGAGA TGGAGCACGA AAGAGAAAAA GAAGAGGCCA TCGAACGCCA AAAAAATCAG CGTCGCGTTC AACACGGTCG ACGCGGCTTT TCTACTGGAC GCCAGCTTGA GCAAGCTCGC CGTGGTGGAT CCGATGGTGG CGCTGTCATG GCGCCTGAGG AAATGATTGA CTTCAACAGA TTGTGCCGCG ATAATCTTTG CGCATCTAGG GAGGCTATTT GCTCGGCCAT GGCCTTTTGC TTCGAGAAGA GTGTGGCTGC GAAGCAGATT TCTATACTGT TGAAAGACCT GTTGCTCGAC AAGGGAAATG CCGTTAGCGT TGAGACCAGA ATTGCCCGTA TGTACCTTAT GTCGGACATC TTGTTCAACT CACAACAACC AGGTGTACGG AATGCATTTT TGTATCGAGA TGCCGTCGAG CGCATGGCAT CAGAAGTCTT TACTTTCTTG GGCGACTACG GTAATACGAT CGGTCGGTTT TCGCGTACTA AACTTGCATC AGCCGTGAAG GCGGTGCTTG GGGCGTGGAC CAACTGGGGT GTGTACAATC CTACTTTTAT TGATGAGCTC GATGACCGAT TTGAAGGGAA AGAACTTGTC CCGGAAAGCG AATACGGAGC CAACCCTATT GCAAACGAGG ACGACGATAA GATCGAGGAG GTACAAATGG AGACAACGCC GGCTGTTAGG TTGAATCCTC AGGGCGACTG GGTAACAGTG ATGGAGGGAG AAAATGATGA GACGCAGGGA CAGTCGTCTC GGTTGACGAA GCAGGACAAG GGAGAGTGTA GCGACCACAC TGCATCCGAC GACAGCGATG CTGATGGTGT AACATTGGAG GAATGTGACA ACGTCGATGG CGAACCTGTT GGAGATATTC TGCCGCTCAA AAGTGATGCC GACAACAACG AAGATGGTGA ACCCCTTGGC GAAACCTTTG ACCATACTGA CGGTGCTCCT TTGGAGGGGA GCGACCTAGA TGGTAGTCCT TTAGAAGATG AAGTCTCGCA TGATGTTGCT CTCGAGAGCG AACTAGATGG CGAGCCCTTG TAACGAATAT AGGGTTATTG AAGTGCATTT TGAGGGACTG CATGTAGACG AGTGAGAAAG CTATTGA
|
Protein sequence | MSRHRAGASA PIDLSSSDEE DAFSALNTKH KIPPGISTRK ADTSQSQATA KLPASVTSSM KRHHVELSDS RKRKMDALLS ELEYEKNNIS IKPHRFVPEK KGSFVEPGEE LLTTNIFVGN LSPTLTEEQV AEVFRQFGAL YSVKIMWPRT PEEKMRNRHT GFVCFMNRRD AEDAMDACSE ADPFNVGRPL MMRWGKNVKR TGQRPPLESD LAYRKKVPNI ADTPARQVNN DNHIDRDTIV HESNIIRVIA PSDRQRAQFI STVASFVSKD GLAFEKNLID RERNNVQFNF LRWQSNGDTI EKDEHIFYRW RVYSFCQGDG FYSWKTIPFR VYEPGGCHWI PPVIDPDAAR FEMEHEREKE EAIERQKNQR RVQHGRRGFS TGRQLEQARR GGSDGGAVMA PEEMIDFNRL CRDNLCASRE AICSAMAFCF EKSVAAKQIS ILLKDLLLDK GNAVSVETRI ARMYLMSDIL FNSQQPGVRN AFLYRDAVER MASEVFTFLG DYGNTIGRFS RTKLASAVKA VLGAWTNWGV YNPTFIDELD DRFEGKELVP ESEYGANPIA NEDDDKIEEV QMETTPAVRL NPQGDWVTVM EGENDETQGQ SSRLTKQDKG ECSDHTASDD SDADGVTLEE CDNVDGEPVG DILPLKSDAD NNEDGEPLGE TFDHTDGAPL EGSDLDGSPL EDEVSHDVAL ESELDGEPL
|
| |