Gene Information |
Locus tag | PHATRDRAFT_39734 |
Symbol | |
ID | 7195449 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 571369 |
End bp | 574427 |
Gene Length | 3059 bp |
Protein Length | 829 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183635 |
Protein GI | 219126796 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
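The coordinate fields above are consistent with 1-based, inclusive positions: End bp minus Start bp, plus one, gives the listed Gene Length. The sketch below checks that arithmetic, and also estimates how much of the span is non-coding. The function names are hypothetical, and the non-coding estimate assumes the 829 aa protein is encoded by 829 codons plus one stop codon, which the record itself does not state.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (assumption: one stop codon)."""
    return protein_aa * 3 + (3 if include_stop else 0)


# Values copied from the Gene Information fields of this record.
gene_length = inclusive_length(571369, 574427)
coding_nt = coding_nucleotides(829)

print("gene length (bp):", gene_length)
print("coding nucleotides (bp):", coding_nt)
# Because the gene is marked as spliced, the remainder is a rough
# estimate of intronic sequence under the assumption stated above.
print("estimated non-coding (bp):", gene_length - coding_nt)
```

The inclusive convention matters when comparing against tools that use 0-based, half-open intervals, where the same feature would be written as start 571368, end 574427.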
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACC AACGGTGGCG CGATACGGAC GGTGACGATT CCCGGAGTCG CAAGAGTCAA CGCAAACGAC AATCCCGGGG CACCAATTCG TACCAACGTC CCTGGAACGT GGAAGACGAC GACGACAACA ACAATAATAA TGGAAGTAAT CGTCGACGCA GTACCCAGTC GACGTCGTCG TCGAGATCTC AAAGTCGTCG GAGTTCAACG ACGCCATCCC ACACCCAAAC GACCAAGAGT CGATCCTCCC GTTCCGCCTC GTACACATCT CCCGCAATCA CCACCACGAC GCGAGTCCGC AATCGACGAT CCTCGCACGG TCCCGTCGGA CCCTCGACGA CTCCCGTATC GGTGGACATT GACGTTTCGG ACGATGACGA CGATCACGAC CATTACCGCA ACGTGGACTC GGCGCGGGGC TCCCATGGGG CGGCGTACCC CGCACGGTCG ACACCGTCGC CCAGTGCCGA ATCTACCACT TCGGCGGCCT CGTTCCGGAC TTCGGCCGCG CTGGAAAATC GGACCGCCCG ACACTATCGG CGACGCACCC CGTCGCCGCA GGCGGTCGTG TCGGGCGGTC CGGAAGCGAT GGGGTCCACC CAACGCGTGT CGCTCACTTC CAGACAGTAC GAAGAACGAC GTCGACAAGA ACAAATACAA GCCGCCCAAC TAGCCGCGCA AGAACAATTG CAAGCCGCCG CCCTGGCGCA ACAGCAGCAA CTACAACAAC AACGGCAACT ACAACAACAA CGGCAACTAC AAGAACAACG GCAACTACAG GAACAACGGC AACTACAAGA ACAACGGCAA CTACAGGAAC AGCGGCGACT ACAGGAACAA CGGCATCAAC AAGAAGAACG GCAACGGCTA CTGAAGCAAC AGCAAATGCA ACAGCAGTTG CAAAAAATAC AACAGCAGCA AGAAATTGTG CGACAACGAC AAAAAGAAGC TGCCGCGGCC GCTGCGCAAC TCCAATCGTC ATCCCGTGAA CCCACCGCAT TCGTACTCGG ACCCGGCGGA ACTTTACCGC CGCCCCCGCC TCCACCACCA CGTACCAGCC CCCCGGGATC GATCTCGCGG GAACTAAGGA ACCGCCGATT GGCATCCCAA CGACCATCGT CTCCTGTCGC GATCCGTCCC GCCGTACCGC TTCCCGTACA GGCTGCTGCG TCCATACCAC CCCGTCCGCA CAACCCACCC CAGGTACCCG CAGTCGAATC GGCTCCAAAA ACACCCTCCC CGCAAGCGGT TGCGGTCGAC TCACGTCGCG GTGGCGCTGC CTTCTTCAAA CGCACCGCCC CCGTCCCCAT CGACGCCGAA GATTCCGCCC GACTCCTACA GCTCGAACGC GAGGCCCGCG AAAATGAGCT CGCCAAATTG CGCCAAGCCC GTTGGGTGCG GGATAACGAA CGGGAACAAA CTCGTTCCGC CGCAGCACGG ATCGAGGATC CACCCCTCAC ACAATCATCC CACCCACCTA CCAGTCGCAT CGTACCGTCG GTGATTCCAC TCGACCCCAA TCCTACCGTT GTTGCGGAAT CGTCCCGTCG TGGGCACACC GATGACGTAC AACCATCGAC CATTCCCGCG GCGGTAATCG CGTCCCCACG GAAGGCGGTG GTTCGTCCGG ACCCCGAACC CCGGGAGAAA GTCGATCGTC CCGTCCCTCG ACGGTCCACT TCGCGTCCCG TAACGCCACC ACCGGGTCCA ACGTCGGACG ATGTCTCCGA CGTCGGTAGT GCCGCGAGCG ACGTGGGGCG GTATTTGGCC CAGCGCAAGT CATCATCATC CCCCAATCCT 
GCCTTTTTGG CATCCCGCGC TGCCTTTGCG GGGAAGACTT TAACGACGAA CGGGACTCGG CCAATGAGAA ATGCGTTGCC GCTGGCGGCC GATAACGAAG CCTCTGACGG CAACGAGTCG GACGATTCGG AAGAAACCAC CACGGAAGAG GAATCCAGTA GCGACGAAGA CCCGGACGTG CCGGAATGGA GTGTACGAGT GTGTGTCGTA TCGGCAATAG ACTTGCCCGC CAACGTTGTA CCCAATCTAC CCTTTAGTCC CGTTTTTGAT ATTGGTTTGG TTCGATTTCC AGGCCCCGAC GTGGACTCCC GGATCCAACT ACCTGGTGAC GACGCCAGTC GACAAAACCT GAGTGCCTTG CTGACTAAAG GAGGCTTGCA CTCTGTACCA CGGTCGCGTG TCCAGTGTAC CAATTTGAAA ACTCTCAGTC AACGAGACAA CGGATCGGTC GAATTTCACG AAGAACTGCG GTGGGACCGT GTCAAGTACC CGGAAGAGCT GGCGCTGGCA ATCCAGCTCT CCGCTAAAGC GGTGATTGCT CCGGCAAACA TCAAGGAAAA CCCACCAATG CAAAAAGTCC AACCACTCCA GTTTGGTAAC CCGTCGGCAC AAAACGGAAC GGGGTGGAAC TCCGCGGGTA AGACCAGCTC CGGTAGTAGC AGCGAGGCAG GGGGGTCTCT CTTTCGGCGC ACCCAAAAAA GCTCGGCCGA AATGGAAACC GCCAATGCGG CTGCAGCCGT GGCCCAGCTT TTGGTGACGG GTGAAAAATC GAAAGCGGAC GACGAAGGGC GTCTGTTCGA CTCGAAGGCG GCAAATTCAA AAGCCGTGTC GGGCAATCGG AGTGAACTGA ACGTAAAGCT CAAGCATCGC AAGCGACGTC GAAAGCCAAA ATTAACCGAC GACTTGCGCC TCGGTTCCCA GATAGTCCCG CTCACCGCGT TACCACTGCG GAAAGCCTAC GGCCAGGAGC ACGAGGCAAG AATCGAACAG TGGTTTGAGC TCGATGGTGC GAGCGACGTA ACACCTACGA CACCGTCGCC AGCCAGCTCT CCCGCGAGAG GCAATCGGCG AAATCCTAGC ATATTATTGG AAATCACGTT TTCTGCACCG GAAGTACTGG ACGACAGTGA GGACGAACTG GATGATAATA TAAACTCCCC TCGGACCAGT CTGAATGCAT CATTTTCCAA AAGAGAGTCT ATTAAAATTC GGAACCAACT TAAACAACAG GTTGTTGTCA AGGCGAAGGA GAAGGTTGA
|
Protein sequence | MNYQRWRDTD GDDSRSRKSQ RKRQSRGTNS YQRPWNVEDD DDNNNNNGSN RRRSTQSTSS SRSQSRRSST TPSHTQTTKS RSSRSASYTS PAITTTTRVR NRRSSHGPVG PSTTPVSVDI DVSDDDDDHD HYRNVDSARG SHGAAYPARS TPSPSAESTT SAASFRTSAA LENRTARHYR RRTPSPQAVV SGGPEAMGPP GSISRELRNR RLASQRPSSP VAIRPAVPLP VQAAASIPPR PHNPPQVPAV ESAPKTPSPQ AVAVDSRRGG AAFFKRTAPV PIDAEDSARL LQLEREAREN ELAKLRQARW VRDNEREQTR SAAARIEDPP LTQSSHPPTS RIVPSVIPLD PNPTVVAESS RRGHTDDVQP STIPAAVIAS PRKAVVRPDP EPREKVDRPV PRRSTSRPVT PPPGPTSDDV SDVGSAASDV GRYLAQRKSS SSPNPAFLAS RAAFAGKTLT TNGTRPMRNA LPLAADNEAS DGNESDDSEE TTTEEESSSD EDPDVPEWSV RVCVVSAIDL PANVVPNLPF SPVFDIGLVR FPGPDVDSRI QLPGDDASRQ NLSALLTKGG LHSVPRSRVQ CTNLKTLSQR DNGSVEFHEE LRWDRVKYPE ELALAIQLSA KAVIAPANIK ENPPMQKVQP LQFGNPSAQN GTGWNSAGKT SSGSSSEAGG SLFRRTQKSS AEMETANAAA AVAQLLVTGE KSKADDEGRL FDSKAANSKA VSGNRSELNV KLKHRKRRRK PKLTDDLRLG SQIVPLTALP LRKAYGQEHE ARIEQWFELD GASDVTPTTP SPASSPARGN RRNPSILLEI TFSAPEVLDD SCCQGEGEG
|
| |