Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_18549 |
Symbol | |
ID | 7204174 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 504413 |
End bp | 509119 |
Gene Length | 4707 bp |
Protein Length | 1250 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186361 |
Protein GI | 219113555 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACTGGCGCC TTACGGATCG AACCAGGACT GTGGGAATCG GTCTCGTCTT GGCCCTCAAT TTAGGTATGG ATCCTCCAGA TGTTGTGGAT AAACCTCATC CCTGTGCAGT GCTTCATACT TGGATCGACC CCAACGCCGA ACGTGCGGCC ACGGCACGCG AAAAAATCGG AGAACGCCTT GAGCAACAGT ATTCCAAGTG GCAGACAGCC AGCACGGCCT GGCCGTTGCG CAAACGACGG ACGGTAGATC CGACCATGGA AGATGTACGG AGCCTTTGCC AGCATCTCCG TCGACACGCC CGGAACGAGA GAGTCTTGTT TCATTACCAC GGACATGGTG TGCCGAGACC AACCCACAAC GGAGAAATAT GGGCGTTCGA CGCCAAGCGT ACAGAATACA TTGCCTTATC GGTTGCAGAT GTTCGCACGT GGCTGGGAAA ACCCAGTATT GTAAGTGTGT ACTTTTGGCA GATGTTCGGT AGCTGTTGAG TATCTACTGA CGATTTTTAC TTCATTCGTG ACTATTACCG GCCAAGGTTG TTTTGGACTG TTCCTCCGCT GGCGTGCTAG TGCCTTTCTT TACGGCGCCA ATGGCCGATC CTTCGGTTGC ACCCTCCTCG ACACCATCGC GAGCTCAGAC CGAACCGACA AATCTCAGGG AACAAGCATC GGAGTGGGTT AGCGACACAG TCGTACTATG TCCTTGCTCT GAAGGCGAAT GGCTACCCAT GAACCCAGAC TATCCCGTCG ACATTTTTAC GTCGTGTTTA ACCACTCCAC TCAAGATTGC TCTTCGGTGG TTCATTCAAA GGAACCCCGT CAGCATGGCT TCCCTTCACC CCGATGCAGT AGATCACATT CCTGGTGTTT TCAACGATCG AAAAACTCCA CTCGGAGAAT TGAATTGGAT ATTTGCCGCC GTAACGGACT CAATTGCTTG GAACGTACTG CCCAAACCGC TATTTCAGCG ACTTTTTCGT CACGACTTGC TGATGTCCAG TATGTTTCGT AACTTTTTAT TGGCTGACCG AGTTCTGCGA TCGATGGGTT GCACCCCGGT ATCCTATCCT CCGTTGCCAC CGGGCATTTC GAATCATCAA CTGTGGCAAT CTTGGGACTT GGCCTGTGAA ACAATGTTGG TGCAGCTCAT GAATGACGGA GTGCTCGGAA ACCATCTGAT AGAAAGCAAA AGAAATCATG ATGCAGAAGG CAGGGATGAT ATAGACGACG ATAAAAACGT GGATACGGCT CCGGCTCCTA CAGAAAAGAC TTCCAGGCAC CCCACCAATC ACACAGTGAG TGTCTCATCA CCGTTTTTCT CAGAGCAATT GACAGCTTTT GAAGTATGGC TAGAATTTAC GTCAATCCAT AAGATGCAGT TGAAACTCGG AATGCTGGAG CCACCCGAAC CGCTGCCAAT GGTACTACAG GTCCTACTTA GTCAGACACA CCGTATTCGT GCGTTGCACC TCCTTCGGCG GTTTTTAGAA TTTGGCCCAT GGGCAGTTAA CCTGTCACTC ATTCTGGGAA TTTTTCCCTA CGTGTCGAAA TTGCTGCAAT CTCCTGACTA CAAGAGCTTA TTGGTTGGTA TTTGGGCATC TATCTTGGCT TTCGACCCTT CTTGTCGCGT TGACATGCTG AAAAACGGAA TTCTTCACCA CTTTGTCCAG TATTTGACAT TGGACCCGGA TGCATATGGT TCAGTTAAAG AAGCCAAAGC CGCTCGGGAG CGAACCTTGG CTGCGTTCGA CTTGTCCATT ACTTGCTGTG GTTACCCTAT GGGGCAGTCC GAATGTGTTC GACTTCGCCT CCATAATCAT TTCTGCGCGC TCTTTTCTGC TTACGAAAAA ACAGAAAAAT GTCAGTATGA CGATATGGAA CTTTATTTGC CAGCGCGCTT TCGACTTTGG TTATGTATTT GCATCGCGAA TATGATGAAA GACAACAATG CAACGCAGGC TGAAGCGTAC AACGCTGGTG TACATTCACG CCTATTTGTG CGAATGAGTG ACCAAGATCC GGACGTGCGA GCCGCAGTTT GCTATGCTCT TGGTTGCCTC CTCGGTACTT CCGCGAAAAA TGAACAAACA GGGAAAAGTC ACTCACTGCC GATTCAGAAC AACCCAGGTC AATACAGAGT ACCGTTCCAG TCTACCCCCG CACTAAACCC TCAGCAACTT CAGCAAAGTG TTCCCGCGAC TACGGTACCG GGAGCCCTAA AGCCAACCTT CATTTCCCAA GGTGGTACAT CGAGCCTTCA GTGGCGGCCC CGACAACTTC ACTCTGTCCA AGGTGTTCCA TTTCCTGGGC AACCGATGGC ACCTCTTCAT GGTCAGTACT TTACAGCGCA GCCGGGAGCT GGTCTGAACC AACAGCCTAT GCAAATGCAA TCACAAAACG TACAAGAAAT GCAACCCCAG TACATGGTGC AGGGCCAGAC GCCTAATCCT TCTCCCAATG TCAATAGACC CAGTGGCTTT CTGACCGGTG GTGGCTTGAT GAACCCGCAG GCGAAGCAAC CTCTTCTTGA CGCCGACACT TTTCAGAACA GACCGATACT CGAGCAGCGG CAACTCAGCC CGTCAGTTTA TGAAGATCAT CAACGACTCG ATCTAGATTT GTTAACGATT GAGGTCTTAT TGAAAGCAGT TGAAGACGCG AGTGTTGTAG TACGGTACGA AGCTACACTC GGTCTAGCAC GAGCAGTTGG AAAATATCTT GATGCCTTTG TATCCGTTGC CTCCACCGCA GCAAGCAATG AAGATAGAAA TATGTATGTT TCTTTGCCTC CTGCCGTTGA TCTCAAAACA GGCGATCGTT TTGCACTGGT TTGGAGCATC TTGCGGTTAT TGCAGCACAA TGACCCCTTT CCTTTCATTG CTAAGGCGGC AAACGATATC GTGTGCGTAG TCTTTGAGCA TTTACTTCGC CGCCGATTGA CTACCAGCGA TCGGGATGAC AAGGGGCGCC AGAGTCTAGC GCTGCGCGCT GAGAAACTGG CATCAAATTT GACAGGGATC GAAGAGGAGC AGGACTCCGA TGGCATTGCG GGCCATTCTA TCGATTCAAC TCCACCACTT AGCGAAGCCC CCACTCAGCA ACCCTCCGCC GTCATGCAAA AGTCACCCCA AGCCGATCTA CGTCGCGTTG CTTCAGAATC TATAACAGGT AGGAACACAG CTCACTCGAT GGAGCAGAAT GCCCTTGATA TGAATTTATC TGGAGTCTTA AAGGAAGATA TGGTTAAAGG CGGTCCCGTA CATTACATAC TTCCGAAGTC TCAATTTTTC GAGTGGAAAA GGGACTCTTT CGACATAAAC TTTGAGTTTG TAGATGATGA TACTCCAGCT GATACAGACC CTCTCAGTCC TGAAGGCGCA GCAAAGATCT ACTTGGAGCG ACGGAACTTC TCGGTTCGCG AAAACGCCAG AAAACTGTCG GATCGATATG CTTTGCTAGC ACCGAAACTT TCCGGCCCGA AAGGAAAAAG TATAGAAGAA ATGCTTTACG AACAAGAAAG CGAGGATGCT TTGCTAGCAG TAGAAGAAGA AGCCACTTTG AAAAAGGGCG AGCTGGAGCT ACAAGAAAAC ATGCTTTTAC GCAACGAAGG AGTCAAGATG ACGTCTATGG TTCGGTTTCA TGCTTATGAA GATGTACTGA TGGCTTGCGG CGCGTCAGAT GCCGTTTCAA TGTGGTGCAC AAACTCCGGC AAACAGCTTA CGAAATTTCT CAATAGCAAA ACAAAAAAGG CTCGATTGAC TACATCGGCT TGGATCAATG AACACGCATC CAGTCTTTTC ATGGTTGGCG GCGACACTGG CAACGTTCAC GTGTGGGGCA ACCTCCTTGA GAGCAACGGA GAGGCATGCA GAACACCTGC CAAGTTGATC TCAGCCTTTC AAGCCGCTCC CATGACAGCG GGACAACGCG GTAGTGGACT CATTTGCGAA TGGCAATCGT ACAGTGGGAC ATTGTTGGCT GGTGGCAGCA GCAAGACTCT GCGCTGCTGG GATCTAGAAT CGGAAAAACT TTCTAATCAA TTTGAGACCA ACACAGACGC AAACCTGACA ACTTTGACGA CTGCTTGGGA TTACGATGAA CTTGGAATGG GGCCCGGGCC TAAAGGATAT CAAGGCATTG GACAGGATAT CGTTGTCGGC GGTTTTAGCG ATGGAGCTTT GAGGATATTT GATATTCGCA CCAACCAGGC AGGCAGTAGC TTGCAGTCTA GCCAACCTAC ACGACCTCGC CGAAAGAGAG CTACTGAGTT TTCTGAACAC AAGACTTGGG TCGTTTCGAC TTCGTTCACT GCGTATGGAA ACCGTTACGA GCTCATTTCG GGAACAATAT CCGGGGAAAT AAAGATTTGG GACTTACGTA TGTCTTCAAG TATTCGAACA TTTGCTGCAC AGAGAAGCAC CATGACGAGC TTTGCGGTCC ACTCGAAGAT TCCAATACTG GCCACTGGAT CGCACGCTCA GTTCATTAAG TTATTGACTC TGGACGGCGA CGCATTCCAG GTCATGCGGT ACCACGGCAA AATGGCGAGT CACCGGATCG GACCGGTGAG CTGCCTTGCA TTCCATCGTT TCAAACCCAT GCTTGCGGCT GGAGCGACGG ACGCTTACAT TGGACTCTAC ACTACCAAGA AAAGATTCAT TTAAACAGTT ACATAAATTG TGTAAATTGT AGAACAATAC TGAATTG
|
Protein sequence | MDPPDVVDKP HPCAVLHTWI DPNAERAATA REKIGERLEQ QYSKWQTAST AWPLRKRRTV DPTMEDVRSL CQHLRRHARN ERVLFHYHGH GVPRPTHNGE IWAFDAKRTE YIALSVADVR TWLGKPSIVV LDCSSAGVLV PFFTAPMADP SVAPSSTPSR AQTEPTNLRE QASEWVSDTV VLCPCSEGEW LPMNPDYPVD IFTSCLTTPL KIALRWFIQR NPVSMASLHP DAVDHIPGVF NDRKTPLGEL NWIFAAVTDS IAWNVLPKPL FQRLFRHDLL MSSMFRNFLL ADRVLRSMGC TPVSYPPLPP GISNHQLWQS WDLACETMLV QLMNDGTSRH PTNHTVSVSS PFFSEQLTAF EVWLEFTSIH KMQLKLGMLE PPEPLPMVLQ VLLSQTHRIR ALHLLRRFLE FGPWAVNLSL ILGIFPYVSK LLQSPDYKSL LVGIWASILA FDPSCRVDML KNGILHHFVQ YLTLDPDAYG SVKEAKAARE RTLAAFDLSI TCCGYPMGQS ECVRLRLHNH FCALFSAYEK TEKCQYDDME LYLPARFRLW LCICIANMMK DNNATQAEAY NAGVHSRLFV RMSDQDPDVR AAVCYALGCL LGTSAKNEQT GKNLLTIEVL LKAVEDASVV VRYEATLGLA RAVGKYLDAF VSVASTAASN EDRNMYVSLP PAVDLKTGDR FALVWSILRL LQHNDPFPFI AKAANDIVCV VFEHLLRRRL TTSDRDDKGR QSLALRAEKL ASNLTGIEEE QDSDGIAGHS IDSTPPLSEA PTQQPSAVMQ KSPQADLRRV ASESITGRNT AHSMEQNALD MNLSGVLKED MVKGGPVHYI LPKSQFFEWK RDSFDINFEF VDDDTPADTD PLKMLYEQES EDALLAVEEE ATLKKGELEL QENMLLRNEG VKMTSMVRFH AYEDVLMACG ASDAVSMWCT NSGKQLTKFL NSKTKKARLT TSAWINEHAS SLFMVGGDTG NVHVWGNLLE SNGEACRTPA KLISAFQAAP MTAGQRGSGL ICEWQSYSGT LLAGGSSKTL RCWDLESEKL SNQFETNTDA NLTTLTTAWD YDELGMGPGP KGYQGIGQDI VVGGFSDGAL RIFDIRTNQA GSSLQSSQPT RPRRKRATEF SEHKTWVVST SFTAYGNRYE LISGTISGEI KIWDLRMSSS IRTFAAQRST MTSFAVHSKI PILATGSHAQ FIKLLTLDGD AFQVMRYHGK MASHRIGPVS CLAFHRFKPM LAAGATDAYI GLYTTKKRFI
|
| |