Gene Information |
Locus tag | PHATR_44165 |
Symbol | |
ID | 7203906 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1166545 |
End bp | 1171002 |
Gene Length | 4458 bp |
Protein Length | 1356 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186196 |
Protein GI | 219113225 |
COG category | |
COG ID | |
TIGRFAM ID | |
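
The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates (the record does not state the convention), and the helper names are hypothetical, introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 1166545
END_BP = 1171002
PROTEIN_LENGTH_AA = 1356


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length under 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

print(length)        # 4458, matching the Gene Length field
print(cds)           # 4071 bases for 1356 codons plus a stop codon
print(length - cds)  # 387 bases not accounted for by coding sequence
```

A surplus of non-coding bases within the gene span is what one would expect for a record marked as spliced, although the record itself does not list exon boundaries.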
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG ATCCTTCGAC GAGTCTCAGC CCTCCGGCGA CGCCATCTTT GGTGGAAGAC GTGACGCCAG TCAAACTTAG GAACGGAGAA AATGAGACGC CCCCGCGTCT CATGATTAGC AAAATGGTAC GTTGCGTTGT CGCGACGCCG TGTCGGACCC GTCGTCAGTC CACGATGTGT CGCGGTACGG TACCCCGCAC CAGCACCTCT CTGTCCCTCT TGTCGGCACG GGGCATTGCA ACACGCATGC ATGCATGACT ATGCATATAC TAACTTGAAT AGACGCTTAC TCACTAAATC AATCACTTTC TCGATCAATC AAATGACTCA ACTCCCTCCA TCTCGCTGTT CCCTTTGGTA TGTCAGGAAC TGGAGAATTT CAAGTCCTAC GCCGGCGTCA AAACGATTGG ACCCTTTCAC AAATGCTTTT CCGCCGTGGT GGGGCCGAAC GGTTCCGGCA AGTCGAACGT CATTGACGCC ATGTTGTTCG TCTTCGGCAA GCGCGCCAAG AAACTCCGCC TCAACAAGGT CTCGGAGCTT ATCCACAAGT CGCAGGATCA CAGTGACTGT GTCTCGGCGC GGGTCTCGGT ATATTTCCAG GAAATTATCG ATACCGGCCC CGGTGATACC GATTACGTCG TCGTCCCGAA GACGGACTGC GTCGTTACAC GCGTCGCCCG TCAGGACAAT TCATCCACCT ACAAGATTCA GGGAAAGTCC TGTCAATTCA AGGACGTCGC GGCCTACCTA GACAGTAAAG GTATTGATCT AGACAATAAC CGCTTTCTCA TTCTTCAAGG AGAAGTCGAA ATGATCTCCA TGATGCCACC CAAGGGAAAG ACTGATCAGG ATGAAGGTTT ACTCGAATAC CTGGAAGACA TCATTGGAAG CAACAAGTAT CTCGAACAAA CCAACGAAGC GGCACTCCAG GTGGAAGCCT TGTCGGAACT CCGTCAGGAA AAACTCAACC GTGTCAAAGC CGTAGAAAAG GAAAAGGACA ATCTCCAGGC GGCCAAACTC GAAGCCGAAG CCTTGCTCGG CAAAGATCGC GAGATCCGTC GCAAGCAGAA CGTTCTCTAC CAAATCCACG CCGCTCACGC CTCACGGGAT GCGCAGCACG CCACCTTGCA ACAGACCGCG GCCGCCACCA AGCTCGACGC CGCCCGCCAA AAACTCCAGG CCGCCAACGA CCGCGTCCAC GAAATTGAAA ACGGACACGC CGCGCAGAAA CTCGCCTACG AGAAAATTCA CGCGGAACTC GTACAGACCA AGGAAGAGTT CGCCGCCTAC GAGCGTCGGG ATATTAAGCT ACGCGAAGAA ATCAAGCACG AGAAAGCGCA ACGCAAAAAG TTGGTGGCAA AAATGGCCAG TGAGGCACAA AAACACGAGC AAGCCGTTCA AAAGGGGCAG GATGCAACCG AGGCCATTCC AACGTTGGAA CAAGAGATTG TCACTCTGAC GGATGACAAA GCGACGGAAG ATGCCAAATT GGAGGACATT TACGAAGCCA TGAAAGGCGT AACACAGCAG CTTCGTGGCG AGCTCGAAAC TAAAACGCAG GAACTCGCGC CCGTCCACCA GGAACGCGCC GTCTTTCAAG CCAGGCTCGA CACCGCTCTG ACGCAGGTTC AATTGTTGGA AGGCTCTACC ACGCGAGCGA AGGAAAAACT GCTTCAGGCC GAAACCGAAC TCGCTTCGAT TAATCAGACG CAGCAGTCGA AACGGGAGGA ACTCATTGCG GCACAAGACG AACTCCAGCA GGCGCAGGAA CGTATCACGC AGGCGGAAGG CGAAGAAACG 
GTCCTCGCTA CAAAAGAAGT CCAAATCTCC CAACGTAACA AGGATTTGTT GGTACGTGAC ATGGTTCGGT GGTGATCATC GTTGTGTACT CTATGTATGT TGTGACGCTG ACCCTGTTTC GTTGACTTTT TGATACCAGG CCCGCGCCGA AGAAGCAAAG GCTGCATTGC AGTCCAAGGG CGGGGGTCGA TCGAGTGCGG TCAAAGGAGT TCTGCAAGCG GCACGCAAGG GAGGTGAGCT CGGGAATGTT GGTGTCCTCG GTCGTCTCGG AGATCTGGCC ACGATTCCGG AAGACTATGA CGTGGCCGTT TCCACCGCTT GTGGGATGCT TGATCACATT GTGGTACAAA CGACAGCGGG TGCGCAGCGC TGCCTCGAGT TCTTACGCAA ACACGGACTG GGCCGCGCAA ACTTTATTCC TCTCGATAAG ATGAAAAAAG GAGCGCACGA CCGAGTCGTC GAGACTCCGG AAGGCGCCCG TCGCTTATTC GAACTCATCC AGCCTTCCAA TTTTGCTATT CTTCCCGCAA TCTTTCTCGG CGTTGGTGAT ACTTTAGTAG CTCCGGATCT CGAAACCGCC ACACGCTGGG CCTACGAATT CGGTAAACGT TGGCGCGTAG TCACACTGGA TGGCAAGTTG ATTGAAACGG CGGGAACCAT GTCGGGAGGT GGCAAGAGCC TTCGCCGTGG TGGTATGCGA TTGGCCAACG CTCGTTCCAA GAGTACCGCA GATAGTACTG CGGATGAAGA AGAATCCATG GATTGCCAAA AGTTGCAAGA TGAAGCCACA AAGGCCCAGG AGCTGTTGCA GCAGGTTCGC TTGCGCCGTA AGGAACTTAC AGACGAGGTC CGGGGACTGA AAAAGCGTGT TAAAGCTCTA GAGGTTGTCC TCCCCAAACT TGCCATGGAA ATTGAGGGAT GCGATACGAC GAGGAAAAAC TTGACAGAGT CGATCCCTGG TCTCCGCGCG CAGTCTGAAC TGAGCCAAAA GGACGCCGCC AAACTCGTCG ACCTTACCCG TGAAGTGGAA AAGTGCAAAA CAGACATGGC ATCGTGTTCC ATGCTGGCCT CCAAACTCGA AACAGAGGTA GCTCGTCTCC AAAAGGCTAT TCTAGATGCT GGTGGAACAA AGCTCAAGAA GCAACAAGCT GCGTGCGAAA AGGTTTTGTC TGTCCTCAAT GATGCGGAGA AGGCGCTCAA CTCAGCCAAG GTCGCTATCA CAACGTCTGA GAAAGCTGCG ACAAAAGCCG AAAAGAACAA GGCTGCAGCC GAAGAGCAGC TCGAAAAGTG CAAGGTCTTG TTGGGAGAAA AGGCTGCCGA ATTCAAAGCG CTCGAAGAGG ATGCTTTCCA CGTCATGCAA GCATTCGAAA AGGTCAAAGA AGTGGAGGCC GAAAAGCGTG AAGCACTAGA AGCCGTCTCG AAGGAATCGG AAGAGCTGCG AAAGTCACAG TCCGAAGTCA AATTTGTGGA AGTTGATCTC GTTGGTCAAG TTGACGCCTT TGCAAAGCAA ATTTCCGACG CCGAGAAAAA GATTCAACAC TGGTCCAACG AGATCGAAAA GCTACGCGCC GTTGCCAATG ACGACGACGA CTTTGACATG TCGGACGACG AAGAAGAGGA AGTGTCGACG AAGCTAAAGC ACGACATTGT CGACGAAGCA GAGGATGTTG ATATGGAAGA CGACAGCAAT GTAGCAAACG CCGATACGGA ACGACAACCG CTCGAAAAAA TACCAAAGAG TTCGTTGCCC ACCCTCTCCG AAGCCGCATT GCGACAATAC AACAAAGACG AAATCAAAGA AGAAATCACG 
GTATTGGAAA CAGAGCGAAA TGCGATTGCG AAGAATGCCA ATATGGGGGC CATTGCCGAA TACCGTAAGA AGGAGGCCGA TTACCTTGCC CGAGTCACCG AGCTGGATGG CGTATCTGAG GAGCGCAACG CCGTACGCAA GACTCATGAA GAACTTCGTC GGCTGCGTTT AGAAATGTTT ATGGATGGCT TCGGACAAAT TACGTTGAAA CTGAAGGAAA TGTATCAAAT GATTACCCTT GGAGGCGATG CAGAGTTGGA GCTTGTCGAT TCGCTCGATC CGTTTTCCGA AGGTATTGTC TTTTCGGTAC GGCCACCAAA AAAGTCTTGG AAAAACATCA GCAACTTATC CGGTGGCGAA AAAACACTGT CATCGCTGGC TTTGGTCTTT GCCCTGCATC ATTACAAGCC AACTCCCTTA TACGTAATGG ACGAAATCGA TGCCGCCCTT GACTTTAAGA ACGTTTCAAT CGTGGCGAAC TACATCAAAG AGCGGACCAA GAATGCACAA TTTATTATCA TCTCCCTGCG CAACAACATG TTCGAGCTTG CGGATCGACT AGTAGGAATC TACAAAACAA ACAATGCCAC CAAATCTGTC ACCATTAATC CACGAGCTTT TGGTGTCGAT GCCCAGCAGA ATGGACAAAT TCCGACGACT CCGGCGTTGT CGGAACGAAC AAATGCGGGC GGAGCCGACC GATCCGCTTC AGCCGTAAAG GACACATCGG TACGGCGACG AGTTCCGTTC GAGGGGAGTT CGGAGGATGT CAAAATTTCG GAAGCCTAAA TATCTGGGGA TCAGTACTAT ATCTATTGCT GCCTCAATAT AGTATTTGTA TGTAGCTG
|
Protein sequence | MDDDPSTSLS PPATPSLVED VTPVKLRNGE NETPPRLMIS KMELENFKSY AGVKTIGPFH KCFSAVVGPN GSGKSNVIDA MLFVFGKRAK KLRLNKVSEL IHKSQDHSDC VSARVSVYFQ EIIDTGPGDT DYVVVPKTDC VVTRVARQDN SSTYKIQGKS CQFKDVAAYL DSKGIDLDNN RFLILQGEVE MISMMPPKGK TDQDEGLLEY LEDIIGSNKY LEQTNEAALQ VEALSELRQE KLNRVKAVEK EKDNLQAAKL EAEALLGKDR EIRRKQNVLY QIHAAHASRD AQHATLQQTA AATKLDAARQ KLQAANDRVH EIENGHAAQK LAYEKIHAEL VQTKEEFAAY ERRDIKLREE IKHEKAQRKK LVAKMASEAQ KHEQAVQKGQ DATEAIPTLE QEIVTLTDDK ATEDAKLEDI YEAMKGVTQQ LRGELETKTQ ELAPVHQERA VFQARLDTAL TQVQLLEGST TRAKEKLLQA ETELASINQT QQSKREELIA AQDELQQAQE RITQAEGEET VLATKEVQIS QRNKDLLARA EEAKAALQSK GGGRSSAVKG VLQAARKGGE LGNVGVLGRL GDLATIPEDY DVAVSTACGM LDHIVVQTTA GAQRCLEFLR KHGLGRANFI PLDKMKKGAH DRVVETPEGA RRLFELIQPS NFAILPAIFL GVGDTLVAPD LETATRWAYE FGKRWRVVTL DGKLIETAGT MSGGGKSLRR GGMRLANARS KSTADSTADE EESMDCQKLQ DEATKAQELL QQVRLRRKEL TDEVRGLKKR VKALEVVLPK LAMEIEGCDT TRKNLTESIP GLRAQSELSQ KDAAKLVDLT REVEKCKTDM ASCSMLASKL ETEVARLQKA ILDAGGTKLK KQQAACEKVL SVLNDAEKAL NSAKVAITTS EKAATKAEKN KAAAEEQLEK CKVLLGEKAA EFKALEEDAF HVMQAFEKVK EVEAEKREAL EAVSKESEEL RKSQSEVKFV EVDLVGQVDA FAKQISDAEK KIQHWSNEIE KLRAVANDDD DFDMSDDEEE EVSTKLKHDI VDEAEDVDME DDSNVANADT ERQPLEKIPK SSLPTLSEAA LRQYNKDEIK EEITVLETER NAIAKNANMG AIAEYRKKEA DYLARVTELD GVSEERNAVR KTHEELRRLR LEMFMDGFGQ ITLKLKEMYQ MITLGGDAEL ELVDSLDPFS EGIVFSVRPP KKSWKNISNL SGGEKTLSSL ALVFALHHYK PTPLYVMDEI DAALDFKNVS IVANYIKERT KNAQFIIISL RNNMFELADR LVGIYKTNNA TKSVTINPRA FGVDAQQNGQ IPTTPALSER TNAGGADRSA SAVKDTSVRR RVPFEGSSED VKISEA
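
The GC content reported under Gene Information could in principle be recomputed from the gene sequence, which is displayed in space-separated blocks of ten bases. The sketch below shows one way to do that. The function name `gc_percent` is hypothetical, and whether the database computes its figure over exactly this span is an assumption.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three blocks of the gene sequence shown above, used as a small example.
first_blocks = "ATGGACGACG ATCCTTCGAC GAGTCTCAGC"
print(round(gc_percent(first_blocks), 1))  # 56.7 for these 30 bases
```

Passing the full gene sequence field (all blocks joined) to the same function would give a value to compare against the reported GC content.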