Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21548 |
Symbol | |
ID | 7202417 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 405522 |
End bp | 410298 |
Gene Length | 4777 bp |
Protein Length | 1360 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181721 |
Protein GI | 219122788 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.50837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCATTTGGAT TCTCGGGAAC ACATCCTCAC ATTCAGAATC CGTTTAGTAA GCATTAGATC CTGCCTTTCG CATGCAAAAC GAATGTCACG ACGGCAGTCT CACCGACCTT TATTGTTTGT TCTCCACGCA TGGCTAAAGG CAAGTTGTCG CAAAAGACTC CAATAGAGGC GTACTACGTC CATATCGCGT TTCATCATGC CGGCGGCCGA CGACTTGGAA GAAACCAATC ACTCGACGTC CGCGGGAGGC AAAGCGAAGA AACCCAAGGC GAAGCAAGCT TCTGCTGTCG AAACGCTCCA CTTCGCTTTC GACTGCGGGG TCCGTGTCAA GGTCCTCTTT TTTTTCGGCG TAATTGCAGG AATTGCTAAC GGACTCGTTT ACCCGATCCT GGCGTGGCTC TTTTCTTCCT CCTTCTCCGA CATTTCCGCC GCTTCTACCA ACGGTCTCAG TCAGATTCGC GAGCTCGCGT TTACGTTTCT GATCGTGGGA GTCTACGCGC TGGTCTGTGC AACCATTCAA TCCTTCTGTT TTGAACTGGT GGCGTACCAC GCGTCGCAGA ACTTTCGTCT CCAATGGTTC GGCGCGTTGC TGCGCCAGGA CGCGGCATTT TTCGACGTTT ACGACGTCGG CGGTATCGCT GCTCAAGTGG GACCCAACGC CAATAAGTAT CGACGCGGTA TGGGCCGTAA GTTTGGGGAG GGCGTTCAGT TTTTAACCAC CGGCATCGGT GGAATTGGTT TTGCCTTTTT CGTTTCGTGG CGAATCGCGT TCGTGGTGCT GTGCGTTATT CCCTTTGTGT CAGTCGCGGC ACTCATGGTG GTACAGCTAA ACCAGCAAAA AGGCGCACGG GCGAGCAAAA GTTACAAACG CGCTGGTAGT GTTGCGTATT CCAGCGTTTC CGCTATCAAA ACGGTGCTTT CCTTGAATGC TGTTCCGACA ATGCTCAAGC AATACTCCCA GGCAACACAA GAGGCTTTTG CCGATGCCGT CAGTATTCTT CTCAAACAGG GTCTCGCAAA CGGTACGTCC AGGAAATCGC GCTGCTATCA CATTTTTGCA GCCAGGGCCT GACTAACGGC TTGATCCTTG AATTAGGTTC CATGTTGGGC GCTTTCTTGA TGCTGTACGC CATCTTGGCC TTGTACGGTA GCGCGCTCCT GTACCGTGAT GTAGAGGATA CAGGATGCGA TCCGTCTGGC GGAGTGAACG ACAACGCGAC CTGCCCCAAT AGTGGAAGCG ACGTATTCGG AGCGATGCTG GGTGTCGCTT TTGCTGGTCA AGGTGTTTCC CAAGTCGGCA ACTTTTTCGA AGCCTTTGCG GCTGCCCGAA TTGCTGCTTT TGAAGCCTAT TCGGCTATTC GACGTACCGC CGGGGCCCCG GCCGAAACCA TTTACAAGGA AGACGATGTG GAAGATCTGA ACAGTACCGT GCACTCCCGT AAATCAAAGA AGAGCGAGCC CGATGTGGAA TCTGCCGAAA GACCGATCAA GGCGATTCTA CCGAAGTACG AAATCGATTC GACATCCGAT AAGGGAAAGA AACCGTCCGA CATTGCGGGT ACGCTTGCGT TCAATGACGT ACGCTTCAAC TACCCCACCC GTCCCACGGA AGCTATTCTG AAAGGTCTTT CTGTAGAAAT TGAAGCCGGC AAAATATCTG CTTTCTGTGG TCCCTCCGGC GGTGGCAAGA GCACAGTTAT GTCTTTGATC GAACGCTTTT ACGATCCTCT GTCCGGTAGT GTTTCTTTGG ATGGAGTGAA CTTGCGGGAT ATCAACGTGT CGCACCTTCG CAGCATGATT GGGTACGTCG GGCAGGAGCC TACCTTGTTT GCGACTAGTA TTCGTGGAAA CATTCGTTTC GGAAATCCCG ACGCGACCGA CGAGATGATT GAGAGCGCGG CTCGTATGGC GAATGCGCAC GATTTCATCA TGTCGTTTTC GGATGGCTAC GACACGCAGG TAGGAGACCG AGGGAGCCAA TTGTCCGGAG GTCAAAAGCA GCGTATCGCC ATTGGTAAGT TCTCCTGTTT CCACTCCGCA AAATGGCATT CAAAACTTTT TCTCACAGGG TTTTGTTCTT GTTGTATCTC GTAGCGCGTG TTCTGGTGCA CAATCCAAAA ATTCTATTGC TGGACGAAGC TACCAGTGCT TTGGATGCAG AGTCAGAACT TGTGGTACAG GATGCTTTGG ACAAGATCTT GGAACAAAAG AATATCACCA CCGTGATCAT TGCGCATCGG TTGTCGACCA TCCGCAACGC TGATGTGATC AATGTGGTTG TCGGTGGAGT TGTTGCCGAG AAAGGTACTC ACGACGAGTT GATGGCGGGA GACACGTACT ATCGTAAGCT AGTTGAGAAA CAGGAAGGCC AGGATAGAGC AGACACTGAC AGCTCCCCTG GTACGTCTCG GAATAGTAGC TCGGTGGATT TGGTTCAGCT CGCAGAAACA TCCAAGGAGA ACATGCGTGC TTCGATAGAT GCGAAGCACG AAACTCCGTT ATTGCAATTT CGAGATGTTC GTTTTGCGTA TCCGACGCGG CCGAAAAAGA AGGTTTTCGA CGATTTTAAC CTTACCATCA TGAAGGGTGA AACAGTAGCT TTGGTAGGAC CTAGCGGTGG TGGTAAGAGC ACGACGGTTG GCTTAATGGA ACGGTTCTAC GATCCGACCG AGGGCACGCT TGAGTATTTA GGGATGGACG TGAAGTCTTT AAATGTGCCT TGGTATCGCG ACCAGATTGG CTATGTGGGG CAGGAACCAA CTCTTTTCAA CGATACTATA TCTCGAAATA TTGCGTACGG TGCGCCGGGT GCGTCGCAGT TCGAGATTGA GGAGGCCTGC AAGCGCGCGA ATGCTCACGA CTTTATCATG GAGTTTCCGG ACGGTTACAA CACACCCTTG GGCGAGTCGT CTCAGCTGTC GGGTGGCCAG AAGCAGCGTA TAGCCATTGG TAAGTTTACA ATGGGTTTTT AGATGTACGG CAATCGACGT AATACTCACG ATCTCCTCTT CTTTCGGTAG CCCGCGCCTT GGTGAAACGA CCTAATATCT TGATCTTGGA CGAAGCCACG AGCGCCCTAG ACAACGAGAG CGAGGCAGTT GTGCAGGCTG CCATTGACAA GCTGATGAGC TCAAGTGAGC ACACGGTTGT ATTGATCGCG CATCGTTTGT CTACGATACG AAACGCTGAC AAGATAGCGT TCGTGGCCGA CGGCAAAGTT TTAGAATATG GCAGTCACGA AACTCTAATG GAGCGCCCTC ATGGCCGCTA TAAGCGTCTT TTTGAATCCT CTCGACGGGA TGCCACTCTG TCAGCTCTCA ACAGCCAATC GAAGAAAGCT TCCGGTAAAG ACGTAGATCG GGAAGAAGAC GAAGAGATTG ACTGGGAGGG AAAGATCCAG GCAGAAGAGG CCGCTGCATT CAATGCCAAA CGCGCTCGAG ACATGGCCAA ACCAGATTCT TCATACATGC TTATTGGTGC CATTGGAGCA GTGATGGCTG GAGGTGTATT CCCGATGTGG GGCGTTCTTT TTTCTGAAAC AATTGACTTA CTTTTCCAGC CTGTACTCCT TTGTCCCGCT GAGGATGGAA GCATTCCGAA CAATTTCCCA ACTTGTGAGG ATTACTGGAA AGGTATCGCC AACGATATGC AGGACCGCTC CTTTGCGTTA GCTGGCTATT GGGCTTGTGT AATGTTTGGG TGTCTTGTTG GCAATGTGCT GACCTTTTAT GGCTTTGGCA CTGCAAGTGA GCGTCTGAAC AAACGGGTTC GTGACATGTC CTTTACCTCT TTGTTGCGCC AGGAAGTCGC TTTTTTTGAC ATGCGAAGTG TCGGAAGTAT TACCTCGCAG TTACAGGACG ACGCAGCCCG TATTCATGCG TTTTCTGGTG AACCGGTTCG ATCGTTTATC ACAGCACTTT CCTCCATCGT TACAGGTGTA GTACTATCTT TTATTGTAAG TGGCTTACCG TTTTCAAATG GGTGGCCATA GCTCAAGAAA TATCTGACGC GTATCTTTCT TGCACGAAAT ACAGTTCATG TGGCCTTTCG CTCTTTTGGC AATTGGTTGC GTTCCTCTGA TGGGATTTGC TACATCGCTG GAGATGAAGC AGATGCTTGG AGAGGACGAA GGCGATGTGG ATAACGTTGT TGAAGCACTG AACACCCCGG GTGGCCTTAT TGTGGAAACG TTGTTGAATA TACGCACTGT GTCGGCTTTG ACGCTTGAGA ACAAGCGTTT TACGGACTAT CAAGATTCTT TACTGAAAAC GGAGCCAGAC TTTAAATTTG ACGCTTTTAT GACTGGTTTT GTCAGCGGAA TTTCTATGTT TATTCAGCAA TGGATCAATG GATTGCAGCT TTGGTTTGGT GGATATATTC TTTCCAAGTT TCCGGATGAC TACGACTTCA ACGACTTCCT CATTGCCAAC TTTGCTGTTC TATTCGCCTT GTTTGGTCTC GGTGCGGCGT TTCAGGACAT TTCTGACCGA AAGGAAGTGG AGAAGAGCGC GGGGCGTATT TTCTACTTGC TGGATCGTGC CTCTTCGATT GATCCTCTTT CCACGGAAGG AAAAAAATTG TGATTATTCC TTCCATTAAA GTAGTCTGAA CTTTGGCTCA CATGTCAACT CTGTCTCTGT AATTATGGAT CGTTGACCAA TTTACAATCT CTTTGCGTTG GTGAATGGTC AAACTCTGCA AAATTCGCTT GAGAAGATTG TAAATTCCAG TAGCCTTACA GTTACAGTTA ATAATAGTAT TCTACGGAAA ACAAGTT
|
Protein sequence | MPAADDLEET NHSTSAGGKA KKPKAKQASA VETLHFAFDC GVRVKVLFFF GVIAGIANGL VYPILAWLFS SSFSDISAAS TNGLSQIREL AFTFLIVGVY ALVCATIQSF CFELVAYHAS QNFRLQWFGA LLRQDAAFFD VYDVGGIAAQ VGPNANKYRR GMGRKFGEGV QFLTTGIGGI GFAFFVSWRI AFVVLCVIPF VSVAALMVVQ LNQQKGARAS KSYKRAGSVA YSSVSAIKTV LSLNAVPTML KQYSQATQEA FADAVSILLK QGLANGSMLG AFLMLYAILA LYGSALLYRD VEDTGCDPSG GVNDNATCPN SGSDVFGAML GVAFAGQGVS QVGNFFEAFA AARIAAFEAY SAIRRTAGAP AETIYKEDDV EDLNSTVHSR KSKKSEPDVE SAERPIKAIL PKYEIDSTSD KGKKPSDIAG TLAFNDVRFN YPTRPTEAIL KGLSVEIEAG KISAFCGPSG GGKSTVMSLI ERFYDPLSGS VSLDGVNLRD INVSHLRSMI GYVGQEPTLF ATSIRGNIRF GNPDATDEMI ESAARMANAH DFIMSFSDGY DTQVGDRGSQ LSGGQKQRIA IARVLVHNPK ILLLDEATSA LDAESELVVQ DALDKILEQK NITTVIIAHR LSTIRNADVI NVVVGGVVAE KGTHDELMAG DTYYRKLVEK QEGQDRADTD SSPGTSRNSS SVDLVQLAET SKENMRASID AKHETPLLQF RDVRFAYPTR PKKKVFDDFN LTIMKGETVA LVGPSGGGKS TTVGLMERFY DPTEGTLEYL GMDVKSLNVP WYRDQIGYVG QEPTLFNDTI SRNIAYGAPG ASQFEIEEAC KRANAHDFIM EFPDGYNTPL GESSQLSGGQ KQRIAIARAL VKRPNILILD EATSALDNES EAVVQAAIDK LMSSSEHTVV LIAHRLSTIR NADKIAFVAD GKVLEYGSHE TLMERPHGRY KRLFESSRRD ATLSALNSQS KKASGKDVDR EEDEEIDWEG KIQAEEAAAF NAKRARDMAK PDSSYMLIGA IGAVMAGGVF PMWGVLFSET IDLLFQPVLL CPAEDGSIPN NFPTCEDYWK GIANDMQDRS FALAGYWACV MFGCLVGNVL TFYGFGTASE RLNKRVRDMS FTSLLRQEVA FFDMRSVGSI TSQLQDDAAR IHAFSGEPVR SFITALSSIV TGVVLSFIFM WPFALLAIGC VPLMGFATSL EMKQMLGEDE GDVDNVVEAL NTPGGLIVET LLNIRTVSAL TLENKRFTDY QDSLLKTEPD FKFDAFMTGF VSGISMFIQQ WINGLQLWFG GYILSKFPDD YDFNDFLIAN FAVLFALFGL GAAFQDISDR KEVEKSAGRI FYLLDRASSI DPLSTEGKKL
|
| |