Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46672 |
Symbol | |
ID | 7204410 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 143222 |
End bp | 148883 |
Gene Length | 5662 bp |
Protein Length | 1347 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185831 |
Protein GI | 219121206 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.390324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGATACAA GCCAACTAAC TGAAGGAATG AAACAGCTCC AGGGATAGTT TTGTGAGATC CGGCCCGACG GCGAGTTAGG AAAGCCTTTT GCGATGGACA AACCTGAACT TGATGGAACT TCAGAAATCT TGCAATTAAC GAAGGCGCTT CTCGAATATT CCAAAAAAGC GCGCTCAGGA AATCTGATGT TGCCTTCTAA TCCTCGATCT TCAATCTGGA AAAGAGTGGT GATGGCGGCC ATCTGACACG AGCTCTAGCT AAAGGTCAAC AGAATATGTT GACACCATCC TTCTGGATCA CACTTAGAGA ACCAGTTTCC AAACGAATAA CAGAAATGGA CGAAACCCTC CATGGAAGCA TCTAGAAGCA TCGACACTGG GTCGAAGAAT GTCAATTCTC TTGATACGGA CCAATCATAG GGACCTCGCG GAGGGCTCAC GCTTACACTT AAGTGCAAAT GCGCTGGTGA GTACACAGAA CATCAGATGC TTCAAAATAT TGTGTACCTA CCACTACATT CTTTTTTTGA CAAAAACCCT CGTTACGATT GGTTGGCGTG TAAAAAAGCT CACAACAATC TCACCCCGAA TTTAATCGCC ACGATGTCGA TATCTATTGA GCACGAAGAC AATACCTTCA AACAAGGCGA GCATGTAGAA GAAAACATTG TCGATGTCTA CGACGACGAG GAAGAAGTAA TTTCCAACTC TGGATGCGGC GTTCTCGGAC GTTACCCTGT TCTTTCTGTC CTCATATTTG CAGCTACTGG AATCGGAGTT GGCCTGGACT TTCTTTCTGG GAACCTGGCG AAGGCGACGA CACCAAGGAA AAGGTTATTA ACTGGGTAGG TCTCATTGGA GATCTATTCA TCCGTTCGCT GAAGTGCGTT GTTCTTCCTC TCGTCTTTAT CAACGTCATT ATTTCGGTCG TTGATATGAT GACTATCGGT CGCGCGGGAT CGATCGGATG GAAAACAATT GGACTGTACT TAACGACTAC TGTTATTGCC TCCATACTGG GCATCCTTTC TATCTTGTCG TTCAAGCCAC TGTTTGAAGA AGGAACCTTT ACAGCTAGTA GCCCAGCAAC AATCAAGCTC GGCTGCAACG ACTCTGGTGA ATACAAAACT GAATTGCAAG ACGGCACCCT TACGTGCACC GCTGATACAG GTAACAGCTC TCAGTTTTTC ATCGAAGATT TGTCCCAGAG TTTTGTTCGG GCAAGTGGAA GTGTCCGTTC AGATATCTCC CTCAGTGACA CCATCTACGA AGGCGTGTTT ACAAAGTAAG TCATGTAATC TCCAAAAATG CGTGTGGGCC TACTTCTACA TCACAGCATC CAAGTACCGA ATACTCAGAG ATGAATTTTC TTCCCTAACA GGCTAGTTAC TGCAAATATC TTTGAGTCGT TTGTGGAGGC TAATTTCGCG GCCGTTCGCC ATTGCCTTTA GTGTTGCTTT GAGCAAGGTC TTTGACCGCC AGCGCGACCA CGATACCAGT TTTATCATCC CTTTTCTGAA AGAGCTGGAC TCTGTGTTTC TGACGATAAT CAACTGGATC ATTATGTGCA CTCCTTTTGC TGTGCTTTCC TTGATTGCGT CTGCCATTGG AAAACAAGAA GATCTGGCCG ATTCATTTGC CAACGTGGGG TATCTGGTCG TGGCGACTAT AACTGCTATG GCTCTGCAAA TTGCCGTAGT CCACTGCCTC ATCTTTGGTT TGGTCACACG AAGCAACCCC TGGCACTACC TCAAGTATAT GATTCCGGCA CAAACAATGG CGTTTGCGTG TTCAAGCAGT GCTGCCACAA TCCCCATGAC CCTCAAGTGC GTTCAAAGCA CGAACCGGGT ACCTGAGCCA GTCGCACGTT TTGTGATTCC TTTGGGTGCG ACCGTGAACA TGGATGGAGG AGCCATTTAT TTTCCATGCG CCTGCATTTG GCTTGCTGTT TTGAACGGTA TTGACCCCGA TATTGCCTCG TACTTTTTGC TGGTTATTAT TTCGACAATT GGTAGTGCCG GGACGGCACC GGTTCCTTCG GCGAGTTTGG TATTAATTAT CACGGCTTAC AACACCGTGT TCAACACAAC TGGCGTGCCC GAAGGGTTCT CCTATATTCT TGCGATTGAC TGGTTCATGG ATCGTCTACG CACAGTCGTC AATGTGACGG GAGACACTGT CGTGGCTGGT ATGGTTTCCC ACTTATGTCC CGACGGATTA CACGAGACAA AGACCCGACA CGATAGCTCT GTGGAAGACG AGCCGTCCAC GGATGAGGAA GTTGCCAAGG TGGGCTCCAC GTAATTGCGT TTGTGAAGTG TTGACAGTGA GACCTTTCTG CGAGAAGGAC CGGATTTATA TGTACTTGCA TCTGGACAAC CTACTACAAC CAAGCCACTA TAGTAATATT TCACTGTAAG TTCTAGGAGC CTTTTGCTTC TCTCCAACTG CTTCGCGCTT TACCGATAAT AATGTTGGTT TATCAATGCC ATTGCTGGTA TGTGGTTTTA TTGTTAATAT TAGTACGCCG TTATGCCGGA TAGCGTAAAA CTAGGTCACG GGAAATCGTG AATAAACGAC ACTGTACCGA CTTCTCCGTG TGTTTTCGAA CCTCCATTGC TTTGATGTGA CGAACAAAAT GTCGATGGAT AACTTCTTTG TCTCCAACAA ACTTGCTATT GTACTCTCAC TGCTCGCTTT TCCAAACGCA CTACAACACC TTGACTAGCG CCTTCCCAGT ATGCCTCTTT TTGGCTTTCG CAAACAACGC AAAGCATCGC GGTCACCGTC TCCGACACGC GGCGCCGCTT CTTCGTCGAC CGCCAGTCGT GCTAATACCG GAACGACGAG GCCATTACGA AGTCGTCGGA CTCACGTGGA TAGTGACGAG CGCATGCATA ACCCCTTGGG ATCTCCCATG CTGCTTCCCA CCAATTACGA TCCCGATGGG CGGCTTGGAC AGAATGCGTA CCAGACTTCG CCGTCGATGA ACACATGCAA CAATTTTCTG CTGGGTGGAA ACTACTACCC TGGAAAAGAC AAGAAGCGAC GGCACTATCG GACAAAGGCT ATTTGGTATC GGATATTCTG TTCAAGCGGA CCGAGACAGA TTGTGACTTT TATCGTTTCT CTCTACTTGT TTCTGTGGCT GCCGATTACG CACTTGCTTT CCACATCCCA GGCTCCAAGC TTGACGAATT TTGACTGGCT CTTGCGGGAT ACTACTTTGC AATTACCCAC CATGGAAGAA CAGCGTGATA CGGCATCCAA GTTAAACATT GAACGCGATC AGCTGGATGC CGAGGGGGGT ATATCGGACC GGCTACGAAT ATTAGAAAAA ATTGTGCCGG ACTGGTTTCA CAGGAACGAC GAGCAACCCC TCGCAAAACC CGACGTGCCC GTCAGTTCCA AGACCGAGAC ACAGCCAATA CCAACAGGTG ATCGCATCCC ACAGCCCAGG AAAGACGTAG ATTCGGAACA CATCGTAGAA GATAGGCCCC CGGAGAATGA TGGAAAAGTC CACTCGCGGA AGGCGCGCAT TACGAGAATG AAAGAATCAA ACAAAGATGG CGTAGAAAAG GGAAATGAAC TCGCAAATCC CAAGGCGCTC GAACCCTATG GTGGGCCAGC ATTGGTCGTC CGTAAAGCAG GGGACCGCAT TCGGCACTTA GGCAATACCG ATCTTTTTGT AAATGAGACC AAATGCCCAG CCAACCTTAT CCCAGACGAT GTTCAAACAA CCCTCGTAAT TCAGTCGAGC TCCAATCGCT TATGGATTTT AGCGGAGACT TGTAAGCGAT GGAATGACCC GATCATAGCA GTGATCGCTT TGACAAGTAC AGAGGACCAT GACGAGGTTT CGGCGCTGCT TTCTGGTTGG GACGATAAGT GTCCTCACCT TCAAGTAATT GTGTACCAGA TGGATACAGA CGAGGAAAAG CCTGAGATGT ATCCTGTGAA CCGCTTGCGC AATATTGGAT TGGATGCAGT GCATACAAGC CACGTACTAG TTGCGGATGT CGATTTTATT CCGGCGCAAG ATTTGGCTCG GTTGATTCGC ACGACGCTGC TTGACCGTGC TAGACTTCGA GTACTGAAAG ATGACATGAT CCCGAAAGAC CAAGATGCGA TTGTTGTTCC CGCATTTGAA CGCCTACTGG CAGAACCGTG TACATCCGAG GATAGCTGCA AAGAACATCT TCGAAGCAAT AGCTCATTCA TTCCGCATAC TTTTGACGAT CTTCGTTCTT GTATTGCCGA AAAAAGCTGC ATCGTTTTCC AGAGTAAAGT CAATTGGGAA GGTCATTTTT CGACCCACTC GGAAGACTGG CTGAAGAAGA AATGGTACCA GGGCGAACCG ATCGCTTTTG GCGAGGGCAA ACCAAGCGTC CGTCACATTC GCACGCTCGA TTGCTTCGAC TCGCTACGCT ACGAACCATA CGTCGTCCTG CAATGGTGTC CGACGGGCAA ACCTGATCCA GTACGGGCCA CTGCTCCTTA CTACGATGAG CGCTTCCACG GCTACGGCAA GAACAAGATC CAGTATATTC AACACTTGCG GTTAGCGGGT TATCACTTTG CTGTTTTACC CGAAGGTTTT ATTGTGCACA ACCCACACCG CGACTCCAAA GCCAAGGAAA CTTGGAACAA CGTAAAGGAG TCTACGTTGC ATGCTGATAT GGATCGGCTT TACCCGACAT TTTTGAAGGA GTTGTTTCAG AAGTATAGGT ATTCGAAGCA TCGTGCTGTA GAGCTGTGTA AGCAATAAAT TGCGAGTTTT TGCGGGTTCG GAAATGACAC GATGTTACTG CTGTAAGGGT AGCAAATCGT GTGGCAACGG TATTTCGCGA CATCGCTTCT CAAACGCGTC GACTTGTCGC TCCGCAACTT GTCTCAGAAC CGTATCCAGT GCTCCGCGAA TGACTGGATC ACTGACCGTC ATTTCGACTT GAAAGTCTAC TTTGCATTGT GTGTTTTCGA AACCATCTAC CGGACTGAGC GTCCATTGAC TCCGCAGCGA ATCGAACATT TTGCTTTCGA TACTTTGCGT TTCCACAGTC CAGCGCTCCG GAACCACGTG CACCCGAGAA ACGTACGTTT CTTGAAAGAA TGGCGGCATC CCAACCGTCA AAGTAGCTTC AAAGGACCGT CCGTCGTTGT CTTTCCGCAA AATCTTGGAA TATGAGCACA GTGGGAGGAA TTTGGAATAG GCGTCGACAT CTTGAATGAT TCGAAACAAG TGAATGGGGT GTGTCTGGAG AATCTTCCGT TCCGTGTGAC GTTTGGTAAC GGGGGACGAT AGGAAACTCC TCAGAATCAT GCACCGACCG TTTCTTGCTA CAAAGTCCAA CAGCGGCAAC AATGGTTTGG GTGCCGGACT GATGCGTCTA GTCTAGGCAA CGAGAAGGCA ACTTGCTACT TGTACGGCAA AGAGATACAC CCGAAGCACG CCAAGGACTC GCGATTTCAT CCATCCGGTG CACTGGAATA GATTCGATTT TCCGAGACCT GCACTGTTTC TACCCGCCAT GCAGGAATGT TCACATTATT CGGCTCGGCA GGAAAACTCA TCCAACGATC AATCCGTACG AATAGCGGAC TCCTCGACTA TGACGTCATG AAAGGTTGGG ACGAGTTTCG TACGATCTCC CGTGACACGT GA
|
Protein sequence | MLQNIVYLPL HSFFDKNPRY DWLACKKAHN NLTPNLIATM SISIEHEDNT FKQGEHVEEN IVDVYDDEEE VISNSGCGVL GRDDTKEKVI NWVGLIGDLF IRSLKCVVLP LVFINVIISV VDMMTIGRAG SIGWKTIGLY LTTTVIASIL GILSILSFKP LFEEGTFTAS SPATIKLGCN DSGEYKTELQ DGTLTCTADT GNSSQFFIED LSQSFVRASG SVRSDISLSD TIYEGVFTNR LWRLISRPFA IAFSVALSKV FDRQRDHDTS FIIPFLKELD SVFLTIINWI IMCTPFAVLS LIASAIGKQE DLADSFANVG YLVVATITAM ALQIAVVHCL IFGLVTRSNP WHYLKYMIPA QTMAFACSSS AATIPMTLKC VQSTNRVPEP VARFVIPLGA TVNMDGGAIY FPCACIWLAV LNGIDPDIAS YFLLVIISTI GSAGTAPVPS ASLVLIITAY NTVFNTTGVP EGFSYILAID WFMDRLRTVV NVTGDTVVAG MVSHLCPDGL HETKTRHDSS VEDEPSTDEE VAKRLPSMPL FGFRKQRKAS RSPSPTRGAA SSSTASRANT GTTRPLRSRR THVDSDERMH NPLGSPMLLP TNYDPDGRLG QNAYQTSPSM NTCNNFLLGG NYYPGKDKKR RHYRTKAIWY RIFCSSGPRQ IVTFIVSLYL FLWLPITHLL STSQAPSLTN FDWLLRDTTL QLPTMEEQRD TASKLNIERD QLDAEGGISD RLRILEKIVP DWFHRNDEQP LAKPDVPVSS KTETQPIPTG DRIPQPRKDV DSEHIVEDRP PENDGKVHSR KARITRMKES NKDGVEKGNE LANPKALEPY GGPALVVRKA GDRIRHLGNT DLFVNETKCP ANLIPDDVQT TLVIQSSSNR LWILAETCKR WNDPIIAVIA LTSTEDHDEV SALLSGWDDK CPHLQVIVYQ MDTDEEKPEM YPVNRLRNIG LDAVHTSHVL VADVDFIPAQ DLARLIRTTL LDRARLRVLK DDMIPKDQDA IVVPAFERLL AEPCTSEDSC KEHLRSNSSF IPHTFDDLRS CIAEKSCIVF QSKVNWEGHF STHSEDWLKK KWYQGEPIAF GEGKPSVRHI RTLDCFDSLR YEPYVVLQWC PTGKPDPVRA TAPYYDERFH GYGKNKIQYI QHLRLAGYHF AVLPEGFIVH NPHRDSKAKE TWNNQIVWQR YFATSLLKRV DLSLRNLSQN RIQCSANDWI TDRHFDLKVY FALCVFETIY RTERPLTPQR IEHFAFDTLR FHSPALRNHV HPRNRQQWFG CRTDASSLGN EKATCYLYGK EIHPKHAKDS RFHPSGMFTL FGSAGKLIQR SIRTNSGLLD YDVMKGWDEF RTISRDT
|
| |