Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46562 |
Symbol | |
ID | 7201702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 716788 |
End bp | 721425 |
Gene Length | 4638 bp |
Protein Length | 1545 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181061 |
Protein GI | 219120654 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.252798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAACC AAACGGCGGC GGATTCCTCG TATGAGGACT TGCCGTCGGT AAACGGTGGG GATCACGATC TTGTGCAGCG CGTTTCGTTG CAGGAATACT CGAGCGTCAA GGCTCACGAC GAAGACGAAG ACGACGAACT GAGTGATCCA GACTTGGAGC AGCGTGGATC TCCAGCGTTG AGACGAAGAG CACCACCATC CAATTCATTT TCAGTCTCCT CGTTTCGTCG GGGCAACGTT CTGCTTTACG TATGGTTGAC GATTATCGGG GTAGTTGCCT TACTCGGAGT GGCCTTGTTG GGCTTTCGTC ACTACTTGCT GGCAAGTGAG CACCAAACTA CCAATCTGGG GCAAAGATGG AGTCCAAATA ATCCTCCAGC AACAAGTGAC GGAAAGGGAG GCGCCCCAAT TGCGGTCGAG GGTGGATCCT CGTCGGTGTC CGGATTTACC GTTGCTCATG ATCCCAATGC TTACGCTACC TGGAATCCGT ACAATCTTTC CGCGGAGCAC ATTCGTGTCG TTCCCTCCTC TTCTGTACAT GTCGGTACCA ACGGCCTAGC CACTGAAGAA GGACTGGGAT ATCTCACGCA ACCTAGTATC GTCAACAATA CTGTCGTCTT TGTCAGCGAA GGAGATTTAT ACTTGACGTA CTTGGAGACT ACCGAGAACT CACGTGCACA ACGACTCCCG GCCGTCAAGC TGACTACGAC TGAAGGGAAC GTACGGACGC CGGTTTTACA TCCCAATCGA TCCCTCGTGG CCTTTACCGC CACCTACACT TCCCGACGCG AAGCCTACGT CATGGATCTC GTCACCCGTC GTACCAAGCA GGTTTCCTTC TTCGACAGTG CCTACGGCGT CTCGGCGATT GCGGGATGGT CAGATGTCGA TACGCTAGTC GTTGTGGCTG ATTCGAATCA AATCAGCTTG CCGGATATGC GCTTGTACAC GATTCGATTG CAACAACAAC ATCAATATTT AACGGTGGAT CAGGCCATGC GTGGCAAAGC CGTTTTGGAT GTCACACCCG TTCCCTTGGC GCAAGCGACC GAAGGTTTTT TTGAGGAAGG TTGCTGGTAC TTTGTCCGTT TCAAGCAAAG TTCCCATACA GCTCGCTATG TGGGTGGGAC AGCTGAAGCT CTGTGGGCAT ATTGCGATGG ACAAGCTCTG GCCTATCCCC TTACACCCAA CTACAACGGC ACTTCGAAAA CCCCAAGTAT ATATGAGACT GCAACAGAGA AGTATTTATT GTTCTTGTCA GATCGCAACA CGGATAATCG GCCATCCACA ATGAATCTAT GGGCAACGCC CCTACCGACT TCGTCGAACC TGAAAAAGGG ACACTTCGTG ATGCCCAAAC CAATACAGAT TACTAACGTG GCGTGTCAAA TGGAAGGACT AGCGTTGCAA GAGTATGCCG TCGATCCTAT CTTGAAAAAA ATTGTAATGC GGATCGGAGC AGATTTATTC GAATTGACGG CGGAGCAAGT CCAAACCATG TTGCAAAGCC TCAATACCGG CTCCACGCCT CCCACACCGA CTCGGTTACC GGTTCTAGTC TATTCCGACT TTCACGGACT CCAAGAGCGT ATCCGAGTCG TGAATGTGCT GCGCGATTTA AAGTCACTGG ATGTTTTCGA AACGGCCGTA GGTACACAGG CAGCCTTGTT GACAGTAAGG GGGCAGTTGT TTGTCGCTCC AGTCTCGGAA AACGTTGCAC ACAGCAAGAC GTATCAAGGC GCCGGTCAAA ATCTTCCACT TCGCCGGTAT CGTGTAGCAC CAGGAACCAT GACTGCCGGA TCCATGCGAG TTTTGAGTGC TCAGTACGTT CCCATTTTGG CCGATCGGAA CCAAGAAAAA CGTCGGATGG CCATCGTTTT GGCCACGGAT CCACGCAGTC CGACGGCGGA GCACTCCTTT TTCCTCTTAC CGATTGACAC GGATGCCGTC AACATGTTTT CCGCATCAGA TCTTTTGCCA AAACCCTTCT TGGGTGGCTA TGAAAATGGC GGATCGACTC GTCAAGGGGG TTTGGGCTCG GTCCGGTCTG ATAGTGTTAA AGTCAGCCCC TGCGGTCGGC GAATGGCTTG GACCGACACG GATGGACGAG TGTGTCTGAC AACCGTACCA CTCTATCAAA ATGAAACCAA CTATACTGTT TTGCCATCTA AGAACGAACT GGGAGAGCCT ATCAATGGCG CGTCAGCAGA GCTTGTTTGG AGTCCAGGTG GGCGCTACTT AGCCATCAGT CATCCGGCCA CCAATCAATT TCAGGTTGTT AGTATTGTCG ATTGTGGAGA CCCTAACTCT CCAGAAGATC CAACTGAGGT GGTAGATATT AACATTGGTC GGATTGTCCA AGCTACACCT TCCCGTTTCA ATTCGTACGA ACCCTTTTGG GGCATCACCG GTAGAGACCT GTCGACTCGG GCTATCGAAG AAGTCCTGGC CGATCTCCAA GGCACTGGAA GACCGGATGA GGTAGCGACT ACGCTCTACT TTCTTTCAGA CCGAGATATT CAAACCGAGG TTTCTAGTCC TTGGGGATCT CGTGCCCCAT CGCCATATTT TCCAACTATG AGTGCATTGT ACGGTCTTCC TTTGACTTCT GTCAATTTGG GCGACAAAGA GGATGCGTTT ATGGGGCGAT TTGCGGGTGG TGGTGTAGCC GAAGCCTTTG TTGACCAGCT CATGGCGTTA GACAAGCAGC TGGAGGCTCT CATGGTTGGT GATAGCAAGG ACTCAAGTCG TCGACTGGAA AAGGGCCAAG ACGTTCAAGC GCGCGCGATA GTCGCACGGA AGCTTCAGCG ATATCGTAGC CACGCGATTT CCCGTCTGTT GGACGACACC AAGGCTCCCA CATCCGCCCC AACAACTACA GCAGACCGTA AGACCGTATT TCCTTCGGAC ATGGAAATTG ATTTTAGTGG GAAGGACTTG ACTTTTGCTC GTCGGGCGTA CAGGCTTGCT CACGTTCCGG ATGCTCACTA CTTAGCGATT TTGACACAAG CACAGGACGA TGGCAGTGTT GCTCTCGTCG AAAATACTGA TGACGGACGA ATAGTCAAAT TATTTGTTGC TGACCCATAC CCAAGCGATG GTGTTGATAT TGAGAAATCA TCGATATCGG TCGTTGGATG GGGGCTGAGT ACAACTAGAG ACTTTCTTTA CCTTGTCTTT GCTTCCGGGA CGACGAAAAC TGTGTCAAAC ACCGCTGCAG GCATGATGGC AGCGTTCCTC GATGCCGCAT CTGACGAGAG CATTGTCGAC ACAAATAACA TGGCCGTTTC AATCTGGCCT CAGTTGGAGT ACGAACAAAT GTACAACGAT GCTTGGAGAA TGCTACGGGA CTACTTTTAC GATACCGACA TGCACCAAGT AGATTGGGCT GGAGTACATG GTCGTTACAA ATCTCTTGTT GTAAGGTGCA CGAAACGCGA AGAGCTGGAC GATGTCTTGG CACAAATGGC TTCTGAATTG AGTGCTCTGC ACGTCTTTGT TTACGGAGGC GAGTATAGCC TTCCTTTTGG GGGTGATACG AAGAAAATCT CCCTTCACGA GCCGGCCAGT TTAGGCGCCA CATTCAAGAG AGTACCAGAG TGGAAGGGGT ACATGATAAC CGAAATTCCT CAACGAGATC CAGACTTCAA CACCGTCAAT GGGGACGCGG TGTATTGCCC TGTCTCAGGG CAAGCGTTGG AGCCGACCGG CCAGAATGGG CTAGAAGTTG GCGACGTCGT AGTTGGGGTC AACGGTGAAA GCGTCATGCA CGCAACGGAT CTCCACATGC TACTACGTGG AAGTGCGGGT CGAAGTGTGC GCCTTGAAGT CCTTCGTTTA GAGTCTGGGA ATGTACGAAG TACAACGAAC GAAATGATCT CCGAGCCCTT GATTGTGGTG CCAATCACTC CAATGGCTGC CGCGGATTTA CGGTACCAAG CCTGGGAATG GCGAACGCGA CAAAAGGCTA AGGAGCTGGC TGTCAAGGCT GGTTTTTCAG TGGCATACAT TCACATGCAA TCTATGTTAC AGCATGACAT GAATGCATTT GCACGCAACT TCTTCCCGGA CTATGACGCA CAAGCTCTGA TACTTGATGT GCGCCACAAT CGCGGCGGCA ACATTGACTC TTGGATTCTC ACTCTTTTGC AGCGCAAAGC CTGGATGTAT TGGGGAGACC GCGTTGGTGT ACGTACAGGA GATTTGGATT GGGACGAACA GTTTGCGTTT CGTGGTCACA TCGTGGTTCT GATCGACGAA CACACGGCGA GCGATGGGGA AGGAGTGTCC CGAGGTATTT CGGAGCTAGG ACTAGGACGA TTGGTCGGAA CCAGGACCTG GGGCGGTGGC ATTTGGCTGT CGTCGGACAA TCGGCTGGTG GACGGCGGTA TTGCTTCTGC ACCCGAAATC GGTACCTTCA ACGATAGGCT TGGCTGGGGA ATGGGCATCG AACAACAAGG TGTAGTGCCA GACGTCGAGG TGGACAACAA TCCTCGGACC GCCTACAGTG GACACGACGA ACAGCTAGAA CGAGCGATTG CAGAGCTGGC AGAATGGCTT GAGGAAGAGC CTGTAATTCA TCCTCGTCCT CTGGAGCCCA AACACGATAT GTCGCTTCAT GACACATGTT CAGTGTGA
|
Protein sequence | MGNQTAADSS YEDLPSVNGG DHDLVQRVSL QEYSSVKAHD EDEDDELSDP DLEQRGSPAL RRRAPPSNSF SVSSFRRGNV LLYVWLTIIG VVALLGVALL GFRHYLLASE HQTTNLGQRW SPNNPPATSD GKGGAPIAVE GGSSSVSGFT VAHDPNAYAT WNPYNLSAEH IRVVPSSSVH VGTNGLATEE GLGYLTQPSI VNNTVVFVSE GDLYLTYLET TENSRAQRLP AVKLTTTEGN VRTPVLHPNR SLVAFTATYT SRREAYVMDL VTRRTKQVSF FDSAYGVSAI AGWSDVDTLV VVADSNQISL PDMRLYTIRL QQQHQYLTVD QAMRGKAVLD VTPVPLAQAT EGFFEEGCWY FVRFKQSSHT ARYVGGTAEA LWAYCDGQAL AYPLTPNYNG TSKTPSIYET ATEKYLLFLS DRNTDNRPST MNLWATPLPT SSNLKKGHFV MPKPIQITNV ACQMEGLALQ EYAVDPILKK IVMRIGADLF ELTAEQVQTM LQSLNTGSTP PTPTRLPVLV YSDFHGLQER IRVVNVLRDL KSLDVFETAV GTQAALLTVR GQLFVAPVSE NVAHSKTYQG AGQNLPLRRY RVAPGTMTAG SMRVLSAQYV PILADRNQEK RRMAIVLATD PRSPTAEHSF FLLPIDTDAV NMFSASDLLP KPFLGGYENG GSTRQGGLGS VRSDSVKVSP CGRRMAWTDT DGRVCLTTVP LYQNETNYTV LPSKNELGEP INGASAELVW SPGGRYLAIS HPATNQFQVV SIVDCGDPNS PEDPTEVVDI NIGRIVQATP SRFNSYEPFW GITGRDLSTR AIEEVLADLQ GTGRPDEVAT TLYFLSDRDI QTEVSSPWGS RAPSPYFPTM SALYGLPLTS VNLGDKEDAF MGRFAGGGVA EAFVDQLMAL DKQLEALMVG DSKDSSRRLE KGQDVQARAI VARKLQRYRS HAISRLLDDT KAPTSAPTTT ADRKTVFPSD MEIDFSGKDL TFARRAYRLA HVPDAHYLAI LTQAQDDGSV ALVENTDDGR IVKLFVADPY PSDGVDIEKS SISVVGWGLS TTRDFLYLVF ASGTTKTVSN TAAGMMAAFL DAASDESIVD TNNMAVSIWP QLEYEQMYND AWRMLRDYFY DTDMHQVDWA GVHGRYKSLV VRCTKREELD DVLAQMASEL SALHVFVYGG EYSLPFGGDT KKISLHEPAS LGATFKRVPE WKGYMITEIP QRDPDFNTVN GDAVYCPVSG QALEPTGQNG LEVGDVVVGV NGESVMHATD LHMLLRGSAG RSVRLEVLRL ESGNVRSTTN EMISEPLIVV PITPMAAADL RYQAWEWRTR QKAKELAVKA GFSVAYIHMQ SMLQHDMNAF ARNFFPDYDA QALILDVRHN RGGNIDSWIL TLLQRKAWMY WGDRVGVRTG DLDWDEQFAF RGHIVVLIDE HTASDGEGVS RGISELGLGR LVGTRTWGGG IWLSSDNRLV DGGIASAPEI GTFNDRLGWG MGIEQQGVVP DVEVDNNPRT AYSGHDEQLE RAIAELAEWL EEEPVIHPRP LEPKHDMSLH DTCSV
|
| |