Gene Information |
Locus tag | PHATRDRAFT_46565 |
Symbol | |
ID | 7201705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 733916 |
End bp | 738668 |
Gene Length | 4753 bp |
Protein Length | 1545 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181064 |
Protein GI | 219120660 |
COG category | |
COG ID | |
TIGRFAM ID | |
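
The Start bp, End bp, Gene Length, and Protein Length rows above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates. It also assumes that the unspliced coding region consists of one codon per amino acid plus a single stop codon. The function names are hypothetical and are not part of the record.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from inclusive, 1-based start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def noncoding_bases(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases in the gene span not accounted for by the coding region.

    Assumption: coding region = one codon per residue plus one stop codon.
    """
    coding_bp = (protein_len_aa + 1) * 3
    return gene_len_bp - coding_bp


gene_len = feature_length(733916, 738668)
print(gene_len)                         # 4753, matching the Gene Length row
print(noncoding_bases(gene_len, 1545))  # 115 bases outside the coding region
```

Under these assumptions the 4753 bp span holds 4638 coding bases, leaving 115 bases that the record does not annotate further.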
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.400003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTCAATAC CTCGCTGGCC TGCCCATACA CAACGTTCTC CCGCCCAGCC TTTGACGAAG CACCTTCAAG GCTTTTAGGT TTCAAGGCTT CCAGGTTTCG ACGGACGGTG AAAGAATGGG AAACCAAACG GCGGCGGATT CCTCGTATGA GGACTTGCCG TCGGTAAACG GTGGGGATCA CGATCTTGTG CAGCGCGTTT CGTTGCAGGA ATACTCGAGC GTCAAGGCTC ACGACGAAGA CGAAGACGAC GAACTGAGTG ATCCAGACTT GGAGCAGCGT GGATCTCCAG CGTTGAGACG AAGAGCACCA CCATCCAATT CATTTTCAGT CTCCTCGTTT CGTCGGGGCA ACGTTCTGCT TTACGTATGG TTGACGATTA TCGGGGTAGT TGCCTTACTC GGAGTGGCCT TGTTGGGCTT TCGTCACTAC TTGCTGGCAA GTGAGCACCA AACTACCAAT CTGGGGCAAA GATGGAGTCC AAATAATCCT CCAGCAACAA GTGACGGAAA GGGAGGCGCC CCAATTGCGG TCGAGGGTGG ATCCTCGTCG GTGTCCGGAT TTACCGTTGC TCATGATCCC AATGCTTACG CTACCTGGAA TCCGTACAAT CTTTCCGCGG AGCACATTCG TGTCGTTCCC TCCTCTTCTG TACATGTCGG TACCAACGGC CTAGCCACTG AAGAAGGACT GGGATATCTC ACGCAACCTA GTATCGTCAA CAATACTGTC GTCTTTGTCA GCGAAGGAGA CTTATACTTG ACGTACTTGG AGACTACCGA GAACTCACGT GCACAACGAC TCCCGGCCGT CAAGCTGACT ACGACTGAAG GGAACGTACG GACGCCGGTT TTACATCCCA ATCGATCCCT CGTGGCCTTT ACCGCTACCT ACACTTCCCG ACGCGAAGCC TACGTCATGG ATCTCGTCAC CCGTCGTACC AAGCAGGTTT CCTTCTTCGA CAGTGCCTAC GGCGTCTCGG CGATTGCGGG ATGGTCAGAT GTCGATACGC TAGTCGTTGT GGCTGATTCG AATCAAATCA GCTTGCCGGA TATGCGCTTG TACACGATTC GATTGCAACA ACAACATCAA TATTTAACGG TGGATCAGGC CATGCGTGGC AAAGCCGTTT TGGATGTCAC ACCCGTTCCC TTGGCGCAAG CGACCGAAGG TTTTTTTGAG GAAGGTTGCT GGTACTTTGT CCGTTTCAAG CAAAGTTCCC ATACAGCTCG CTATGTGGGT GGGACAGCTG AAGCTCTGTG GGCATATTGC GATGGACAAG CTCTGGCCTA TCCCCTTACA CCCAACTACA ACGGCACTTC GAAAACCCCA AGTATATATG AGACTGCAAC AGAGAAGTAT TTATTGTTCT TGTCAGATCG CAACACGGAT AATCGGCCAT CCACAATGAA TCTATGGGCA ACGCCCCTAC CGACTTCGTC GAACCTGAAA AAGGGACACT TCGTGATGCC CAAACCAATA CAGATTACTA ACGTGGCGTG TCAAATGGAA GGACTAGCGT TGCAAGAGTA TGCCGTCGAT CCTATCTCGA AAAAAATTGT AATGCGGATC GGAGCAGATT TATTCGAATT GACGGCGGAG CAAGTCCAAA CCATGTTGCA AAGCCTCAAT ACCGGCTCCA CGCCTCCCAC ACCGACTCGG TTACCGGTTC TAGTCTATTC CGACTTTCAC GGACTCCAAG AGCGTATCCG AGTCGTGAAT GTGCTGCGCG ATTTAAAGTC ACTGGATGTT TTCGAAACGG CCGTAGGTAC ACAGGCAGCC TTGTTGACAG TAAGGGGGCA 
GTTGTTTGTC GCTCCAGTCT CGGAAAACGT GGCACACAGC AAGACGTATC AAGGCGCCGG TCAAAATCTT CCACTTCGCC GGTATCGTGT AGCACCAGGA ACCATGACTG CCGGATCTAT GCGAGTTTTG AGTGCGCAGT ACGTTCCCAT TTTGGCCGAT CGGAACCAAG AAAAACGTCG GATGGCCATC GTTTTGGCCA CGGATCCACG CAGTCCGACG GCGGAGCACT CCTTTTTCCT CTTACCGATT GACACGGATG CCGTCAACAT GTTTTCCGCA TCAGATCTTT TGCCAAAACC CTTCTTGGGT GGCTATGAAA ATGGCGGATC GACTCGTCAA GGGGGTTTGG GCTCGGTCCG GTCTGATAGT GTTAAAGTCA GCCCCTGCGG TCGGCGAATG GCTTGGACCG ACACGGATGG ACGAGTGTGT CTGACAACCG TACCACTCTA TCAAAATGAA ACCAACTATA CTGTTTTGCC ATCTAAGAAC GAACTGGGAG AGCCTATCAA TGGCGCGTCA GCAGAGCTTG TTTGGAGTCC AGGTGGGCGC TACTTAGCCA TCAGTCATCC GGCCACCAAT CAATTTCAGG TTATTAGTAT TGTCGATTGT GGAGACCCTA ACTCTCCAGA AGATCCAACT GAGGTGGTAG ATATTAACAT TGGTCGGATT GTCCAAGCTA CACCTTCCCG TTTCAATTCG TACGAACCCT TTTGGGGCAT CACCGGTAGA GACCTGTCGA CTCGGGCTAT CGAAGAAGTC CTGGCCGATC TCCAAGGCAC TGGAAGACCG GATGAGGTAG CGACTACGCT CTACTTTCTT TCAGACCGAG ATATTCAAAC CGAGGTTTCT AGTCCTTGGG GATCTCGTGC CCCATCGCCA TATTTTCCAA CTATGAGTGC ATTGTACGGT CTTCCTTTGA CTTCTGTCAA TTTGGGCGAC AAAGAGGATG CGTTTATGGG GCGATTTGCG GGTGGTGGTG TAGCTGAAGC CTTTGTTGAC CAGCTCATGG CGTTAGACAA GCAGCTGGAG GCTCTCATGG TTGGTGATAG CAAGGACTCA AGTCGTCGAC TGGAAAAGGA CCAAGACGTT CAAGCGCGCG CGATAGTCGC ACGGAAGCTT CAGCGATATC GTAGCCACGC GATTTCCCGT CTGTTGGACG ACACCAAGGC TCCCACATCC GCCCCAACAA CTACAGCAGA CCGTAAGACC GTATTTCCTT CGGACATGGA AATTGATTTT AGTGGGAAGG ACTTGACTTT TGCTCGTCGG GCGTACAGGC TTGCTCACGT TCCGGATGCT CACTACTTAG CGATTTTGAC ACAAGCACAG GACGATGGCA GTGTTGCTCT CGTCGAAAAT ACTGATGACG GACGAAAAGT CAAATTATTT GTTGCTGACC CATACCCAAG CGATGGTGTT GATATTGAGA AATCATCGAT ATCGGTCGTT GGATGGGGGC TGAGTACAAC TAGAGACTTT CTTTACCTTG TCTTTGCTTC CGGGACGACG AAAACTATGT CAAACACCGC TGCAGGCATG ATGGCAGCGT TCCTCGATGC CGCATCTGAC GAGAGCATTG TCGACACAAA TAACATGGCC GTTTCAATCT GGCCTCAGTT GGAGTACGAA CAAATGTACA ACGATGCTTG GAGAATGCTA CGGGACTACT TTTACGATGC CGACATGCAC CAAGTAGATT GGGCTGGAGT ACATGGTCGT TACAAATCTC TTGTTGTAAG GTGCACGAAA CGCGAAGAGC TGGACGATGT CTTGGCACAA ATGGCTTCTG AATTGAGTGC TCTGCACGTC 
TTTGTTTACG GAGGCGAGTA TAGCCTTCCT TTTGGGGGTG ATACGAAGAA AATCTCCCTT CACGAGCCGG CCAGTTTAGG CGTCACATTC AAGAGAGTAC CAGAGTGGAA GGGGTACATG ATAACCGAAA TTCCTCAACG AGATCCAGAC TTCAACACCG TCAATGGAGA CGCGGTGTAT TGCCCTGTCT CAGGGCAAGC GTTGGAGCCG ACCGGCCAGA ATGGGCTAGA AGTTGGCGAC GTCGTAGTTG GGGTCAACGG TGAAAGCGTC ATGCACGCAA CGGATCTCCA CATGCTACTA CGTGGAAGTG CGGGTCGAAG TGTGCGCCTT GAAGTCCTTC GTTTAGAGTC TGGGAATGTA CGAAGTACAA CGAACGAAAT GATCTCCGAG CCCTTGATTG TGGTGCCAAT CACTCCAATG GCTGCCGCGG ATTTACGGTA CCAAGCCTGG GAATGGCGAA CGCGACAAAA GGCTAAGGAG CTGGCTGTCA AGGCTGGTTT CTCAGTGGCA TACATTCACA TGCAATCTAT GTTACAGCAT GACATGAATG CATTTGCACG CAACTTCTTC CCGGACTATG ACGCACAAGC TCTGATACTT GATGTGCGCC ACAATCGCGG CGGCAACATC GACTCTTGGA TTCTCACTCT TTTGCAGCGC AAAGCCTGGA TGTATTGGGG AGACCGCGTT GGTGTACGTA CAGGAGATTT GGATTGGGAC GAACAGTTTG CGTTTCGTGG TCACATCGTG GTTCTGATCG ACGAGCACAC GGCGAGCGAT GGGGAAGGAG TGTCCCGAGG TATTTCGGAG CTAGGACTAG GACGATTGGT CGGAACCAGG ACCTGGGGCG GTGGCATTTG GCTGTCGTCG GACAATCGGC TGGTGGACGG CGGTATTGCT TCTGCACCCG AAATCGGTAC CTTCAACGAT AGGCTTGGCT GGGGAATGGG CATCGAACAA CAAGGTGTAG TGCCAGACGT CGAGGTGGAC AACAATCCTC GGACCGCCTA CAGTGGACAC GACGAACAGC TAGAACGAGC GATTGCAGAG CTGGCAGAAT GGCTTGAGGA AGAGCCTGTA ATTCATCCTC GTCCTCTGGA GCCCAAACAC GATATGTCGC TTCATGACAC ATGTTCAGTG TGA
|
Protein sequence | MGNQTAADSS YEDLPSVNGG DHDLVQRVSL QEYSSVKAHD EDEDDELSDP DLEQRGSPAL RRRAPPSNSF SVSSFRRGNV LLYVWLTIIG VVALLGVALL GFRHYLLASE HQTTNLGQRW SPNNPPATSD GKGGAPIAVE GGSSSVSGFT VAHDPNAYAT WNPYNLSAEH IRVVPSSSVH VGTNGLATEE GLGYLTQPSI VNNTVVFVSE GDLYLTYLET TENSRAQRLP AVKLTTTEGN VRTPVLHPNR SLVAFTATYT SRREAYVMDL VTRRTKQVSF FDSAYGVSAI AGWSDVDTLV VVADSNQISL PDMRLYTIRL QQQHQYLTVD QAMRGKAVLD VTPVPLAQAT EGFFEEGCWY FVRFKQSSHT ARYVGGTAEA LWAYCDGQAL AYPLTPNYNG TSKTPSIYET ATEKYLLFLS DRNTDNRPST MNLWATPLPT SSNLKKGHFV MPKPIQITNV ACQMEGLALQ EYAVDPISKK IVMRIGADLF ELTAEQVQTM LQSLNTGSTP PTPTRLPVLV YSDFHGLQER IRVVNVLRDL KSLDVFETAV GTQAALLTVR GQLFVAPVSE NVAHSKTYQG AGQNLPLRRY RVAPGTMTAG SMRVLSAQYV PILADRNQEK RRMAIVLATD PRSPTAEHSF FLLPIDTDAV NMFSASDLLP KPFLGGYENG GSTRQGGLGS VRSDSVKVSP CGRRMAWTDT DGRVCLTTVP LYQNETNYTV LPSKNELGEP INGASAELVW SPGGRYLAIS HPATNQFQVI SIVDCGDPNS PEDPTEVVDI NIGRIVQATP SRFNSYEPFW GITGRDLSTR AIEEVLADLQ GTGRPDEVAT TLYFLSDRDI QTEVSSPWGS RAPSPYFPTM SALYGLPLTS VNLGDKEDAF MGRFAGGGVA EAFVDQLMAL DKQLEALMVG DSKDSSRRLE KDQDVQARAI VARKLQRYRS HAISRLLDDT KAPTSAPTTT ADRKTVFPSD MEIDFSGKDL TFARRAYRLA HVPDAHYLAI LTQAQDDGSV ALVENTDDGR KVKLFVADPY PSDGVDIEKS SISVVGWGLS TTRDFLYLVF ASGTTKTMSN TAAGMMAAFL DAASDESIVD TNNMAVSIWP QLEYEQMYND AWRMLRDYFY DADMHQVDWA GVHGRYKSLV VRCTKREELD DVLAQMASEL SALHVFVYGG EYSLPFGGDT KKISLHEPAS LGVTFKRVPE WKGYMITEIP QRDPDFNTVN GDAVYCPVSG QALEPTGQNG LEVGDVVVGV NGESVMHATD LHMLLRGSAG RSVRLEVLRL ESGNVRSTTN EMISEPLIVV PITPMAAADL RYQAWEWRTR QKAKELAVKA GFSVAYIHMQ SMLQHDMNAF ARNFFPDYDA QALILDVRHN RGGNIDSWIL TLLQRKAWMY WGDRVGVRTG DLDWDEQFAF RGHIVVLIDE HTASDGEGVS RGISELGLGR LVGTRTWGGG IWLSSDNRLV DGGIASAPEI GTFNDRLGWG MGIEQQGVVP DVEVDNNPRT AYSGHDEQLE RAIAELAEWL EEEPVIHPRP LEPKHDMSLH DTCSV
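
The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short translation sketch. It assumes the standard genetic code (the Translation table row is empty in this record) and uses only a short excerpt of the gene sequence around the first ATG. The helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (assumed; the record gives no table).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Excerpt copied from the Gene sequence row; the 10-base groups are
# separated by spaces in the record, so they are stripped first.
excerpt = "AAAGAATGGG AAACCAAACG GCGGCGGATT CCTCG".replace(" ", "")
start = excerpt.find("ATG")        # first start codon within the excerpt
print(translate(excerpt[start:]))  # MGNQTAADSS
```

The output matches the first ten residues of the Protein sequence row. Applying the same function to the full, space-stripped gene sequence from its first ATG would be the natural next check.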