Gene Information |
Locus tag | PHATRDRAFT_44878 |
Symbol | |
ID | 7199804 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 505410 |
End bp | 511399 |
Gene Length | 5990 bp |
Protein Length | 1836 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178789 |
Protein GI | 219115988 |
COG category | |
COG ID | |
TIGRFAM ID | |
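
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive (which matches the listed Gene Length), and that the gene span covers the full coding sequence plus one stop codon. The helper names `span_length` and `min_coding_length` are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 505410
end_bp = 511399
protein_length_aa = 1836


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_length_bp = span_length(start_bp, end_bp)
coding_bp = min_coding_length(protein_length_aa)
# Whatever is left over is not needed to encode the protein.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 5990, matches the listed Gene Length
print(coding_bp)       # 5511
print(non_coding_bp)   # 479
```

Under these assumptions, about 479 bp of the gene span is not required to encode the 1836 aa product. That is consistent with the record marking the gene as spliced, although the record itself does not say how those bases divide between introns and any flanking sequence.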
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAAAAATTAC AATTAGAATT CATCTACCGC TGCATTACCT GCAAGCTATC GGCTCTACGC TGATTAAGTG ATTCGGTGAT TCCGCGGAAT CGTTGTAACT AGCCAGTCTT GTGCTTTTTG TGGTTGGCGT CGCGGCTACA AACAGGGATT GCATCAAATT CTTTGTTTGA CGCCGGAGTG TATACAATTT AGAAAAGATC TTTCCTCTGG CCTTATCCAT AGAATGACTT CCGCCGGAGA TTCTGGGATG CGAAAAAACG ACAAGCCTTC GCTTCGTATG AATTTGTCGG CAGCGAATTT GTCCAAGACT GAAGTGGTTC CTTCGGTACC CGCTGCAAAG AAGCGAAGGT ATGCGCCAAT CGATCTCCCA CTCAGTATGG CTTCGGTATC AGATGTGCGA GTGTTGCATC AGAAAACTTC GATGAGTTTT GCCCTCCAAT CGAGTCTCAA CCGCGAGGAA GCTACCTGGC ATGCGCAGGG CACAACGACG CTCTTTCTTA CAACGCCGCA TCCACCAACC CAGCCACTTT CCGAGTCTTC AACAGTCCCC CTTGCTCTAC ATTTACGAGG TTCCTGCCAA GTGTCACAAG TACATGTTCA AACTCTAACT TGGACAAAAA ATCAGCCCGA GCCAAAGCCG CAGACTCTGC GGACTTCCTA TCATCACGTG GACCCGTTGC AGCACGTATT GGTCAAACCG CCATCGTCGT ACACTTTCGA AGAGGATCAC AAGAAAGAGT TTCGATTCGA TGCTGATGCT CAGTCCTCGC GTGGTGCCGT CGGTATGACG TCAGCTCTTC GGGCTGCGAG TGTTGCGAGT AATCTTGGAG AGCTGCGAAT CACGTTCGAG GCTTTGAATA GTAAAAGCGA GGAAACAACG ATAGATGCAG TCCAAAGTTC GTGGCGACAC CTATTGGAAG CTACCTCCAA TGAAGGGGAT GTTGTCCAAC GCCTGAAATC CAGCTGGAAT GGACGAGCTG GACAGCGTCG GAATCGCCGA TTGAACTTAA TAGCATCCTG TCTTGCTGCA GCATCGAATC AGAATCGTGC TTTTTGTGTT ACAGTCCGAT ACAACATTGC GATTGAAAAG TCAATCGCAC ATCTAGGTGG GCTCCACGCA TTGACAACGA ATCGCACGCC TCATGTCTAC ACGACGGCCG GAGTCTACGG TGACCACGAG GGAACCCGAT GTTGGCTACC CTGTGCAGAT TCAGCAGCTG CTCATCACCG CACCTCCCAC GAATTTCTTG TACATGTCAC CGCAGGTATG ACGCATGGCC TCGCGACGAT GGGGCTGGGA GAAGACATGG GCTGTAGAGA AACTCTCCTG CATGACAGCT TTACGAAGAC AGGGGCCAAA AGTATCCCGT CTCGAGCCGC AAATGAGTTT GGCGCACAGC ATGTGAAGTT TTTGGAAGAT CTCTCAGCCA AGAATACTGA CAACTCTATT TCTCACTGGA TTCCTCCGGA AGAGTCGCAT ACAGTAGCAA TGGGAGATAT TCAAGCGACG CATGTTTGGT GTTCTGGGAC TTGGACGCCT GTTCCTGCCA GATCACTGGG GTTTTCTATC GGCCCTTTCC GTGTTCTAGA AGATCCCGAG TACTTTGAGG CTGCGGCGGA AGCAGAAGTA GATGATGATG AAAAGGGTCC CGACGAACTG ATCGAGACAG CCCGCTCAAG CGGCGAAGGA ATTCGACAGG CTTACTTTTG CCCCGTATTC GCCCGGAAGT TTATTCATCG CGATGCAAGC CGTGTCTTGC TTCCGGAGAC TCAATTGGTG CTCGCCGACC TTTCATCCCG 
ACAACGCGAA ATTCTGAATC GATTGGACCA AAGTGTCTTA TCGACAACCG TGGGTGTTCC TCACCGAGCC TTGTCATTGA TGCGAGATGT GCTGGCCTTG CCGACCTACC GGACAGCCTC GTATACCCAA ATTTGGATTC CAAACGCCCT TCATGGGGGT GTGACCAGTG GATCATTTCA CTTCTGTCCC GAAGTGTTGG TCAATCCTTT TTTGGGTGGT GCCGTATTGG ACTCAAGGAT GCTGCCGCCC ACAGGATCTC GGCTACCGTT TTATCAAGGG GGGCGAGTAC TCCAATGTCT ACAGGCACGA TGTGCTATTC GCGGATGGAT TGTCGCAAGT TTACCTTTAG GAGGAAAAGA TGATGTTGGC GGGGGCTATA TTTTTGCCTT GGCTGAAAGC TTGCTGATGA GCCTTTACGA GCGTGGCAAT GGGGCTCATG GAGAAGGTAT GGCTTTGATC GCGTATTGGT ATTGTGTGAA ATTGACTGAT GACTTATCGT TCTTTTCGCG ATTACAGGTG GAGGGAAAGG CGGCGTATTT TTTAGTAACC GTTACTCATT TGGAAGCGGG TTAAACAGTT CAAACCTAGA CTTCCTTCCG GTTCAAAACA TCGAAGACAT GGATGTGGTC ATTTCTGGTG TTGGTGCTGT ACCAGTCGGT AAGCTTATTT CCTTCTAAAA TTGAATCGAC CAAACGCGAG TCTCACAGAA TACGAATTTG CTCTAGTGGA CGACCGACAA AACGATCAAC TTTGGAGATC CGCAACGAAC GGATCAGAGT CCCATACTTC ATCCACGGAT GAATTTAATG TGCGGCAACT TCTTTGCCGG GATGCGGTAG AAGCTTTGGA ACGTGGGACG GACAAAGACA AGGCGGTCCC GAGCCCTTCG ATGGGTTGGT TTGGGTCACA CTTGTCACTC TCGTTTCTTT CAAGCAATGC TGCTTCCAGT AGCGATTTAG GCTGTGGCGG TGTGGAGTTG CTGCATCCCA TCGGTGGGCT TGTGTACCGC GCTTTAAAAT GTGAAGTATT TCGAGGCATC GTCGAAGGCA GAGCTGGTAT CGCAAATTTT GTTCGGCTTA TACGTGCATC GTTCATCGCT GCACATTTAG AGGACCTCGG TGAAAGCGAG CTCAAGTTAG CAAAAAAACG TGAAAAAGAA GTAAAGGACA CTAACTCGAG CGTGGAAAAA GATGAAGTTG TACCAAAGGC GCCTTTCGTC GTTTGTGTGA ACGAAATTCT AAAAAAGAAA GGTTTGTCGC ACACTCTTTT TACTCGGGCC CTGCAAAACC TTTCTGGACG AGCGAAAGAA CCACATCTTC TTGGAACACT AGTGGATGTG GAGCGACACG CGGAAGACCC TCGCACTCGT CGACCATTCG TGGAACCAGA AGGTTTTCCC AATTCGTTCG TTAGAGGTGC CTCGCTTCTG TATTGTCGGG TTGGAGTGCA GGTGGAACAA GCTCGAGACA GCACTGGTAC CCAAGGACCA ACGGTCGGAA AAGGTATACA AATGCAGGCC TATGCCGAGC CTGTTATTCC GGAAGGTGGA TTAGCATTCA GTGGTCCCAT CACTGTCCGT GTTATTGAGA ACGAAGGGCA ATTTCGTGAA TATGTTAAAG ATATTGTGGC CGATGGTAGT CGACGCGATT GGGGAACCAC ATTTCTGCAT GCGAAGCCTG TAACGACGGT AAAGGCGCAG ACCGCGGCAA GTGGGACGAT CGAAGCATCG AGCAGAACTT CGAAAGAAGG ATCAAAGTCG TTGGGTTCCG TTAGCAGCAG TATCTTTACA GACTCCTTTT TTCACAGCGG 
TGGATATCAG GCGATCGAGT TGATACGTTT GACAAATCTT TCACCCCTCT TGTGGGTTCG GGTAGATCCA ATGGGTTTGT ACGCTGGACG TATATCTGTC TGTCAGCCAG ATGCTTGTTT AGCCGAGCAG CTTTTCCATG ACGGCGATGC TGCTGCTCAG GTGGAAGCGA TTCGTACACT CGCCGAGCGG CCAATCCGAA TTCAGCCATC ATCCAAGGTG AATGCTGTCT ACGACGTTAG TGTTATGGAG CTACCGGTTC GAGTTCTTGG TGATTGTTTG CGCGGATCTC CAGCTCTACA CAGCTCGCTT CCTCACACAC CTGCAATACG GAGCCAAGCC GCCCTAGCAA TTGCACAATG GCAAAACAAC AAGGCTCCAC CGTCAAAGGA TTCAGTGGAT GCAGACAGCT GGATAGGGAT CAATTTACTA TTGCACTACT TCCGTGAGCG GTTCTACAGC AACTCAATCA TCATGCCAAC TAAGTTCTCT CGCCTCGCGT TGAAGAAAAG TGATGTTGAA ATGAATCAAG CTGCAGCTGC GAACGAGAGC GCGGGTGCGA ATCAACCTAC GTTTGACGAT GCCTACGAAT ACCTCGACAT ACTTGAAGCC GGAGAGGAGC GAGCGGCAGC CTTGGAAGAC GCCGATCAGG TTGAAATAGA GGAAGATGAA GAGTACAGGG TTCGATCAGC TGTTGTTACT GCGATTGCTT GTACCCGAGC TAAAGACGGT CTTACTCCGT CAATTGCGGT TGAATTCTTG GAGACAGTGC TGGAAGCAGA GGATGCATCA GTCGTTGGCC ATCTGATTTA TCCTGACGAA GAGTTGATGA TTGAAAAGAA TTTTCGAACG AAGAAGTACA AAACCGAGCT AAATGCGGAC GATATAGAGG TTTCTACCAG CAATTACATA TCCACTCCAT CTCTGTCCTT TGTCTCGAGT GCAATGATTG CCGATGCACT CCTTGCAATG TGCCACATCA ATGCTTCCCC TACTGTCTAC ACTGACCCAG CAACAGGGGG AAAAGTTGTT TCTTCTGGCC CTCATCCTGT ATCCAAGCTT ATTAAAATTG CAAGAGAATG GCTCGACTGG GAGAATTATA GGGAGAGCAT CCGGAATGAA CTAGCAATGG AAGACCGATC TGGCATTTCT GGAAACTGCT ACGACAATAT CGCAGCTTGC GCTGTTACTG CCCTTTCGAC GCTGGCGATT CTTCGTCAGA GCACAACAGA AGGTGAACGG ACCGTAAGCT CCCGTAATAT TGGGAACGGA GAGAAAAACT CAGAAAGAAA GACAGAGGAC ATCGCCACAG CAAAATTTTA CGTTGCCATG TTTGACGAGC ACCCAGTCCA CAATGACCCA ACTCGAGCTG CCTGTGCACA GGCGATGTCG TGCATATGCT GTGCCACGGA TAGGTTTGAG GATGAAAAGA AGCCTCCACT GGGTCTCCTC TCAGCTCTTG AGTTTCTTCT AGATCGAGTA CTAGGTGAGT CGATTTATCT GTTGGAGCGG TTTCAATAGC GGTTTGCAGA CCAATGAAGT TTATGTCGTT GGATGGCTCT TACATGATCT CTTACTTCCT GAACAAGACG ATGAGGTCTC ACCCTGTCTT AAACAGACGT TATGTTTAAT TATGATGGAT GCGTGTACCG GAAAAGTTTC GTCTATGCAG CGCGTAGGTG TAATTGGAGG TAGAAACGAT CTCTTCGTAA CGGCTGCCCG ATATTTCAAT GGGCCGTTGG GAGCAAGCCA AGGCAACGAC AATGGAAGTG CGCTTCTAAC TTCCGTCTCA CCGTCGTCCT CTCCAGCAGC 
GAGCGCAGTT AACGATGGTG CGCGGAGAGG TCTGCGTCTT TTGAGTCGGG CCGGCCATCC GAGAGAAGCG TTGGGAGAGG AGATTGTTGT AAGAGTTGCT CGCTTTGCCA CGCGACTTTG GCGAACAATC AATGGAGAGC CTGCTGAGGC ACTGACCAGC GGTATCGTCC GATTGGGTCC TTCCCTGGGC GTATGCGCCT ACGACGGCGC CTTGCGGTGC ACACTTTTGT CGTTGTGGCA GTGGATATGG CCGCGCGGCT GCTTCGCAGT CCTGCAGGTC CAGGCTTGGA AGGCACACGA AGGGACGGAC CGCTATTTTG GTCTCGGGGC ACATCACGTA ATGAAGATTA CCGAAGATGA GAAGGTTGCC GCCACAGAGG AGGAGGAAGC ATTGTCCGAG TTAAACAAGA TAGTCAGTAC TGAATTGGAC CGTCAAGCAT GGCGTGGTGA AATGTCTCGC AAGGCTTACG ACATCTATAA ATCTTCCAAA GGTACAGTCC ATGACGTTGG GGCCTCTGAA CAGGGCATCG GCCAACCTTT ACCTCCAATT CAAAGGGATG CTGCATTTAA
|
Protein sequence | MTSAGDSGMR KNDKPSLRMN LSAANLSKTE VVPSVPAAKK RRYAPIDLPL SMASVSDVRV LHQKTSMSFA LQSSLNREEA TWHAQGTTTL FLTTPHPPTQ PLSESSTVPL ALHLRGSCQV SQVHVQTLTW TKNQPEPKPQ TLRTSYHHVD PLQHVLVKPP SSYTFEEDHK KEFRFDADAQ SSRGAVGMTS ALRAASVASN LGELRITFEA LNSKSEETTI DAVQSSWRHL LEATSNEGDV VQRLKSSWNG RAGQRRNRRL NLIASCLAAA SNQNRAFCVT VRYNIAIEKS IAHLGGLHAL TTNRTPHVYT TAGVYGDHEG TRCWLPCADS AAAHHRTSHE FLVHVTAGMT HGLATMGLGE DMGCRETLLH DSFTKTGAKS IPSRAANEFG AQHVKFLEDL SAKNTDNSIS HWIPPEESHT VAMGDIQATH VWCSGTWTPV PARSLGFSIG PFRVLEDPEY FEAAAEAEVD DDEKGPDELI ETARSSGEGI RQAYFCPVFA RKFIHRDASR VLLPETQLVL ADLSSRQREI LNRLDQSVLS TTVGVPHRAL SLMRDVLALP TYRTASYTQI WIPNALHGGV TSGSFHFCPE VLVNPFLGGA VLDSRMLPPT GSRLPFYQGG RVLQCLQARC AIRGWIVASL PLGGKDDVGG GYIFALAESL LMSLYERGNG AHGEGGGKGG VFFSNRYSFG SGLNSSNLDF LPVQNIEDMD VVISGVGAVP VEYEFALVDD RQNDQLWRSA TNGSESHTSS TDEFNVRQLL CRDAVEALER GTDKDKAVPS PSMGWFGSHL SLSFLSSNAA SSSDLGCGGV ELLHPIGGLV YRALKCEVFR GIVEGRAGIA NFVRLIRASF IAAHLEDLGE SELKLAKKRE KEVKDTNSSV EKDEVVPKAP FVVCVNEILK KKGLSHTLFT RALQNLSGRA KEPHLLGTLV DVERHAEDPR TRRPFVEPEG FPNSFVRGAS LLYCRVGVQV EQARDSTGTQ GPTVGKGIQM QAYAEPVIPE GGLAFSGPIT VRVIENEGQF REYVKDIVAD GSRRDWGTTF LHAKPVTTVK AQTAASGTIE ASSRTSKEGS KSLGSVSSSI FTDSFFHSGG YQAIELIRLT NLSPLLWVRV DPMGLYAGRI SVCQPDACLA EQLFHDGDAA AQVEAIRTLA ERPIRIQPSS KVNAVYDVSV MELPVRVLGD CLRGSPALHS SLPHTPAIRS QAALAIAQWQ NNKAPPSKDS VDADSWIGIN LLLHYFRERF YSNSIIMPTK FSRLALKKSD VEMNQAAAAN ESAGANQPTF DDAYEYLDIL EAGEERAAAL EDADQVEIEE DEEYRVRSAV VTAIACTRAK DGLTPSIAVE FLETVLEAED ASVVGHLIYP DEELMIEKNF RTKKYKTELN ADDIEVSTSN YISTPSLSFV SSAMIADALL AMCHINASPT VYTDPATGGK VVSSGPHPVS KLIKIAREWL DWENYRESIR NELAMEDRSG ISGNCYDNIA ACAVTALSTL AILRQSTTEG ERTVSSRNIG NGEKNSERKT EDIATAKFYV AMFDEHPVHN DPTRAACAQA MSCICCATDR FEDEKKPPLG LLSALEFLLD RVLDDEVSPC LKQTLCLIMM DACTGKVSSM QRVGVIGGRN DLFVTAARYF NGPLGASQGN DNGSALLTSV SPSSSPAASA VNDGARRGLR LLSRAGHPRE ALGEEIVVRV ARFATRLWRT INGEPAEALT SGIVRLGPSL GVCAYDGALR CTLLSLWQWI WPRGCFAVLQ VQAWKAHEGT DRYFGLGAHH VMKITEDEKV AATEEEEALS ELNKIVSTEL 
DRQAWRGEMS RKAYDIYKSS KGHRPTFTSN SKGCCI