Gene Information |
Locus tag | PHATRDRAFT_37844 |
Symbol | |
ID | 7202649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 200020 |
End bp | 203877 |
Gene Length | 3858 bp |
Protein Length | 1285 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182025 |
Protein GI | 219123424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
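The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method. It assumes 1-based inclusive coordinates, an unspliced gene (as the record states), and a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the CDS length is a multiple of 3 and, when
    has_stop_codon is True, the last codon is a stop that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    # Values taken from the record: Start bp 200020, End bp 203877.
    length = gene_length(200020, 203877)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the record's values this gives 3858 bp and 1285 aa, matching the Gene Length and Protein Length fields.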
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.378594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTAT CTTCGACAGG TCAGTACTCT TATGCTGTAC GTACTGAGAA AGGAGTTCGC TATCTCAGTA ACGAGGATGA GATCAAAGAC TTGGTCCGAA AATCAATTCA GCGCGCGAAG AAAACGTCCT GCTTACTGTC ACCGGAACGT GTAAGTCTTC CAGATAGTCC GGGGAAAAGG AGTGTTCTTT CCTCACTAGA TAGAGTCGAA ACACAAAGGC GGGCATCAGA GTCTTGTCCA CGTGAAAACC TGATGGCGCC GGGAATGAAT TCTGATGGAG ATAAATCAGA GACTGCTAGT CGCATCCAAC CTCCATCATC CCCGCTATCA CCTTCGTACG TGGTAAAGAC CGAGTCTGGT GACCAGATCA TGATGTCAAA GGAACAAATG AATCGATACG TTAGATCTTC CATTCAAAGA GCCAAGGAAA GCGCCGCTAC GCAGCAGAAG AAGACAATGT CAAAAAAGAA TTCTGTCTGG CCGAAAGTTC GATGTTCCAA CACGTCGGAC TCACCCAAAA CAACGAATCA CTTTTCCTTT TCTGATTATG GAGAACTAGC TTTGGATGAT CGGAGTACAC CAGCATCTTG TCCTCCGATG AAACCGGCGA AAATCATTCC CAAGAACTCG TGTTCACCAT TCACTGCCAA TGAAGCACTC ATTTCACCAT TTACTAGCAG CAAATCATTT GTTAGCAAGA CTCAGCAGCT GGAGGTTTCT CGTCCATTCT TTACTCCGCC CGATGACAGA CACAATAGAC AAGAGCATAG CGAAAAGTAC GATAAAAGGC AACGCCATTC TCTGATGGAA GATGAATCCA GCAGCACGTT ATGGGTCAAT CCTCTCCCTC GATTCTCGTG GTCTTCCTTA GACAATAGCC AGGAAGGGGA GGAAGAATGC TCACTTGCTT CACCTATGAT CGAATCCAAG CTACCGATTG TTATTGGTGA ATCAAAGTCG AATTTGCACC ATTCATGTGT CAACTCAGCT AAAGCAACAA AGAGTGGTAT AGAAATCGGC GGCGAAAAGA AGTGCAGTGC ATTGAACAAT GCTTCAAATT CTCAAGATCC AATTTACCAA GCAAATAACA GGATAATGAA TTCTGATATT GAGCGGAAAA TTTCAGCGAA GATAACCATA ATAAAAGATG GCTTGTTTTC GGACGTCCTG TTGCAGGCAC AGGAGGGTGA TGTGTCGACA GAAACCGCTT CTGCGAGCAG CCTGACGTCC AAAAGCTCGC GCAACAAGCT TATGAGAGAC CGATACACTT TGAAAGTAGC TGAGGATCCT GAAGTTGCCA ACATGACACT CGAATCTACG AACGGAAACT CCAGCCCAAT TCCATCGAAC CTTGCAAATG AGATGAAATT AAATTCGGCT GGAAAGGCGG CGTCACCGAG TTCTCGTAGT CCAGATCCCA AAGCAATGCA GTCACCAAGT TTGCCAAATT CTATCGAAGA TAAATCTAAG GCTGAGAGTG GCCGCGCATC AACATCACCT CGAAAACTTC GACTCCAAGC GATGATGGCC GCTCGACGGG CAGCTCTTTG TAAATCTCCT CCAAAAGAGA AAATCGATGT GAACACCAAG TCTGAAGAAT TCACTTTATG CGATAGCAAA AATGCTATAC CTGGTTGTTC AACCTCTCCT ATTTACGTGA CGGAAGTCGA TCTTATCAAT AGTCACACAC AATCCCGACA ACGCAGGAGC CAGATAAATG TTACAAAAAT TTCGACACAG CCGGAAAGGT CCGCAACACA CAATCACTTG CCTGAGCAAC TGCCTACCCT CCTTTTACTG 
AAAGATGAAA AGAATTCAAG TGAATACTTG ACCATGTCGA GTCACGCAAG TAAGGATGAA CAGATAAAGG ATCCAGTAAT TGACCGTTCT TCTTATTCCA ATGAAATCAC GCCACCTACG ACTTCGGATT TAGGTAAATA CCATCAGCTG GTACACACAT CATCTGATAA AAAGAAACAT TCGACTCAGC CATGCGCTGA AAACGCAGAA TTTACAATGG AAGAGTGCGA TGTCGATCAC TCGCCGCTTC CAATGAGAAT ACGGGAAGAC CTGGACTCAC CTTCGAACGG CAGTAAGGTG GACAAGCCCG ATCAATCCGA CGAGAAAGAA GATCAGAAAA CCCTTCCCTC TATTCCTGTA GATGAAATTG CGGCAACATC AACACAAATG ACGACTGAGA GCAGACCGGT CGATTGGCAT ACCAAAGACG ACCGACCGTT CGTTGAGAAT GAAGGAAAGA CAGAAAGTGC TAGATCTCAC TCACCGGACG TGTTCGACGG TTTACTGTTA ACAAGCAGTG ACGACAATGT AAATTCAATT CCAACACAAC TTCAATCCCT AGATTCCAAA TTGCCGCCGA AAGGAGTGCC GGCATTAATC CAAGAAACGG ATTGTGAATC GATGCCAACC ATTGCCGAGC CAAATGAATT CGAAATCGGA CACAGAAAAC TGGACAAGGT TGAAGCTTCC GTAGTTTTAC ATTTCACTGA TGCAAACGAA AAGAGTCAGG TGATTGCGCC TCGAGAAAGT AGTGTCCTTC CGATATCGAA TGGCGAAAAC GAAACCTCAA TGGAGCAACC ATCCGGACTT ATGGAGCATA GCAATTCCAT CACAAAGCCT TGTGCAGTCT TGAAGCCCGT CGATGTGGCT ATAGTCAGTA GTCCATGCGT AGAGAGCTCT AGTCGGCAAA CTGTTGCGAC AACTATTGAA GACCAGAGTG GCCCTTGGAC TCGGACCGAT TTCGGCAATA GGCCCCAGTC GGAGCAGCCT ACGACACCAA AGTGGTCACA CAACCGGCAA CGAACTGTTG TTGGAGAAAA GTGCGACACT AGATACAAAG ATTTAAGTCC GGACATTGAC ATCTCGCTAA CGCTGAGTGA TGGGGGTCTC GCCGAGGCAG CAAAAGTGGA AATGATGCGA AGCCGACCTG CCTCTGGCTC TGTTTCCATA GCAAACTCTC CGTTTGCGCA GAGCGATGAT GAGTCTCTAG AGGCTAATCA AGACAGGCGT ATTTCCCACA CGCCGAAGCG AAAACGCGCA ATTAAGCGGC ATTTACCCCT TCATCACACG ACTTTCAGAC ACTCAGTTCA GGTTGATTCT CCAGCGCAAA CTCTTCGCCG GAGCAATCGT TGTCGAAACG CTGGTAGGTC GTCTCGAAAG ATTAAGAAGT ACGCCAAACC TCGCCATGAC GAAGGCGATG ACCCATCGGA AGTGATTGTA ACCGATGAGC TATTGTTAGC AATCGCTGCA TGGAAAGGAG TCAAACTTAC TGCAGGCGAT TTTAATAAGA TTCTTGAATC CCAATCGGTT AGCGGCTCTG TTGCCACAAG GGAGACTATC AACCGCACCA AAGGTCATTC AGTTTCATTT TCTCCTCACG CCATTGCGGC ATCTAAAATG CAGATTTCGA GCCAGCATGA TGAAAAGGAT CGGCGTTCCA AGTTCTGTGT AGATTTCATG GATATCTTGG ATCTGGGTCC GGATGAGTTG ACCGACGAAG CAGGTAGTTT TGTGGATGAT GAGTCATCGA AACTGTTCTC GAGATCGAAT CTAGCTTGTT CTATGGTCTT GGAAAACACA ACAAACTCTA 
GCTTCGAAGA GACCGACGAA GACAGCAGAA TAAGTTTCAA GGAGAAGAAC ATATTCGACT CACTAGCCGA CAAATTAAAT ACTATAATGG AGGGCAGAGA CAGCGAATCG GACATGGAAC GATCAAAAGA ATTTAATCTA CGATCCTTTT CATTCTCAAG AGGAGATCAA AGTGACACAG AAGACCACAC AAGTCGATCT TTCAGTGAAG CTGAGGAAAT CAGGCAGGGT TGGTTCAAAA TGGGATGA
|
Protein sequence | MSLSSTGQYS YAVRTEKGVR YLSNEDEIKD LVRKSIQRAK KTSCLLSPER VSLPDSPGKR SVLSSLDRVE TQRRASESCP RENLMAPGMN SDGDKSETAS RIQPPSSPLS PSYVVKTESG DQIMMSKEQM NRYVRSSIQR AKESAATQQK KTMSKKNSVW PKVRCSNTSD SPKTTNHFSF SDYGELALDD RSTPASCPPM KPAKIIPKNS CSPFTANEAL ISPFTSSKSF VSKTQQLEVS RPFFTPPDDR HNRQEHSEKY DKRQRHSLME DESSSTLWVN PLPRFSWSSL DNSQEGEEEC SLASPMIESK LPIVIGESKS NLHHSCVNSA KATKSGIEIG GEKKCSALNN ASNSQDPIYQ ANNRIMNSDI ERKISAKITI IKDGLFSDVL LQAQEGDVST ETASASSLTS KSSRNKLMRD RYTLKVAEDP EVANMTLEST NGNSSPIPSN LANEMKLNSA GKAASPSSRS PDPKAMQSPS LPNSIEDKSK AESGRASTSP RKLRLQAMMA ARRAALCKSP PKEKIDVNTK SEEFTLCDSK NAIPGCSTSP IYVTEVDLIN SHTQSRQRRS QINVTKISTQ PERSATHNHL PEQLPTLLLL KDEKNSSEYL TMSSHASKDE QIKDPVIDRS SYSNEITPPT TSDLGKYHQL VHTSSDKKKH STQPCAENAE FTMEECDVDH SPLPMRIRED LDSPSNGSKV DKPDQSDEKE DQKTLPSIPV DEIAATSTQM TTESRPVDWH TKDDRPFVEN EGKTESARSH SPDVFDGLLL TSSDDNVNSI PTQLQSLDSK LPPKGVPALI QETDCESMPT IAEPNEFEIG HRKLDKVEAS VVLHFTDANE KSQVIAPRES SVLPISNGEN ETSMEQPSGL MEHSNSITKP CAVLKPVDVA IVSSPCVESS SRQTVATTIE DQSGPWTRTD FGNRPQSEQP TTPKWSHNRQ RTVVGEKCDT RYKDLSPDID ISLTLSDGGL AEAAKVEMMR SRPASGSVSI ANSPFAQSDD ESLEANQDRR ISHTPKRKRA IKRHLPLHHT TFRHSVQVDS PAQTLRRSNR CRNAGRSSRK IKKYAKPRHD EGDDPSEVIV TDELLLAIAA WKGVKLTAGD FNKILESQSV SGSVATRETI NRTKGHSVSF SPHAIAASKM QISSQHDEKD RRSKFCVDFM DILDLGPDEL TDEAGSFVDD ESSKLFSRSN LACSMVLENT TNSSFEETDE DSRISFKEKN IFDSLADKLN TIMEGRDSES DMERSKEFNL RSFSFSRGDQ SDTEDHTSRS FSEAEEIRQG WFKMG
|
| |
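The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned before analysis is given below. The helper names are hypothetical, and the GC calculation is a plain count that may not match how the database derived its rounded "GC content" value. Because the gene lies on the minus strand and the displayed sequence begins with ATG, it appears to be shown in coding orientation; the reverse complement would then give the corresponding forward-strand sequence of the replicon.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned sequence containing only A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # First two blocks of the gene sequence shown in the record.
    fragment = clean_sequence("ATGTCGCTAT CTTCGACAGG")
    print(fragment)
    print(gc_percent(fragment))
    print(reverse_complement(fragment))
```

Applied to the full "Gene sequence" field, `len(clean_sequence(...))` should equal the reported gene length of 3858 bp if the field was copied intact.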