Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43057 |
Symbol | |
ID | 7196250 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1927585 |
End bp | 1932099 |
Gene Length | 4515 bp |
Protein Length | 1315 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177403 |
Protein GI | 219111303 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTCTT CCTCGTATGA TTTCCAGATG GAGTTTACAG GAATAGCTTC GCACACGCAG CTGCCGCCGT CTCCGGAACT GAGAAGCGGC TCGTCGGATG GCTACCTTGG TAGCGACAGC TTTTCCCTGG ATCCCGATCC ATCGACCGGG ACAGAAAGAA ATCCGAGGTC CAGGCTTGTT TCGATCGCTC GATACAGTTC CTTCCAGCCT CAAGCTCTGG CATGCTCGCC TCCTCTTCCG GTAATGTACT CCGAGAGTCA AAGCTCTTCC GGGAAACGTG GCGTTTCCTC TGCCTCGGCT ACATTACTAC GACGGACTCC AACAGAATTG ACCGGAACTA CGTCGCTAGG CGCGCTTGCT GGGCTACATG GAGTCGCTGT ATTCCGCGTA TCAAAGCCCC ATGAACCGCT ACTCGTGCTT AAACATGCTA CCGGATTCAA TAGCACAAAT CGGACTCTCG GCAACGGCGT TCGCTCGCTT TCGTTTCAAC CAGATTCATC GCAATCATTT TACTTAGCGG CTGCGCGAGG TTCGGGTGTA CTTGTGTGGG ATGTGAGCGG TCACAGTTTA AGCCCTTTGT GGGGTCGACT TGAGATCGAC GATGGTACCA CAGGAGCTGC TCGGCCACCG ATTACAAGTA TATGCTGGCA ACCCGTGACG AACAATGCCC CATATTTAGC TGCTTCGACA GCGAAGAGCG CTTGTATATG GGATTTGCGA GAGTCTGCAG TGGCTTCGAC ACACTTTAAA CCCAGTTTAC GATTTGGGAT ATCTCGGAAG AATCTAGTTA CTGCGGCTCC GTATGTTCAG CTCGCTTGCT CAGGGCAGAA CGAATGCGCC ATCTTGGATG CTGCCGGGAT GCTGCGGATC TTCGATATGC GCCTAACTGA CCGTAGTCGC CTGTCGATGG GCGATTTGAG CAGCTTTGCA GCATTCAGTC ATGCCGGAGT GGGCGTAGCA CATATGGGGA AAAGTCTTAG TAGCGAGAAT GCGAGATGGG TAGCATGGGG ACTCGACGCA CCATCAGGCA GCGACGCTAC AGTAAAGGTC TGGTCAGAGA CAACCGGGAA CTCCCAGGAT GTATCTGAAG TCGAAGGCAA CGCGTCGGAA AATTTATGGC TTACGGATAC ACCATCAGGA CAAGACACGC CAATGGAGCT GGGGAAAATT TCGGACTATC ATCTCTCGGC ACAATGTACA CTGCCAAACC TGGCATGCGC ACGGGTTTGT CCAATACCGA CAGAAGATAG TATACTCACG GTGAGCCTTA CGAATAATGA AGGATTACGT TCGTGGAAGG CTGATCTCTG GAAGCTTACG AATGATACAG GAACCCAAAG TATGAAACGC ATCGTGAGTT TTACTGCAGG GGAAGGGTCG GACTATGGTA TTATGTCGAC GGTAGAAAAT GCCGATACAC TTGGGAATGT TTGCGCCTCG GAGCTGGCCA TAGTCAATCC TCTTTCATCT ATAGGAGGCA CCGCTGCTTC ACTACCTGCG CACAGTACAT TGATCCTTTG TAGCTTGACT GAGAATGGAT ATGTTACCAC GCATGTAAGT TGTTGTTCTC GAAAGCCTAT TTTATACGAC AAAAAAGGTA TCTCCTCACC GTGTGCTTTT GGGATGTAAA GTCTATACCT GAGGCTATAC CAGCGCAGTC AAAGTTGACC GAAGAATTAC GAATTGGCAA TCCTCGCAGT TCGTTATTCC CCTCTCACTC ACACATACGA GTATTTCCAG GTGAGCAGGA AATGATGGTA ACAGACGCGG CGAAAGCTTG GACCGATCCG ATTGAGCGCT TGGTGCGGTT ACCAAGCGAC TATCTACCAG CTCAAAAAAA CGATGTTCAA TCTATTCAAA CTTCAGGAAG CGGTCTTTTG GGCATTGACA TGGAAACGCC TTTTTTGTTT CATACGACAA ATGCCTCAGT TTCAGGAGTT CCAAACAAGT CGTTGCCGGC GAACGCAAAG TCTTCCCATA TTATCTCAGG CCCGCAAACT CAGGAGGACG GTACTCCGGT TGTGACAAGA GATATAGACG CCGATCGAGT TCCTTGTCCT CGTTTGTGCG GTGCAACTTT TGGACAAGGA ATAGGCGGTT TGTCGCTCTA TCGCAACGGT GACGTGAAAC AGATGTGGAC ATGGTACGAT CAAAAATCCG AGGTTCGTAT TCGTTCTATT CCAAAGCTGG TCAACAGCCA ATCTCCAACC TCTCGCAACC ACCTTTCGGT GAAACACTTT AAGGAGGAGA AGCCACAGCG AGAGTTCCCT AGGACTGTTC GAGACCTTGT AGAAATGACT ACAGCTGCTA AAGAAGCACA ATGGGGTGAA CACAGCGAAT CGGATATTTC GTCTACTAGC TTTCGAGTTC ACGGTGAGAG CTTCTTTGAA GAAGACTCAA ATGGTTCCTC CGATAGCAAT GATGTACTTA AATTGGACGC CAGTGGCGAA GCCGAAGAGA AGGCACTGAC GAATCTGTAT GACGCGCATT TCTCCGAATC TCGACAGCTT ACGCCAGCAG CGGACGACAA CAATAAAAAG AGTAGTTTCG AAGGGCACCA GTCACAAAGA AAAGCTGTAC TCATCGATCT GGGCTTTGGA GCATCCTCTG AAATGCTCTC CCCTTTGGTG CACGTCACAT TTTCTTACGA TAGAATTTCT CTCAACCAGC AAAGCGCCTA TCTAGCAAGA AACTGGCTGC TCGGTGAATG GAAGCCTCAG CAGCCGCTTA GTTTTTTGCT ATCCAAGCCA ACAGCGACTG CTTCAGACGA GGGCAATACA GATCCATTGT TGGCTCTACA GGGAAATGAG AGCCCCCTTC ACCTGGACTC TCTTGCTTTC CAGGATTTTG GTGAGTTTGT TGACTCTTTC TGGCCGGCAA GGTATCAAAT TATTGACTCT TCGCTTTTCT TCAGGACGAG GCGATCAAGT CACGTCGGCT TCTGATTTGC CACACTCTAT TCGTCAAAAA TCATTGGAGA TGTCGTCCCA GCCGCGGTCG AGAATTCATC AGAGCGAGGA AGAACGAGGT GTTCGTGGCG GCGAATACAG TGTCAGACCC GGTATGCAAG AGTCTATGGT TTTTCTGAAA AAGCTTTTTT CTCATCAACA AGATGGTATG AAGTCTTTTC CGAGCTTGAT GTCGCCTCCA GACGGTCCTC TACGTACGTA CATAAAAGAA AACGATTTTG TATTGATCAA AACGCTTCAA GAGAGTGTCT CTTAACCACG TCTTCCAAAG TTCCCAAAAG GCGATCTCTG GCAAGCACCC GCAATCATGA GAAAATGGAG GGACATCCTG AATCAGAGGG ACAAGGACGG AGTAAACTTG GTGTTTATTC TATCAATGTC GCCAACGCTG GGCTAGATCT TGGCGAAGAT CATGGATTGA TGAGCATCAG AACCATTTGT GATCATAATG CTAAAGTCTC GAGAAAAGCT GACGAATTTG AAAAGGCAGA AGTATGGGAC CTTTTGTGTC AAGTGGTCGA AGGTAGGATG ATGGACAAAG GAAGCACATT CGATGGCTGG GGCGGACCAG GTGGCGGGGC TTTAGGGGTC AATCTGGTTC AGAATATTTT CCAGTATTAC GAGTCACTTG GAGATGTACA AATGCTCTCG ACACTGGTGT GCGTACTGCG CGGCGGATAC TCCTCGCAGA ATATTTTGCG TCCCCGATGG TATTTGCTTC CTCGCGACCA AGAAGTGAAA TACGATACCT ATATTCACCG ATATGGTAGC CTTCTATACT GTTGGGGACT CCTTTCTATT CGTGCTGAGT TAAGGAAGCA TATGTCTGGA GCATTACCGT TTGATGTCTT TTCGGTAGGA GGAACAAATA GTGATAACCA GTCGACAGGG ATTGCACTAG CTTTCATTTG CCCTCAATGC GGAGGTGAGT CTGAAGTGGG CTACAACTAT TGTCGTGTCT GCGAAGACTA TGCATTTCGC TGTATTATCT GCGACAACGC CGTCAGAGGA TTATTTGCTG TTTGCGACAG GTATGTCCCC TAGACTCTTG AAGGACGGCA AGTAATGCTT TTGCAGGTCT TACATATTTT TCTTCGCATA GCTGTGGGCA TGGCGGACAT GTTGGACATA TACAATCTTG GTTTGAAAGA AATTCTCAAT GCCCTTCGGG ATGCGGCTGT AAATGCATTT TTGGACAAAA AGCACTAGCT TTGAACAACC CTAAAGTTAT TGCAGCACAA CACCGCCCCA TGACAACTAG AGCAGACACG GTGAGATCCC TTGGCTCATT GTGAAGCAAT GACTGCTGCG AAAGTCTGGT ACATGTTATA CTTTCATAGC ACTGCTTGAA TAATCTGTTC ACGGTTGAAT GATGGGAAGA GCATATTCCG GTTCAGATCC TTCTCTACCA GAGTTAGGTA GCAACATTTC ATGCTTTTAC CTTGTGAAAC AATGCGAATG AGCTGCCCTG CCGCATAGTC ATCCTCAACC TAGTCTGACT GGCTAAATCA AGATACATGT ATAGATAGTC GCCTCTGTTA GAACT
|
Protein sequence | MHSSSYDFQM EFTGIASHTQ LPPSPELRSG SSDGYLGSDS FSLDPDPSTG TERNPRSRLV SIARYSSFQP QALACSPPLP VMYSESQSSS GKRGVSSASA TLLRRTPTEL TGTTSLGALA GLHGVAVFRV SKPHEPLLVL KHATGFNSTN RTLGNGVRSL SFQPDSSQSF YLAAARGSGV LVWDVSGHSL SPLWGRLEID DGTTGAARPP ITSICWQPVT NNAPYLAAST AKSACIWDLR ESAVASTHFK PSLRFGISRK NLVTAAPYVQ LACSGQNECA ILDAAGMLRI FDMRLTDRSR LSMGDLSSFA AFSHAGVGVA HMGKSLSSEN ARWVAWGLDA PSGSDATVKV WSETTGNSQD VSEVEGNASE NLWLTDTPSG QDTPMELGKI SDYHLSAQCT LPNLACARVC PIPTEDSILT VSLTNNEGLR SWKADLWKLT NDTGTQSMKR IVSFTAGEGS DYGIMSTVEN ADTLGNVCAS ELAIVNPLSS IGGTAASLPA HSTLILCSLT ENGYVTTHSK LTEELRIGNP RSSLFPSHSH IRVFPGEQEM MVTDAAKAWT DPIERLVRLP SDYLPAQKND VQSIQTSGSG LLGIDMETPF LFHTTNASVS GVPNKSLPAN AKSSHIISGP QTQEDGTPVV TRDIDADRVP CPRLCGATFG QGIGGLSLYR NGDVKQMWTW YDQKSEVRIR SIPKLVNSQS PTSRNHLSVK HFKEEKPQRE FPRTVRDLVE MTTAAKEAQW GEHSESDISS TSFRVHGESF FEEDSNGSSD SNDVLKLDAS GEAEEKALTN LYDAHFSESR QLTPAADDNN KKSSFEGHQS QRKAVLIDLG FGASSEMLSP LVHVTFSYDR ISLNQQSAYL ARNWLLGEWK PQQPLSFLLS KPTATASDEG NTDPLLALQG NESPLHLDSL AFQDFGRGDQ VTSASDLPHS IRQKSLEMSS QPRSRIHQSE EERGVRGGEY SVRPGMQESM VFLKKLFSHQ QDGMKSFPSL MSPPDGPLLP KRRSLASTRN HEKMEGHPES EGQGRSKLGV YSINVANAGL DLGEDHGLMS IRTICDHNAK VSRKADEFEK AEVWDLLCQV VEGRMMDKGS TFDGWGGPGG GALGVNLVQN IFQYYESLGD VQMLSTLVCV LRGGYSSQNI LRPRWYLLPR DQEVKYDTYI HRYGSLLYCW GLLSIRAELR KHMSGALPFD VFSVGGTNSD NQSTGIALAF ICPQCGGESE VGYNYCRVCE DYAFRCIICD NAVRGLFAVC DSCGHGGHVG HIQSWFERNS QCPSGCGCKC IFGQKALALN NPKVIAAQHR PMTTRADTVR SLGSL
|
| |