Gene Information |
Locus tag | PHATRDRAFT_37268 |
Symbol | |
ID | 7201931 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 558455 |
End bp | 564344 |
Gene Length | 5890 bp |
Protein Length | 1865 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181227 |
Protein GI | 219121758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
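The reported gene length can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the arithmetic is consistent with it); the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(558455, 564344)
print(gene_length)  # 5890, matching the "Gene Length" field
```

Because the gene is marked as spliced, this genomic span includes introns and is therefore longer than three times the protein length.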
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.645144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGCC GGCGCACGCG GCAGAAAGGT ATCGGTAGGA TGCCGGGGGT GAGTGTGAGT GTACCCAGCG AACGGTACAT CCTCAGTTCG AAGCGCTGTG GGCATTGGGG CAAGGAGTCG GATCTGTGAA CAAGCTTGTG CGCCGTACTG TAAATGACAG TGAGTCGTTC AAAGTTGTAG CGTCCCAAGT TGATTACAAA CGTTACGTAC GGCACCGCAA GGCAACGGCG ACCCCGTTGA GACCCATCGT TCCATCATGT TGGGGAAACT CGCTACGGTA AGTGGACGTC GCGGTAGTTA CGCTCGGTAG TGGATACGGT AGTCGAGGCT TATCCGTCGT CTCACACACG TCCCTCTTTC TCTTACTTAT CCCTTACTTG GATATTCTTC TATGTATCCC ACTACATATT TGTTCCCCCG TTCAGCAATT GGCCCAGCAG CGTCTAATGT TTGTGGAAAA ACCCGCGACG GACGACGCGT CGCGACGACC CCGAAGCCAC CGTGACACGC GGTCTCTGGG CAAACACGCG GCCGTCAAAC AGCGCATGAA GGCGCAACAA CAAAAGCAAG GACAGACACC ATCACCGTCC CAAAAGCGTT TTCGAACGGT GCTTAGCATT GAAACGGCCG ACGCGACCGC GTCGCCCTCC TTGGCGGAAC CCGAGCCGCA CACGGAACCC TTGCGGGTCG CCGTGACCGT GGACGAAGAC GACAATCAAG GAGCGGCTTT GCCGACGTCT ACCCCCGTTT TGGGTAATAG TGCACCCTCC GACTTGTTCC GGGACGTGGA AGCGGCGCTG CACACCGACT GCGATATTCA CGTCTCGCTC GACGACGAAC CAGAGGTCGC ATCGCAGACT CGGGATGGAT CCTACTGGGC CATGTTTGGG GATACTCCGG AACGGTGGAC CGGTATTGAT ATGACCGAAG AAGTCGCCAT GAGTCTACTA GACACGTCCC TGATTAGTGT CTCGACAAAA GAGCCGGAGG AGGACGACTA CGACGGCAAA GATAGTCACG GTGACGACGC CCCAGCCACG GTCAACAAGA CAATTACCGC TGCCACGGAC ACGACGCCTT TGCATTTTTG GAAAACCTGG CGATCGCAGC CAGTGACTCC CGCATTGGTC CGGATTCAGT CGCCGTCGTT CACTGCTGAA GGGGCTACGC CCGACGACCC CGAACGGCAT TTCCGCAACG TAGACAGCGA ACAACCTCTG TACGCTGCCG ATCGGGTAGC CTTTACCACC TTTCGGCAAA ATTCCACACA GATTGCCTTG CAGGAAAGCA AGACCTATCT CGATCGGATC GCACAATTGG AAGAGGCCTT GGCCAACGCC GAGTACGAAC GGGATTGTGA AACGCAGCGG CGCATAGAAG CGGAATCTAC CGTGTCGGTG CTGCAGTCGC CGTCGTCGTC GGTTCCCAAC CACCCGGCAC CAACCCCCAA GGCAATGCTC CATCCTCCGG TCCCATCCAC TCCGTCATCC CTCCCGGATG TGGAGTCTCT CTGGGAACGC AACAAAACAC TCGTCAAGGA AGTACGCTTT GCCGATCAAA CGTGTATGGA ATTGTCTTCC GAGAAGACGG CCTTGGAACA GAGGATCGAG CACTTGAGCG AGAACATTAC ACGGTTGGAT CAAGAAAACA CATTGTTGCG CGAGCAACTC GGACGGGCCG AAGAGACGGT ACGGCAGGTC AAGACCGACA CCGAAACCGA CAATGCCGCC TCTCCCTCAG ACAAGGATCG TACACGACGG GCCTTGGAAG AGTTCCCTCG TCCAATCGAT CGTAAGGAAG 
GAATGGGCCC GTCCCTTGGC GAGCACATCG CGAATATGGC TTCCCGAGAA CCGTCCACGA TGGAGAACCC GTCCGATCGG TGTCACACTG CCCAGTCGGA GAATCCATCG CTACGGGAAC GGTTGGAGGT CACCTTGGAA GAGCGAACGG CGGCGCTTCG CCAATGCGAA AGTTTATCAG CCCGTGTCTC GGAATTGGAG GCATTGGAAG GCCGTCACCA CAGAGATTTG GAAACGACAG CGGCCCGGTA CGATGATCTC TTCCGACAGC TTGAAGAGGC TCACCAGAAG TTGACTGTGG CCAACCAAAC GATAGTCGAG GGTGACGAGT GTCGGGAATT GCTGCAACGC CGATTGACGA GTGACGCCCA ACGGACTGAC GAACTAGAGC AGTCGCGCTT GTTATTGCAA AAAGCCAACG ACGACGTGGC TCTTTGGAGG ACCAAGTTTG CAGACGGCGA GGGGGAAATT CGCTGTTTAG AGGGTCGCAT TCGGGAGCTC ACCTCCAGCG GTCACGACAG CTCAAGGGCT GCCGGCCTGC TTCGAAAGGA ATTGGGAAAG GTTGCGGAGG AAAAACGCGA GCTGTTGGTC GAGCTGGAGG CGGCCAACGA GAAACTCGCT GTATGTGAGA CCAAATTGGA ACAGTTCGAA AAGGATGGGC AGGAACTCGA GGCAACGCGA AGTGAGCTTG ATGCCCTCCG TAACGAAGAG ACGTTACGAA ACGAGCAACA CACTGAACGG GAACGTCAAT TCTCATCAAA ACTCGCCGAT ACCCTCTCGC GGATCGAGAC TTTAGAGCTA GACGCTGCAT CTGCGATTAA ACAAAAAGAC GAACAATCGC AAGAGCTCGA TGCGGCACTT GGGCAACTCG ATTGTTTAAG AAGAGAAATT GACGATCTAA AGGAAGCACG CTTGAGTGAT GAAAACTTGT TCCAGCAGAA GCTGAGTCGC GCTAGCTGTG ATTTGGAAGT AGCCCACACA GATATTGCCG TTGCCCGAGA CGCATGGCGC CAGAAGGAAA GCGAACTGTT ATGGGCGGTA CAGAAGGTAG TTGCCCAGAC CGAGAGCCAG GCCACTATCT TGGAAGAGAG ACTTGTCAAC CGTCACGGAG AACTCGTAGC TCGAATGGGT CAGGCAGTGG AGGCAGTCTC GTACGTGAGA GAATCAATCA TTTTTGGAGA CAGTACTACT GCTGTCGATA CTGTGAGTGC TGCCCATGGG CTGGCGGAAT CGGCACCCGC GACACCACTT CCTAGTGACG GTGCTTTAGA TGCAACATTC GATAATCCCG GAGGCGCATC GACAGATCTA GAGTTAATGG AAGAAGCTAG GCCTCATGGT TTCGCCGAGT CGTATGGCGT CCACGAGACC GAATTCGATT TTACCGTGTC ACTACCGCTA GACACCGACA CGATGTCTAG CATTGCCGGT ATTTCTCATT TGTTTTCTTT ATCGCCGGTT TCTATCGATC AATCCCCCAT GGAGAATACC CACAATACCA CAACTTTACT GCGCCCGCGA GGCAAGGAAC TCCAGAGCCC GAGGCGACGG ATCAACGAAT TGATTTGCCG TACCTCGGAA CTGGAAGAAC AGAAAGAGAT TGCACTGCAG GACGTCATCG TCTTACAGCA ACGGGTGGAA GAACTCGAAT CGGAGCTCCA TGCAATGGCC AAGGATCTCT CACTAGCATG CAAAGAGAGA AACGCCGTAG CAAAAATCAG CAATTTTCAC AATGGCTGCA TAGAAGCAAC CCTGAAAGAA AACGACACAA ACCTTTGTCA AGTAGAGAAG ACCGACTCTG ATACCACTGC 
ATCGGCAGTT TCAATCCATG TCAACGAGGT CATAGAAAGC ATTTTCTCTC GCAAAGGAGA GACCTTTATC TCCGAGTCTG AAGTCGCCGA CAAGGTTTCC CGAAGCGAAA GAGACATCGA GGGACTTGGA TTGGACCGAA GCATTTTGGA TCAGCGCCTA ACGACAACCA ATGACAAGAA AGGAGAGATA GCGGTTGCAT TGGTGGCTGA CAACGAACAG ATCGACGATA ATACCTCTGA GATTGTGTCG TTGAGATTGC ATTGTGGATC TCTTTCGAGT GATACGGACA AAGTGGTGGT GCTTCGGAGA TTGGAAGACT TGGAAGCCAA ACGTGACAGT ATGCGGGATT CATTTTCCAG CATCCAAGAT TTACCTAAAA GGAACGAGAG GAAATTTTTT GAATCCAGGT TTGAGTCAAT GACGAATGAT TGTGCGAAGG CTAAGGAGAG AATCGAAGAG CTTGAAGCAC TGCTAGGGGA AAAGACAAAG CAATGCCAAG AGCTCGAGGC TCACATTGCT ACAATTCAAG AATCGTATGC GGATGCCAAA GTAGAAAAGC AAATCATTCA AGAGTCGATG GAAATAAGTC TGTCTCAAAA GATAGCCGAG TCGCAGCAGG CAGCCCAGAT ACGACTATCT GCATCGGAAG AACAGCTTGC CATCGCTCGA GATGAAATCG CTTTGCTTCA TTCGCTTCAC AAATCGGTCA CATCAGAGAA AAAGGAAAGT GCCACTAAGA TTGCAAGTCT GCAGGAGGCT TCCAGCAAGC AACTGACCGA ATTGAAGAAG GCGAATGAAT CTTTAGACAT AATTGGCCAA GAACGAGATG ACTTGTTGAA GAGCTTGTCG AAGCTTCAGT TCGAAAATGC CGACGCCAAC AGCGAAATGG AAAAACACCA GGAAGCCCTC TTCGAGACTC AAGAAAGCCT TACCGAGTGC CAGCTGAAAT TATCCGTCCT AGATGCCATC ACCACCGAAA GAGACTCCTT GCTCAACAAA ATTACTGAGC TTGAGCTGGA ATGCACTGGG CTGAAAACTA CTATGACTGA AAGTTCGGCT ACCGAACGCA AGTCTTTGGA ACACACGCTC TCGTCACTAG AAGAAATTCA GGTCGAAAGA AAATCTCTTC TGGAAAATTT GGATTCCGCG AAAATCGAAA ATGGACTCGT CAGGAAACAG CTAGATTCTA TGAAGGCTGA GCTTGAAGAC GACAAGCGAA CGAGCGCTAA ACTTCGCGAA TTGTACGAGC AGCAAGTTGA AGCCAACGCT GAATTGCGAA GGGAAGTCTT CCACTGCAAA GGTCTCGTGG ACGATTCTGA TTTTGCAATG CAGGATCTCA AAGAGAAATA CCTCGAGTGC AATGAGAAGC TTGCGAATTT TTGTTTTTTA CACAATTCTA ATGAAGAATC CAAGCTGATG TATGATCGTG CTCGTGAATC CGCTGAAGGG CTTGCAATCG AAAGCCAAAG GCATCTCGCA AAAGCCCGGG AAGACCTGGA AACAGCACTT TCCGAAAATG GCAAAATGCA AGCAGAGTAT GAAAAATCCC AAGAGGTCCT CTCCGATGTC CGACGAGAGC TAGCAGAGCG AAAGGAAGCT ATCAAGGATT TAGAAGTGTC GCGAGAAGCA GCTATTATGG GGCTTGCTGA ATACAAAGAA CAACTCAATT CGCTGGAAAT TTCTCTCGAT AAACGGACGC AGACTGTGAA TCAGCTCCAA GCCGACGTTA AGGAACGTGA CGACGCTCTT TCGAAGTTGG ACCAACATAA TGCTGAAATC AACGTGTGGG AGAAACGCAT ACAGGAGTCA 
AACGATGCTC TTTCCAGACT GCAGGATCAG CTTGACGAAA GCACGGCATC CCTGCGTACT ATGACTAACG AATTCAAAAT GGCATCGACG CGAAGCGATC ATTTAGAACT AAAGTGTTCC CGTCTTCGAG ACTACATTCG GAAGGTGACT GGCAAGTGTG ACCAGTGGGA AGACTTTTAC GATCGACAGG CTGAGGTTGT GGAGGGCCTG AAGCGTGCCA ATGAGCGGAC TCGTCAAAAG ACTGCTGAGC TCGCTCGTCG GTACCAGGAA CGAGACCAAA TTCATGACAA AGAGCGTGCC GTTTGGACAG CGCAGAAGTG CAATCTGGAC TTTATACATT CGCAGCTAGA AGAAGAGTTG CATGGGATCG CCAACGAGCT AGCGCATGTC GAAAGCCGGC CAGTTTCGAG TTAATGCTTA ACTGTGAAGT CAAATGATTA CCTAATCTCC AGCCCAGTTC CGCTGTTTAA
|
Protein sequence | MSRRRTRQKG IGRMPGVSFE ALWALGQGVG SVNKLVRRTV NDSNGDPVET HRSIMLGKLA TQLAQQRLMF VEKPATDDAS RRPRSHRDTR SLGKHAAVKQ RMKAQQQKQG QTPSPSQKRF RTVLSIETAD ATASPSLAEP EPHTEPLRVA VTVDEDDNQG AALPTSTPVL GNSAPSDLFR DVEAALHTDC DIHVSLDDEP EVASQTRDGS YWAMFGDTPE RWTGIDMTEE VAMSLLDTSL ISVSTKEPEE DDYDGKDSHG DDAPATVNKT ITAATDTTPL HFWKTWRSQP VTPALVRIQS PSFTAEGATP DDPERHFRNV DSEQPLYAAD RVAFTTFRQN STQIALQESK TYLDRIAQLE EALANAEYER DCETQRRIEA ESTVSVLQSP SSSVPNHPAP TPKAMLHPPV PSTPSSLPDV ESLWERNKTL VKEVRFADQT CMELSSEKTA LEQRIEHLSE NITRLDQENT LLREQLGRAE ETVRQVKTDT ETDNAASPSD KDRTRRALEE FPRPIDRKEG MGPSLGEHIA NMASREPSTM ENPSDRCHTA QSENPSLRER LEVTLEERTA ALRQCESLSA RVSELEALEG RHHRDLETTA ARYDDLFRQL EEAHQKLTVA NQTIVEGDEC RELLQRRLTS DAQRTDELEQ SRLLLQKAND DVALWRTKFA DGEGEIRCLE GRIRELTSSG HDSSRAAGLL RKELGKVAEE KRELLVELEA ANEKLAVCET KLEQFEKDGQ ELEATRSELD ALRNEETLRN EQHTERERQF SSKLADTLSR IETLELDAAS AIKQKDEQSQ ELDAALGQLD CLRREIDDLK EARLSDENLF QQKLSRASCD LEVAHTDIAV ARDAWRQKES ELLWAVQKVV AQTESQATIL EERLVNRHGE LVARMGQAVE AVSYVRESII FGDSTTAVDT VSAAHGLAES APATPLPSDG ALDATFDNPG GASTDLELME EARPHGFAES YGVHETEFDF TVSLPLDTDT MSSIAGISHL FSLSPVSIDQ SPMENTHNTT TLLRPRGKEL QSPRRRINEL ICRTSELEEQ KEIALQDVIV LQQRVEELES ELHAMAKDLS LACKERNAVA KISNFHNGCI EATLKENDTN LCQVEKTDSD TTASAVSIHV NEVIESIFSR KGETFISESE VADKVSRSER DIEGLGLDRS ILDQRLTTTN DKKGEIAVAL VADNEQIDDN TSEIVSLRLH CGSLSSDTDK VVVLRRLEDL EAKRDSMRDS FSSIQDLPKR NERKFFESRF ESMTNDCAKA KERIEELEAL LGEKTKQCQE LEAHIATIQE SYADAKVEKQ IIQESMEISL SQKIAESQQA AQIRLSASEE QLAIARDEIA LLHSLHKSVT SEKKESATKI ASLQEASSKQ LTELKKANES LDIIGQERDD LLKSLSKLQF ENADANSEME KHQEALFETQ ESLTECQLKL SVLDAITTER DSLLNKITEL ELECTGLKTT MTESSATERK SLEHTLSSLE EIQVERKSLL ENLDSAKIEN GLVRKQLDSM KAELEDDKRT SAKLRELYEQ QVEANAELRR EVFHCKGLVD DSDFAMQDLK EKYLECNEKL ANFCFLHNSN EESKLMYDRA RESAEGLAIE SQRHLAKARE DLETALSENG KMQAEYEKSQ EVLSDVRREL AERKEAIKDL EVSREAAIMG LAEYKEQLNS LEISLDKRTQ TVNQLQADVK ERDDALSKLD QHNAEINVWE KRIQESNDAL SRLQDQLDES TASLRTMTNE FKMASTRSDH LELKCSRLRD YIRKVTGKCD QWEDFYDRQA EVVEGLKRAN 
ERTRQKTAEL ARRYQERDQI HDKERAVWTA QKCNLDFIHS QLEEELHGIA NELAHVESRP PSSAV
| |
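The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch of removing that grouping and computing a GC fraction, which could be compared with the "GC content" field (reported as 52%). The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is a toy example rather than the gene sequence itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy example in the same block layout as the record.
example = "ATGTCGCGCC GGCGCACGCG"
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))
```

To apply this to the record, paste the full "Gene sequence" value as the input string; the same `clean_sequence` helper works for the protein sequence when checking it against the "Protein Length" field.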