Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49814 |
Symbol | |
ID | 7198383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 426592 |
End bp | 432771 |
Gene Length | 6180 bp |
Protein Length | 2034 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184542 |
Protein GI | 219128695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0269237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCGTT TGCTTGTTGT CTCTATCACT CCTAAAGAGA GTACGCACTG GACCATTTCC TTTCATACCT TCGATCTATT CAAAACAGAG ATACGGAGGA CGACCAACTT GATTGTTAGG GCCTGTTCAG TGCAGCTTAT TGCCGTATCG TCGTCAGAGG ACGAAAATGA AATTGCTGTA TGGTCGGCTA GACCGCACCC AGGTTGTCTT CTCGAAAAAG GCGGCGGAAA AGAGCATGTA CAGGATATCA GTTGGGATTA TTTTGTTTTT CTGTTGAGAG TTAGTCCAAA CTCAGTCGGC ACTACTTCAG ACAAAACCAA GATTATTGAT TTCTCTTTCC TTCGATCTGG CTTTCTAGAC GCATCTCCCT CTTTAATCTG CTATTCTGGC AAGAGCGTTT CGATATACCA GCGCTGCGGT GGCGGACGGG ACTGGGAACA AAGAACTGAA ATTTATTACT CAAACAGTCC AACCTCGTCG AGTACTACAA TGACTGCATC TGCTTTTGGC TCAACTGCGC CTTTTGGCGT CTACTCGAAC CCATTGGATG TGTTTCCTCA CATACTACCT GCCTTGCAGT GTGCCTTTGC GTCATCAGAC GAGAGCACCT TCTTACGCTC GGACTGGCAC CCGGACTCCT TTCTTGCAGA AATTTGTTTG GATGATAGAG GTGTTAAGGC TGCTTTGGAC GACCGACTGC GATCTTTGTT TTTGTGGCTT AGCTCCGTAG GAAAGGATTT GTCGGACGAA GATTTGGAAG GGCCTCTACG TGTAGCACCC CTGCTCATAG TCGAAGAAAA TTTTTCTGAA GACAGCGATG GTCAGGATCA ACCTCGAGCA AGACTGACTC TGCTAACGGG TCCGGCTACG GACCAAACAC TCCCTTCGCA AAATGTCGCG AGAGAAGTGA GTAAACTGAA GAGTTTTCGG GATATACTCT TGTGTAGCAT CGATCAGGCA ATTCCTGAGT CGTCTTCCAC CGTTCAGTCG CTCGATTGCG ACCTAGGCAT CAAAAAAGAA GAGCTCCCAC CGCTCCTACG CACATTGTCT TCTGAAGATT TGGAAATCAT TTGGGCCCTA TACGATATGC TGGTAAAACC GCCTTGTTTC GACAAGCTAG ATAAGCTAGG GCAGCTATTC CTGTTCACCA GAGAAATCTT TCATCGGCTT GCAGCTATTA CAAATAAGAT AGAAAGCTCG ATTTCTACAA GGATACCAAT GCCGTCAATG CATGTTAAGA TCACTTCGAA GTCCGGTGGG AGTCAGAGTT GCGAAGAAAC TTACGTGGCC TCCTCCGGCT GCTTGGCTGC TCTTATGAGC CGCTCTCAAA ATAAAATTTT GGACTGCTGC CGTGCAAATG GACATAAGTT TGACTGGCCA TTCGCGAGAG ACCTTCGCAT TGCATTCTGG CTGCGGTCGG ATAGGGAGCT TGCCAAGATA TCGGAAGAGA TCGGCCAGAC TCTTTACAGG GCCAAGAAAG AGATAATGGA ATGCGCACTA TTCTTTGTGA TTGCGCGAAA GACGAGAACT CTCCGAAATT TAGCAGCAGC AGATTCGACC GATAGTGGGA GGAAGCTATT CCAGTTTTTG ACGTCTCATG ACTTCTCTGA CGAGCGAGGT CGACGAGCTG CAGAAAAAAA CGCATACAGC CTTCTTCGCA AAAACAGATA TGGCATTGCA TCCGCGTTTT TCCTCCTTGC CGAGCCCCCT TTTCTCAAGT CAGCTCTCGA AGTGATTGTG ACAAAAATGC GAGACTCAGA CCTTGCTTTC GCTGTTGCGA GGCTAACGGA GAGTTCTGTT GCCGCCCTAC AACCCAGTCT GAATGCATCT TCCGTAGGGC TTGGAGGCAT ACTTGGAGGT GGAGGCGGGT ATGCCATTCC ACCCGGATCA GACTCTTCGT TGGATGGTGA GGACGGAGAA CGTTTTGACG ATTGGCAACC AGATCTTGGA ATGTATTCTA AGAAGCTGTT GGTGGAACGA TTGCTTCCTC AAGCTGAGCG AGACAATGCT CTGAGCGCTA TCCAGCTATT ATGGCTTGGT AAAATTGAGG AAGCGTCTTG GTGGCTATCC GGGTTCATAG AACCCAGCCA TGATAGCGAC ACGGGTTATC GCTTAGTCAA CGATATCGAT CACCGTGTAT TCGGAACAAG GACTGATGCG AAAAGACAGT GTGCTGGATC AATGAAGCTG TCGAATGCAA TTGGCAAGGT AAACTTGCTT GTTGACTTCA TCTCGGCTCC GCTCTTGCTT GACGCCCTGG GTGGGAGTCA GCGAGCTCAA TTTGCTTCAT GTTTAGCAGT TTCGAGTGCG TTGATTGCGC GGGGAGTTGA GATGCCAACT ATTCGCTCTC TACTACATTT AGCTGGTTCT CAAATGCTTT TGAAGCATCC CATAACTTGT CTAAGTCCCA GAGAGGACGC GAAACACGGT CTGTCGCCCT CGTCGGCAGC TGTGAAGCCG TCGATATCCA AGTCATTAAG TTCTTCCCTC CAGCCAAAGC GAAGCTGCAA TTTAGCGGTT GCTGCTTCCT CGGGGGTTAT GTCGTCCTCT GTTTTTGATT CGTTTGATAC ACCTCCTACC CACAACTCGA AAGCGAAGGT TTCAGACCAT TTTGAGAGAT CAGAAACGGT GCAGTCTTCT ATTTTCGATG CCTTTGATGC TCCGCCTACC CAAAATTCGA ACCCGAAGGT ATCAACCCAG GAAGGCAGAT CCTATGGGAT GCAATCATCC ATATTTGACT CCTTTGATCC TCCTCCTCTC CAGAATATAA GTGTTACGGC AGGACGTGAT GAGACATCGG GGAACATAGA GTCGTCCATT TTTGACTCGT TTGATGTTCA TCCCGGACCG ACTGTGAAGG GATCTGCATG CGACGATCGA TCCGGAGACG TGCGGTGTAC TGCCATTTCA TTTGATGACA CAAGACAATT ATTCAGCGAA AATGATGCCC AAGAATCGAA GGTTTTATGT CCTGTTGACA ATAGACCAAC TATTGACAAT CAAGAGGCGA TCAAGAAGTC CGAAATAGCG ATTTCTCTTT CAATCTGTCG GAAAGGGATC CCAGATTTGT GGAGCGAATG GCGTTATAAC ATGCTTTTGT TATGCGCTGG GAGGCGCCTA CTGCGTGAAG TGGCGAGCGT CGTAGCTCAA TTTCACGGGG ACCCGCGGCC TCAGTCAATG GTGTCTTTTT ATCATAGCGA ACACCCGCTT GTTCCCTCAA GGGCATCTGA AGTACTGCAG CTGAGATGCG ACTCCGAAAA AATAATAGGA AGAGTCAAAC GGTCTCTGGA GCAGCTTTCC ACAGCAAGCG GGTTAGAAAC TCTTGTTGTT GTAAGTTGCT CAGTCCAGCT TTTGGGTCAT TCCCAACAAC GCCGTAGAAC ACTTTTTACT GTTATTCTTT ATGCCGCAGC GCAACAAGAT GACATGGCTG ATCTCATCGT TCGAGCCGCT GCTTCGGATC TCATTCAAAT GGCGAAGAGT ATGGTGTTTT GTAACGACGT TTTTGTACTC AAACAAAAAT CGCGTTCGCA CGCATCTACA CAGCATATTC GGCGACTAGC GGCAAGGTTG AGTTGGCAAC TAGAAATTTG CATGTGGCTG CAGCGGGGTG GAGGTCTTTA CTTATCTGCA AGTACAATCA AAGATGCAAT TGCCGCTGTG CGAGTTGGCA TTCTCATAGC CAGCTGGAAC CGCAACACCG AATGCTTGGA AGCCATGATT CGAAACCAAC CAGACTGTTC TCTCGACGAC GAAGCAGGAC ATCCGCTGTG GACAAATTTA AAAAGCTTTA CCGCTCCCGA GAGAGAAGTA AAAAAAAACA GAAAGATCTC AAGTGGTGGA TGGGAGTTTT TGGTAGACTG CAGAAGAACA GAGGCGACAA AAATGCTGCG ATGTAGACCT ACGGGGTGCT TCATTATGAG GCCACATCCA AACGACCATG GGGTTTTCAC GCTCAGTTTT AAAACTAACT TAGCGAGTGA GGAGGGAATG CCTAAGGATG GCGAACAGCA GCTATCTGAT CATGGGGAGT TGGTGAATAG GCCTGAGCAA CCTAGTTCTA CAGTAATGTC TTCACGGGCT TCGAAAAGAG ACGACATAGT TCAACATGCG GTTGTCAGGT TGTCTGAATC TGGATTTCGA TGTGGTTCTT TTGGTCCTTT CGCTAGCCTC ATCGGTCTCC TTGAAGCCGT GTCAGCATCG CTCCCTTTCA ATTTGAGGTT CGATCAACCG CCAGTTGACC GGGTAATTCA AGAAGGATTC CAGCCTTCTC CAAATGCAGT ATTTTTCCGA AAGCTAGCCC TCAGTCATGC GGATAGCGGA GCTTATCCCC AGCCAGCCAA AGTACAGAGT TGTCAAGTCC CGTCTTCTCC CAAGGAGGAG CGAACCTCTT CGACATTCCG AACAGGGAGC GAAGATACCA GATCCGAAAA GAAGATATCT TTTGCCTATT TTCTCGAGCT CACGACTCTG AGCAAAATTC ATCGGCAATT AAGTGCCGTT GTATCAGTTC ACGTTGACGA GGCTCATCAG TCTGATGGGG AGGTCATGAC GACAGACGAA AGCTACGAAG CAGCGTCGGA TAGTGTTTTA TCGCTAATGA CTTTTCATGG AGTCTCCACT CCGTACGCTT CTTCGCGGAG GCTTCTTAGT CCTCTGATAT GTTGGGGCCG TTCTTTGGAA ATTCTTTCCG TGGAGTATGT AGCCCCGGAA TTGAATGGCT CTTTCGATCC CAGATTGCCA GTGGATTTTG AGGAGTCGGC CGAAGAAATT GAAATATTTC AGCACGAATC CGCAACTACG GTCGAGCAAG GGGACGCAGT ACTTCGACAA ATGATCAAAC GAGGATCTGG AGTCGATTTC AGTACGTTAC GCTTAAGTGA CGGTGGTGAT TGCACTATGG TTGTGGTTTT TGGAAAGAAG GAAGGAATCG AGTGGTTGCT TTCCAGCGGA TTAGAAGCTA GTGAAACCGG CGCTTTGAGA AGATTGAAGA TAATGGAGCA TGAGAATATT ATCGAACCAA TTGAAATGCA GCGACTCCCT CTGAAACATA AGGTACCTGA GCACGGAGAA AGTGATGTGA GATACAGGAT TGTTGACCCC TGGGAAGTGG AAGCGCTTCC GAATCGTGAA GGCGAGACTC GAGGAGCCTC TCTTGGCAGA GAAAGCTTCT CAAGGTTTAG TATAGGGCAG GTGGCACTGT CATCAGAGAG TACCCTTCGT GATATTGGCG GGCATTCGCT TTTAGAGCTG TGGACCAGCA CGAAAGGAGG AGTGCTTTTG ACAAAAGCGT TAGCTTCGAT AGATACACCT TGGGAACGTG CCGGGGCAGG AGATTTGCTT CCAGGAAATG GAATAATGAC CAGTGTTCCT CCGTACTTAA ATAGCATCAG GCAGCACTTG TACAGGAACG CGCTGTTTCG CCGCCTCGAC CTTCCGCAGC GTTTTGTTGC ACTAATGCAA GTGGAGTTGC TCGACCTAAA AAACTTGTCG TCTCCCGGTG GCTCAGTTTC CCTCTCAACT TACGCGCTGT TGCGCCTGAA ACGTGACGGC AACAGGGCGG GCTTAACCAA CAAAACACGG ACACTTGACA CAGCAAAGAC ACATGCCACC AAGCTTGGGA AATCTTCTGG TCCAAATGCC CCGGCGTCCT GGGGAAGCGT CGTTCGATTC CGTTTCCCCC TCCCTGAAGA TGTTTCTGTG GAAGGGTCAA GTCTCGACGA TGATCGAGAA ACACTATTCA AAGGACCCCC GTGTGTACTT CAGATTACAG TGTACGAAAA AAAGCTTATC GTGGATAACT CACTGGGAAC TGCAGATATT CGCACAGACG GCCTTTGGAG CGGAGGCCAG CTTGAGGAGT GGGTGCCTCT GCGCTCAGAT AAGCAAACCA TTACATGGTT TGCTCGAATC AGATTGACAC TAAGGTATGA ATTAATGTGC CATGCGTGCG ACGCAGTATC CATGGACACT ATAAGCGGTA CCTCAAGTGT TGGACTCCGG AGGATGGAGG AACTCATCCG TGCAGGTGCT TCGGCGCACG AAGATAACAA GCAAGCTACA AGCTCCCCGG ATTTGTTGAC ATACTTTGAG AGCATGGTGT ACTAAGAATT TGCCGCTATC TTATCAAAAT TGTAGACTAT ATTTAAAGCA CTAAAAAATA AGCCTCAACC AGTCCGCAGT
|
Protein sequence | MVRLLVVSIT PKESTHWTIS FHTFDLFKTE IRRTTNLIVR ACSVQLIAVS SSEDENEIAV WSARPHPGCL LEKGGGKEHV QDISWDYFVF LLRVSPNSVG TTSDKTKIID FSFLRSGFLD ASPSLICYSG KSVSIYQRCG GGRDWEQRTE IYYSNSPTSS STTMTASAFG STAPFGVYSN PLDVFPHILP ALQCAFASSD ESTFLRSDWH PDSFLAEICL DDRGVKAALD DRLRSLFLWL SSVGKDLSDE DLEGPLRVAP LLIVEENFSE DSDGQDQPRA RLTLLTGPAT DQTLPSQNVA REVSKLKSFR DILLCSIDQA IPESSSTVQS LDCDLGIKKE ELPPLLRTLS SEDLEIIWAL YDMLVKPPCF DKLDKLGQLF LFTREIFHRL AAITNKIESS ISTRIPMPSM HVKITSKSGG SQSCEETYVA SSGCLAALMS RSQNKILDCC RANGHKFDWP FARDLRIAFW LRSDRELAKI SEEIGQTLYR AKKEIMECAL FFVIARKTRT LRNLAAADST DSGRKLFQFL TSHDFSDERG RRAAEKNAYS LLRKNRYGIA SAFFLLAEPP FLKSALEVIV TKMRDSDLAF AVARLTESSV AALQPSLNAS SVGLGGILGG GGGYAIPPGS DSSLDGEDGE RFDDWQPDLG MYSKKLLVER LLPQAERDNA LSAIQLLWLG KIEEASWWLS GFIEPSHDSD TGYRLVNDID HRVFGTRTDA KRQCAGSMKL SNAIGKVNLL VDFISAPLLL DALGGSQRAQ FASCLAVSSA LIARGVEMPT IRSLLHLAGS QMLLKHPITC LSPREDAKHG LSPSSAAVKP SISKSLSSSL QPKRSCNLAV AASSGVMSSS VFDSFDTPPT HNSKAKVSDH FERSETVQSS IFDAFDAPPT QNSNPKVSTQ EGRSYGMQSS IFDSFDPPPL QNISVTAGRD ETSGNIESSI FDSFDVHPGP TVKGSACDDR SGDVRCTAIS FDDTRQLFSE NDAQESKVLC PVDNRPTIDN QEAIKKSEIA ISLSICRKGI PDLWSEWRYN MLLLCAGRRL LREVASVVAQ FHGDPRPQSM VSFYHSEHPL VPSRASEVLQ LRCDSEKIIG RVKRSLEQLS TASGLETLVV VSCSVQLLGH SQQRRRTLFT VILYAAAQQD DMADLIVRAA ASDLIQMAKS MVFCNDVFVL KQKSRSHAST QHIRRLAARL SWQLEICMWL QRGGGLYLSA STIKDAIAAV RVGILIASWN RNTECLEAMI RNQPDCSLDD EAGHPLWTNL KSFTAPEREV KKNRKISSGG WEFLVDCRRT EATKMLRCRP TGCFIMRPHP NDHGVFTLSF KTNLASEEGM PKDGEQQLSD HGELVNRPEQ PSSTVMSSRA SKRDDIVQHA VVRLSESGFR CGSFGPFASL IGLLEAVSAS LPFNLRFDQP PVDRVIQEGF QPSPNAVFFR KLALSHADSG AYPQPAKVQS CQVPSSPKEE RTSSTFRTGS EDTRSEKKIS FAYFLELTTL SKIHRQLSAV VSVHVDEAHQ SDGEVMTTDE SYEAASDSVL SLMTFHGVST PYASSRRLLS PLICWGRSLE ILSVEYVAPE LNGSFDPRLP VDFEESAEEI EIFQHESATT VEQGDAVLRQ MIKRGSGVDF STLRLSDGGD CTMVVVFGKK EGIEWLLSSG LEASETGALR RLKIMEHENI IEPIEMQRLP LKHKVPEHGE SDVRYRIVDP WEVEALPNRE GETRGASLGR ESFSRFSIGQ VALSSESTLR DIGGHSLLEL WTSTKGGVLL TKALASIDTP WERAGAGDLL PGNGIMTSVP PYLNSIRQHL YRNALFRRLD LPQRFVALMQ VELLDLKNLS SPGGSVSLST YALLRLKRDG NRAGLTNKTR TLDTAKTHAT KLGKSSGPNA PASWGSVVRF RFPLPEDVSV EGSSLDDDRE TLFKGPPCVL QITVYEKKLI VDNSLGTADI RTDGLWSGGQ LEEWVPLRSD KQTITWFARI RLTLRYELMC HACDAVSMDT ISGTSSVGLR RMEELIRAGA SAHEDNKQAT SSPDLLTYFE SMVY
|
| |