Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49631 |
Symbol | |
ID | 7198288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 255754 |
End bp | 259971 |
Gene Length | 4218 bp |
Protein Length | 1314 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184430 |
Protein GI | 219128459 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTGGT ACGACCCCGG GATCGCCGCA CGGGCCTCGG ATCCGGAGTT CACGATGGCC GATGCGGATT TGGCACTGGT TGCCGTACGT ACTCGCCGGT TTGGCCACGA TACTGGTCGG ATGGAATCTG TACACGGGGC ATCAGGTGGT TGGACGACAA CCCCCGTTCC CGCCGGACTT TGCCGTCTTG ACGCCACCGG TACAACCCAC ACGAGAGTGG TTCGTTCCTC TCGATACGCA CTCGCTCCCC TCCACTACTA GTACTACTGC TGCTAGTAGT AGCAGTACTA AGACTGCTGC TACCACCGTG GGGTTGGACA ACGCCATTGC ACGACAGCAA GCTCTACCGA TAGTATTGTC GGAAACGAAT CCTGTCGGAA CCAATAGCAC TCTCGAGTCC ACGGACGACC CCAACGTCCT TCCCGATCGC GTTCCCACAC ACACTGCGTG CGACGGCTAC GACGGCGTCT ACCACATTGC CATGGGCGAC ATTGGTGGTG CCGGAGGTAC CATTTTCTTT CAATTCGTTC TCGGACAGGC TCTGTACGCG CATCGGCACA ATCTGCAACC CTGGGTCTTT CTCAATAACG TCAGTCACAT TGTTTACGAC CCGCTCGTGC ACGGTGACCC GACGGCACCG GGAGTCTCCC TTACGGCCCA GGTGGGACGC AACGCCACGT ACCAGTACCG ACCCATTCGG CACCGACGAG ATACCTACCC CGGTCCCCCC GACGCCGACG TCCCGGTACA ATCCCAAACA CTGTCGTTCG AAGGAACGGG TGTGTGGGAA CACTACTTTG AACCCCTCTC GGAATTCGTA CCGGGCGATC GTTCCTGCGT AGACAAACTC TACGTCACCA TGGATCTTTA CTTGATCACT CCCGGCCTAC ACGGATTCAC GTCCTGGGCA CCGCGTTGTT GGCGCTATCA GTATCTCCCG GATTACATTA CCAAGCCGCA CATTCCGATT ACCGAATGGT TGGAGCCGCA GCGGGCCATG GGCCACGATG CGGTACAACG ATTCCTGAAA CCACGGTCGT ACCTACTGCA CGCTGCTCAC CAAGCCAATC CGGACTGCTC CGTACGGAAT GCCTGTTTGG GCCTGCACAT TCGCCATTCC GACAAGGCCG CGGGACGGCG CGTCGTGGCG ACGGCGGAAT TCCTGCCCTA CGCCCAAGCC TTTGTCCACG CCGGTGGACA ACACGTCTAC GTGGCCACCG ACAGTACCCA CGTGCTGCAG GAAATCGAAC AATCCTGGCC CGCCAGTGTC CGGACTCGGA TACGAAATAT GGGCAACACC GTCGTGCGTT CCAGCGACGA AAAGGCCGTT TTTGATATCG CCTCACATCA TCGGACCAAT CAAGAAATTA TCGTTGAAAT ATACGCGCTG AGCCAGTGCC AATTTCTGGT GCACGGTTTC AGTGCCGTAT CCGAATCGGC GATCTGGATC AACCTGAATT TGCACGTACA GTCCGTCAAC TTGGAAGATA CGGAGCGAAT TCAACCGCCA ACCTTTGGAA CGATGGTACA AATGGTTCTC CGTGGCGAAG AGCAGAGTCA TTTGCCTAGG CCAATGCGTA CGGAGGAGTG GTGGAATACA GAAGGAGAAT CAACCGAACC CCATCGTGTC GTACACACAG CATGTGACGG GTACGATGGG ATCCTCCATA TTTCCGCCGT CGGGAATGAT CAGAGTGCTG GTGGGGCATT CTTCACCTCG GTCTTGAACC AGCTGATCTA TGCAGAGCAA CACAATCTCA AACCATGGGT TCACTTGGAC CCGAATGCGT CCACGTTAGT CTATGACGAA TCTGTACATA ACAAAACACC ACCGTTCGCC CCCTTCGAAA TGATGCGTGG AATAGAGATA GCAGTGATCG AAGGACCGGA GATGGTACCC ACTAGAGCTG GGGACGGAGG CAATCCTCCG GAGCTGTACC AAATGAAACA GTACTATCCG GGTGCTCCGG ACCTACCCGA AAATCAGACG CTCAAGCTGC TACGCATGAG CTTTCCGGGT ATGGGTGTTT GGGAAAGCTA TTTCTTGCCC GTCTCAGACT TCGTGCCAGG CGACGTGTCT TGCAGGGTGA AACCCATCCT CACAATGAAC GAAAGGATGG TTGATCCTGG GTTGATATCC TTCAGTCCAC AGTCCGTTCG GGCGTGGCAG TACAACAGTG TCAGCGATGG GCTTTGGAAT CCTGACGGAA AGTCGATGAA AGACTGGTAC AAGTCTATGC GAGAAAAGGG TGCCGAGTTG GTGAAACGAT ACTATCGATT CCAGCCGCAT ATTGTTCGAA AGGCCACAGA AGTGAATCCT GTGGAATCCG GCAAGCCGTG TCTCGCAGTC CATCTAAGAA AAGCTGACAA GCATGGCCAT CATCGCGCGC CAGTGTCGAC TAAGAGGTTT CTAGAATACA TCAGAGCCTT TCTAGATGTC GGGGGTAGGC ATGTGTACAT TGCATCCGAC TCGCACAGGG CGATGCAATA TGTCAAGAAG AAACTTTTGC TAGATGACAA CACTGTCATC CACTCGCAAG GGGAGTATGT GGTACGCTCG TACAAGGCAT GGCCGGCTCA TTTCATGGAA AGTCATCATC GTGTAAATAG CGAGGCTCTG GTAGACGCAC TTGCGATGTC CAAGTGCGAC TTGCTACTTC ATGGACATAG CACTCTATCG GAGGCGGCGA TCTATCTCAA CCCTTTACTT CACGACAGGA GTGTCAACTT AGAAGACCCA GATCGTGTAA CCCCAGAGGA GTTTGGGAGC ATGTTCAAGC ATATACTGGT AGAAACCAGC GTACAAAGTG ACGTCGAAGA GCAGCGGTAT GCTGCCACGG AAGGAGATGA CGTAGAAGGT CAAGGTTCCG TCATTAGCAT TGACCTACAA GGATCGTTGA TTGTCTCTAC GCTTTCAAAA CGCCGGTGTC GGACGAATGC AGTCATCTAT CTCGCCCAAA AGACTCATTC AAGCTACGGA CGCGACAGCT ACGGTAATTT GCTGATGTCT TTGGATCTCC TACACGACAA TTATTTGTCT ATCAACGACC ACCTTGACAA TACCGATATC TATATATTTC ATACCGGAGA CTTCAACAGT ACCGACTTGC AAGCCTTGGA GAAACGATAT GGTTCGTCGT ATCAAGGTGC ACTTCATCTA GTAGATTTAT CTAACTCAAC TTTCTGGGCG CGACCATCCC ATAACCTCAA TGATGACCCC ACAACGTGGT ACGCCTACCC TCTGTTCTCG GAGGGCTATC GTCGGATGAT GCACTGGTAT GCAATTGATG TATGGGAGTT TTTCGCCAAG TGGAACAAGA AAACTGGTTG TCGTTATCGA TACTTGTTTC GTTTAGACGA AGATTCTTTC ATTCACTCGG CTATTCGATA CGATATTTTT GACTTTGTGC GCTCGAACAA CTACGTTTAC GGGTATCGCA TGTGCGCATA CGAAATGGCA GTAACGCGAA GGATGTGGAC CATGTGGAGA AACCGTCACA CAAAGTTTGT CCCACAAAGA CAAATCAAGC CTGAATTGTG TGGCTTTTAC AATAATATGT TTGTGGCCGA TCTCGAATTC TTTCTATCCC AGGATGTACA AGCTTTCCTT CAATTCATTG ATCGTCAAGG TCACATATAC CGACGACGAT TGGGTGATTT AATGATCCAC TCGATGGCTG TATACGGGTT TGCTGAGAAG AATCAAATAC ACCGGTTTCT GGATTTTACG TACGAACACG TGACAGTCAA TCAAACGTCG GGTTGCTTAG CGTGGGGTGG TATACAGGCG GGCTATGACG ACCCTCTTGC TATTGACACG CTCAATACAT ACTATCAGAG CCGTTTGGTT GACAATGGAT GTGCGGGCAA TGCCACCTTT TTATATGCAG AGGACTTGTC TCCTACGTAT GCACATTTTT CAGGAACAAC GCGCAGAAAA CTGCCTTTGT ATACCATCAC CGCTGGACGG ATTGAGACGC CAGCAATGGG TATTCTTTCT GGCTAACTGT AAGCATTGTT TTCTGTTATG CAAACAAAGT AACATTAGCA CAGAATGGAC ACTACTTTCT ACCTGACCTG CAAAAGTTGC CATGCTGAAG GAAACCCGTG CTATCAGGTA TCACATTGCA ACAAATACTA CGAGATGGTC AGGTCATTAG GAAACGCTGG TTTCAGGAAT ATCCTTCATA GTCTACATCT CTATGAGATT GAACCGTC
|
Protein sequence | MDWYDPGIAA RASDPEFTMA DADLALVAVV GRQPPFPPDF AVLTPPVQPT REWFVPLDTH SLPSTTSTTA ASSSSTKTAA TTVGLDNAIA RQQALPIVLS ETNPVGTNST LESTDDPNVL PDRVPTHTAC DGYDGVYHIA MGDIGGAGGT IFFQFVLGQA LYAHRHNLQP WVFLNNVSHI VYDPLVHGDP TAPGVSLTAQ VGRNATYQYR PIRHRRDTYP GPPDADVPVQ SQTLSFEGTG VWEHYFEPLS EFVPGDRSCV DKLYVTMDLY LITPGLHGFT SWAPRCWRYQ YLPDYITKPH IPITEWLEPQ RAMGHDAVQR FLKPRSYLLH AAHQANPDCS VRNACLGLHI RHSDKAAGRR VVATAEFLPY AQAFVHAGGQ HVYVATDSTH VLQEIEQSWP ASVRTRIRNM GNTVVRSSDE KAVFDIASHH RTNQEIIVEI YALSQCQFLV HGFSAVSESA IWINLNLHVQ SVNLEDTERI QPPTFGTMVQ MVLRGEEQSH LPRPMRTEEW WNTEGESTEP HRVVHTACDG YDGILHISAV GNDQSAGGAF FTSVLNQLIY AEQHNLKPWV HLDPNASTLV YDESVHNKTP PFAPFEMMRG IEIAVIEGPE MVPTRAGDGG NPPELYQMKQ YYPGAPDLPE NQTLKLLRMS FPGMGVWESY FLPVSDFVPG DVSCRVKPIL TMNERMVDPG LISFSPQSVR AWQYNSVSDG LWNPDGKSMK DWYKSMREKG AELVKRYYRF QPHIVRKATE VNPVESGKPC LAVHLRKADK HGHHRAPVST KRFLEYIRAF LDVGGRHVYI ASDSHRAMQY VKKKLLLDDN TVIHSQGEYV VRSYKAWPAH FMESHHRVNS EALVDALAMS KCDLLLHGHS TLSEAAIYLN PLLHDRSVNL EDPDRVTPEE FGSMFKHILV ETSVQSDVEE QRYAATEGDD VEGQGSVISI DLQGSLIVST LSKRRCRTNA VIYLAQKTHS SYGRDSYGNL LMSLDLLHDN YLSINDHLDN TDIYIFHTGD FNSTDLQALE KRYGSSYQGA LHLVDLSNST FWARPSHNLN DDPTTWYAYP LFSEGYRRMM HWYAIDVWEF FAKWNKKTGC RYRYLFRLDE DSFIHSAIRY DIFDFVRSNN YVYGYRMCAY EMAVTRRMWT MWRNRHTKFV PQRQIKPELC GFYNNMFVAD LEFFLSQDVQ AFLQFIDRQG HIYRRRLGDL MIHSMAVYGF AEKNQIHRFL DFTYEHVTVN QTSGCLAWGG IQAGYDDPLA IDTLNTYYQS RLVDNGCAGN ATFLYAEDLS PTYAHFSGTT RRKLPLYTIT AGRIETPAMG ILSG
|
| |