Gene Information |
Locus tag | PHATRDRAFT_38156 |
Symbol | |
ID | 7202976 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 209043 |
End bp | 212377 |
Gene Length | 3335 bp |
Protein Length | 1023 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182178 |
Protein GI | 219123743 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
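The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the coding region includes one stop codon. The helper names `gene_length` and `coding_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases implied by a protein length (3 bp per codon).

    Assumption: the annotated CDS ends with a single stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
span = gene_length(209043, 212377)   # genomic span of the gene
cds = coding_length(1023)            # bases needed to encode 1023 aa plus a stop
non_coding = span - cds              # remainder, attributable to introns if the assumptions hold

print(span, cds, non_coding)
```

Under these assumptions the span agrees with the listed gene length of 3335 bp, and the remainder not accounted for by coding bases is consistent with the gene being marked as spliced.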
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00847311 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATATC GTCGCTTTTG TCTTACCAAT GGCACAGCAG ACGTCGACTC GACTGGCGGA TCTCGGTCCC GTTTCGTTTG GAACTACGGA AGCAAATATG TTGATCCAAC TCCTAGTAAG GACAACGTCT CACCAGAAGT TCAATCGCCT ACTGCTAAAC GCCAAGTCGC TTCCGTCCCA CACACAACGA ATCACAGCCC CACAGTCCCT ACGAATATCC AAGAGTACAA TGTAGACTCA TAATCTTCGT AGTTCATCGT TTTCCTCAAC TGTGATCTTC ATCGTCGTTT TGTGTTCTTC AATAGAACTA CGAAGCAACC CCACCGTCTT TACGCTCCCA GCCCTGCCCT ACCGCCCATC CGCCATCTCT TCGGTCCCGA TGTCGACCTC TGCTCACTTC AAACTGAGCG ACTTTCCTCA CAAAGTCCTG GATCCGATTG CCACCCTCAC CGTCCCACCG ACTTACGCGA CCATCAAACA TGCCCAACGT CAGCTCATGA CCAACGCAGC CGCCATCCCC ACGCTCAACG GTGGCGGCGC CCACGGCCAC ATGGCCTTAA CCCTCACCGC TCTCGCGTAC GCCGACATCA GCGACGTCCC GTTCGTCATT CCCGTCGCTC CCCCTGCCAA TCCGCCCCCC GGCGCCACGC AACCCCAAAT CACCGAGAAT AACCGCGTTC ACCAACGCGA CGCTGACATT TACAACCTTT ATGTCGCTGT TAACAACGCT CTCCGCCAAC AGCTTCTCGA TGCGATTCCC CGCATCTACG TACGCGCCCT CGCGCATCCC ATGTTCGAGT TCAGCAACGT CACGTGCCTT GACTTGCTTT CGCACCTCTG GACCAAATAC GGTACAATCA AGCCCGCTGA GCTCCAGAAA AATTTCCAGT CAATGTTCAC CCCGTGGAAT ACAACAGAAC CGATTGAATC CGTTTTTCTC CAGCTCGACG AGGCCATCGC CTTCTCCGTC GACGGTAACG ACCCCATCTC CGAAGCTGCC GCCGTACGAG CCGGCTATGA AGTCATCGCG CACTCTGGCC TGCTCCTCCT CGACTGCAAA GAATGGCGCA AATTACCCCT TGCTTCTCAC ACCCTTGCCA ACTTTCAGCA GCACTTTTCC CTTGCCGACG ACGACCGGCG CCTTACGGCC ACCACTGGTT CCCTCGGCTA TGCCAACGTT CTCGCTGCAA CCCCCTCTCT GACTCCAGCC ACGGTTTCCG ACACCCTCAG CCTTCCCTTC TCCGCGCTCT CTGTGTCACA GACTTCCGTC TCCTCTCCGG ATATGACCTA TTGCTGGACC CATGGGACCA GCAAGAACCG ACGCCATACG AGCGCCACGT GCAAGAACAA GGCCCCTGGC CATCGCGACG ACGCGACCGC CACCAACACT CTCGGCGGAT CCACCAAGGT TTGGACCGCT CCCAAGCCCC CTGAATAGGA AAGAGGGACG GCTACGCCGA TGGTTAACTC TAGTAATACC GATTATTTAA ATCATATTAC TAGTCTTAAT TCATCTGTAG CCCCCTCCCC GCCTAGTTCC CATACCTCGG CCATTGCCGA CACCGGTTGC ACCGGCCATT ACATCACCGT CAACTGCCCC CACACCCACA AACTTCCTGC ACGCCCCAGC CTTGCCGTCC GTGTCCCTAA CGGCGCCGTC CTCCGCTCAA GCCACATTGC CACCCTGGCC CTCCCTGGCT TCTCCCCTTC TGCTTGCCAG GCCCACATCT TCCCCGGGCT TACCTCGCAC CCACTCATTT CGATTGGACA ACTTTGTGAC GACGGCTGCA CTGCCACTTT 
CTCAGCCACT CGCCTCGAGA TCCACCGCGA CACTACACTA CTCCTCTCCG GCACTCGTGC ACCCACTACC GGCCTCTGGC ACCTTGATCT TACCCCTGCC AAGCCTCCTG CCACAGCCCA CGCTCTAGTT CCCAACACTC CCCTCGCTGA CCGCATCGCT TTTGTTCATG CCTCGCTCTT CTCCCCGGCG ATCTCCACAT GGTGCCAGGC CCTCGACTCC GGCCATCTTG CAACCTTTCC TGAACTTTCC TCCCGCCAGG TCCGCAAGTA TCCACCTCGT TCCCCCGCCA TGGTCAAGGG CCACCTCGAC CAACAACGCG CAAACCTTCG ATCCACCAAG CTTCCCCCTG TCGGTTCCCC CATCACGACG GCACCCCCTG CCGCCGCTGT GCCCGACCTT GACCCTCCCG ACGCCCACCC CGTCACACGC ACGCACCATG TCTTTGCTGC TCACCAGCGC GTCACCGGCC AAATATACAC GGACCAACCT GGCCGTTTCC TCACTCCTTC AAGTTCAGGC CACAACGACA TGCTTGTTCT TTATGATTAC GACAGCAACG CTATCCACGT CGAACTCATG AAGAACAAGT CCGGCCCCGA GATTCTGGCC GCTTATAAAC GCGCTCATGC TCTTTTCACC CAGCGAGGCC TCCGTCCCCA ACTCCAGCGG CTTGACAACG AAGCCTCTGC CGCCCTCCAG TCCTTCATGA CCTCAGAGCA CGTTGACTTT CAGCTGGCAC CCCCCCATCT ACACCGTCGT AATGCAGCCG AACGGGCCAT CCGCACCTTC AAGAACCACT TTATCGCTGG CCTATGCACC ACTAACCCGG ATTTTCCATT GCACCTTTGG GACCGCCTCC TCCCACAGGC CCTTATCACC CTCAATCTTC TTCGTCGCTC CCGCATCAAT CCTAAGCTGT CCGCCCACGC CCAGCTTCAT GGTGCTTTCG ACTACAACCG CACCCCGCTT GCTCCACCTG GCACTCGCGT CTTAGTTCAT GTCAAGCCGT CCGTCCGCGA AACTTGGGCC CCCCATGCTG TTGAAGGTTG GTACCTTGGC CCCGCCCTGC ACCATTACCG TTGCCACCGC GTCTGGGTCA CAGAAACACG TGCCGAACGC GTTGCTGACA CCCTTTCCTG GTTCCCGACC CGCATTCCCA TGCCCGCAGC TTCGTCCACC GACCGCGCCC TGGCCGCCGC CCGCGACCTA GTCCATGCCC TCCAGAATCC TTCCCCTTCG TCTCCGTTCG CCCCCCTCGA TGCCACCCAG CACCAGGCAC TCACAGATCT TGCCACCCTC TTTGCCACCG TGGCCACCCC GACCGACGAT CCCCCTGCCC CCGCAACTCC CCTTGCTCAG GTCCGTTTTG CCGTTCCTCT TGTCACGGCC GAACATGCCC CGGCACTTCC GAGGGTGCCC ATTCCGGCCC CAGCACTTCC GAGGGTGCCC ACCATGGCCA CCTATCACTC TCGCACCGGT AACCCAGGCC GTCGCCGCCG CAAAGCACGC AAACAACCGG CAACCCCAAC CCTAG
|
Protein sequence | MAYRRFCLTN GTADVDSTGG SRSRFVWNYG SKYVDPTPKL RSNPTVFTLP ALPYRPSAIS SVPMSTSAHF KLSDFPHKVL DPIATLTVPP TYATIKHAQR QLMTNAAAIP TLNGGGAHGH MALTLTALAY ADISDVPFVI PVAPPANPPP GATQPQITEN NRVHQRDADI YNLYVAVNNA LRQQLLDAIP RIYVRALAHP MFEFSNVTCL DLLSHLWTKY GTIKPAELQK NFQSMFTPWN TTEPIESVFL QLDEAIAFSV DGNDPISEAA AVRAGYEVIA HSGLLLLDCK EWRKLPLASH TLANFQQHFS LADDDRRLTA TTGSLGYANV LAATPSLTPA TVSDTLSLPF SALSVSQTSV SSPDMTYCWT HGTSKNRRHT SATCKNKAPG HRDDATATNT LGGSTKERGT ATPMVNSSNT DYLNHITSLN SSVAPSPPSS HTSAIADTGC TGHYITVNCP HTHKLPARPS LAVRVPNGAV LRSSHIATLA LPGFSPSACQ AHIFPGLTSH PLISIGQLCD DGCTATFSAT RLEIHRDTTL LLSGTRAPTT GLWHLDLTPA KPPATAHALV PNTPLADRIA FVHASLFSPA ISTWCQALDS GHLATFPELS SRQVRKYPPR SPAMVKGHLD QQRANLRSTK LPPVGSPITT APPAAAVPDL DPPDAHPVTR THHVFAAHQR VTGQIYTDQP GRFLTPSSSG HNDMLVLYDY DSNAIHVELM KNKSGPEILA AYKRAHALFT QRGLRPQLQR LDNEASAALQ SFMTSEHVDF QLAPPHLHRR NAAERAIRTF KNHFIAGLCT TNPDFPLHLW DRLLPQALIT LNLLRRSRIN PKLSAHAQLH GAFDYNRTPL APPGTRVLVH VKPSVRETWA PHAVEGWYLG PALHHYRCHR VWVTETRAER VADTLSWFPT RIPMPAASST DRALAAARDL VHALQNPSPS SPFAPLDATQ HQALTDLATL FATVATPTDD PPAPATPLAQ VRFAVPLVTA EHAPALPRVP IPAPALPRAV AAAKHANNRQ PQP
|
| |