Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_10151 |
Symbol | |
ID | 7197480 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 951042 |
End bp | 955020 |
Gene Length | 3979 bp |
Protein Length | 1127 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177726 |
Protein GI | 219111949 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00773421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTG GTGTTGTTAT GCGGGCGGCG GTGATCAACG TTCTTTACGA GCACTCATTG AAATTGACAC CCAGGGGACG AGCTGGGCTG ACTTCAGGAG AAGTTACCAA CCTGATAGCC GTCGATACAC AAAAACTGTA TGAAGTTGCT CAAGAAGGCC ATTTGATTTG GGCTCTTCCT CTCTCGATTA CGCTTGTAAC CGTCTTCTTG ATACTGATTC TTGGTCCGAT TACTCTGATC GGTATAGCGG TACTGATACT CTTCGTGCCA TTAGTAGAAA GAGTGACGTC AAGAATGCTA AAGATACGAC AACGAAGGGC AAAGATGACT GACCAGCGCG TTGAAATTGT TAGCACCATG CTTCAAGGGG TAAGTGCCTT GAGATTTTTG TGAGAGAAGT ACCCATACAT CATCCAACAC TTCATTATCC AGATTAAGGT CACGAAGCTG AGCAATCTAG AAGAAAGCTA CGAAACAAGA GTAGCCGAAG CTCGTCAGCT CGAGCTTAAT GAGCTTCGTA AAGAATTGGC TGTATGGGCA CTGACTCTCG TTATCACGTA AGTTCTGATG GACGCTCGTG ATTTCTCTTG AGTAAGATGC TGACATCTGT TTGTTTGGTG TCTATTTAGC GTATCCTCTC CAGTAGTAGC AAGTTCGGCG ACATTTGCTG CGTACGTCTT GGTTGACGAG CGCAATGTTC TTACCGCGGC AGAAACTTTT TCCGTTTTGT TGCTGTTCGG CGCTCTCCGA TTTCCAATAA ACTATGCAGG CCGGCTTGCT GGAAGTAAGT AACGCCAAAA GATATTGATT TTGAGAGCAT GACATAGCTT ACAATTTCAT TCATTCTCAT GGCAGAGATG GTGCAGGCCC TATCCGCGAT TACTCGCATT AACTCATTCT TTGAACGGGA AACGAGAGAC GTCGACTTTT CTCTTGTGCC GTCAAATAAC GTTGCTTTGT CTGAGTCATC GGACATACCC CTTATTCTTT CTAGAGCGGC ATTTTTCTTG CAGCCCGCCG ATGAATGTGT GCGTAACGGA CAAGACAACG GAAAGAAGAA CATACACAAG GGAAGCTTTG AGCTCAGCGC GGCGTCGTTC AAAGTCTCGA CATTCGATTT TACAATTCGG AAGGGCGAAG TCATTGCCAT TTGTGGCCCT GTCGGTTCTG GGAAATCAAC GCTTATTCAT GGTATATTGG ACGAAGTCCC GTCCATTGAA GGCACGGAAG TTTCCAGATA TGGACGAACA GCCTTTGTTC CTCAAACACC GTTCATTTTA AACACAACTC TAAGAGAAAA TATTCTGTTT GGGTTGCCTT TCGAAAGTTC CGTTTACGAG CGAGTTCTTG ACGTATGCTG TTTGCGACAA GATATTCAAC AGCTGGGAGA ATCAAAGGAT CATACCGAGA TTGGGGAACG TGGGGTGACT CTTTCAGGTG GACAAAAGCA AAGAGTTTCG CTAGCTCGTG CAGCTTACGC AAGGCCCGAT TTAGTTCTTC TTGACGATCC GCTGTCAGCT CTGGATGCGG GAACTGCCAA ACTTGTGTTC GAACGCCTAA TCAAGTCGAC TGGGTCTTAC TTCTCGGATA CTGCCGTTGT TCTTGTGACT CATGCCTCGC ATTTTCTGAA CCGAGTAGAC AAGGCACTTA TCATCGTTGG AGGCAAGAAT GAGTTTTATG GGAGTTGGAA TGATCTTGCT ACCTACCATG CAAACGACTT CGAAACAAAT GTAGCCATTG ATTTCTTGCG TACTTCCGTT CAAGAAGTTG CGAGTGAGAG CACCGATAGC GCGGACCAAA ACAAGGATGA AAAACTCCTG TGTAAGCAAG TGGACGTGAA GGATACTTTG ATGGCAGCAG AAGAGCGAGA ACATGGTCTT TCCAGTCTTA GTGTTTGGCT CCTATGGTTC AAGCGCGCTG GAGGCTTTTA TTTCATTTTC TTTCAAGTTC TTTTCATGGG TATCGATCGT TTTTCTTACG TTGCTACGGA ATATTGGCTT GCAAGATGGA CGCAATCTGC GGATAAGCCA ATCAGCGTGT TTGGTGTATC CTTTCCATCC CAAGAAGAGG GTCGCACGGC TCAATTCGAT TATCTCAAGG TCTACAGTAG TCTCGTACTT GTATCAGTTT CAACTACGAT TCTAAGGTAT GCGTACTGAT TTTTTACGTG CGATGCCTAT TATGAATAAT CTGCTCACAA AATGAATTCA TGGTATGAAC AGATCTGAAT GGAGCGGTAA GCTTTTTGCT TTTGTGAGAT CAAGCGAGTG GACTGTTGAA GTAATAGGCT GGATTGCTTT CTCAAACTAT TCGATATGTT CTTACCTCTA CAGTTACCGG TGGAACTCGG GCTGCCAAAC ATGTATTCTC TTCCATGGTT TACAGCGTGC TACGGGCACC CATGTCGTAT TTTGATACCA CTCCGATGGG GAGGATTCTG AACCGATTTA CATACGACAT GGATGTGGTA GACATTTTGC TGACCCAGTC CATGAGCATG TTCATGATAT CATGTAGCTG GTATTTTGCT GGAGTTATTG TAATGTGCAC AATTCTTCCT TGGATAGCGT TGGCAATCTT TCCCGTTACA GTGATTTATT GGGTGCTGAT GCTGCATTAT CGAAAATCAG GATCAGATCT ACAACGTTTG GATGCTGTGT CACGTTCTCC TATCCAAGCG ATGATATCAG AAGGTAAAGA ATGCTGTCAC CTCATTTTCC TGTAGTATCC AGTTTAAACG ACTGATCCGA TTTTTGTCTT TCAGGGCTCG ATGGATCGGC CAGCATTCGA GTATTCCAAC AAGAATACAA TTTTTTGAAA CGATTTCGTG CATTGACCGA TCTCAACAGC TCTGCCTTGC TCAATTTCGT CTCTGCTCAA CGATGGCTGG GTGTGCGTAT CGAGCTGCTG GGTTCTTTGG TAGTCCTTAT ATCTTCATCA TTAGTTGTAA CTTTGAACGA TTCCCTGCGG TTGGATCCTG GAATTGGTGA GTGGAATTCT TTTTAGAAAG TATCATGCCA TCAAATACTG AGTATACTCA ACAGTAACTT TCAAAGTTGG ATTACTCATT ATCTGGTCGA GTAACTTCAC GATAACCTTG GGGTTCCTGG TAGACACATT CGCGGAAACT GAAGCTGCTA TTACGGCGAT AGAAAGAGTT GATGCCATGG CTGAGCTCCC TCGCGAAAGA TCGATGAAGA CGGACCCAGA ACACACTGTG CGCTCATCTT GGCCAGAGAA AGGTGCGATC GAATTTAAGA ATGTTTGCTT GCGTTACCGG GCAGGGCTTC CTTTGGCGTT GGACGGGTTG TCTTTTCGAA TTCCCCCAGG TCTGAGTTGT GGCGTTGTAG GACGCACTGG TGCTGGTAAG AGCTCAATCT CAGTCGCGCT TTTTCGACTC GTCGAAATAG AATTTGGTGA GATCCTTCTC GACGGTATAA ATTTGGCTAC TTTGGGATTA TCTGATGTTC GGGGTCGGCC AAACGGGATG ACCATCATTC CGCAGGATCT ATTCTTGGCC GGTACAACTT TGAGGGAATG CTTGGACCCT TTTGGTGTAC GAGAAGACGA GGACATCTTG CAAGCTCTCA AAGCAGTTCG TTTGGCAAAG TCGAACGATT TGGTTTCAAA GCTAGAGACG GCAGTGCACG AAGGAGGCTT GAACTACAGT GTGGGAGAAC GGCAACTCTT GAACCTAGCA AGGGCACTGT TGTCCAAGCC CATGGTGCTG ATTTTAGACG AGGCTACAGG TAGTGAAAAG CAGACCATGT TGTCCTATCT ATATCCATTT GTCGTATCTC ACTGGACCCC GATCACTTTC CCTACAGCTA GCGTTGACGG GGAGACTGAT GCCTTTATCC AGCGGATGTT GCGGACGAAG TTTACTGACA CGACGCTAAT CACGGTGGCG CACCGGTTAA ATACTATCAT GGACTACGAC TTGGTTTTGG TCATGGACCA AGGCAAAGCT GTCGAGCTG
|
Protein sequence | MKSGVVMRAA VINVLYEHSL KLTPRGRAGL TSGEVTNLIA VDTQKLYEVA QEGHLIWALP LSITLVTVFL ILILGPITLI GIAVLILFVP LVERVTSRML KIRQRRAKMT DQRVEIVSTM LQGIKVTKLS NLEESYETRV AEARQLELNE LRKELAVWAL TLVITVSSPV VASSATFAAY VLVDERNVLT AAETFSVLLL FGALRFPINY AGRLAGKMVQ ALSAITRINS FFERETRDVD FSLVPSNNVA LSESSDIPLI LSRAAFFLQP ADECVRNGQD NGKKNIHKGS FELSAASFKV STFDFTIRKG EVIAICGPVG SGKSTLIHGI LDEVPSIEGT EVSRYGRTAF VPQTPFILNT TLRENILFGL PFESSVYERV LDVCCLRQDI QQLGESKDHT EIGERGVTLS GGQKQRVSLA RAAYARPDLV LLDDPLSALD AGTAKLVFER LIKSTGSYFS DTAVVLVTHA SHFLNRVDKA LIIVGGKNEF YGSWNDLATY HANDFETNVA IDFLRTSVQE VASESTDSAD QNKDEKLLCK QVDVKDTLMA AEEREHGLSS LSVWLLWFKR AGGFYFIFFQ VLFMGIDRFS YVATEYWLAR WTQSADKPIS VFGVSFPSQE EGRTAQFDYL KVYSSLAGLL SQTIRYVLTS TVTGGTRAAK HVFSSMVYSV LRAPMSYFDT TPMGRILNRF TYDMDVVDIL LTQSMSMFMI SCSWYFAGVI VMCTILPWIA LAIFPVTVIY WVLMLHYRKS GSDLQRLDAV SRSPIQAMIS EGLDGSASIR VFQQEYNFLK RFRALTDLNS SALLNFVSAQ RWLGVRIELL GSLVVLISSS LVVTLNDSLR LDPGIVGLLI IWSSNFTITL GFLVDTFAET EAAITAIERV DAMAELPRER SMKTDPEHTV RSSWPEKGAI EFKNVCLRYR AGLPLALDGL SFRIPPGLSC GVVGRTGAGK SSISVALFRL VEIEFGEILL DGINLATLGL SDVRGRPNGM TIIPQDLFLA GTTLRECLDP FGVREDEDIL QALKAVRLAK SNDLVSKLET AVHEGGLNYS VGERQLLNLA RALLSKPMVL ILDEATASVD GETDAFIQRM LRTKFTDTTL ITVAHRLNTI MDYDLVLVMD QGKAVEL
|
| |