Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50124 |
Symbol | |
ID | 7198926 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 117088 |
End bp | 121271 |
Gene Length | 4184 bp |
Protein Length | 1144 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184970 |
Protein GI | 219129595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.190181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGTC CTCTCGAGGT TCCTCCTCCA TACTCCTCAT CGTCCTCCTC CCCACCCGGA CCGAACGTTA CGCGGGGTAA CCAGAACTTG CCGCCACCGG GTAGCACCTC TTCGTCGTTG CCGGATGACA ACAATGGATT GGACGCGTCA ACAACCAAAC CGACGCCGAT TCACGTTTGT TGTTCCGTCT TGAATACCGT CATTGATCAC GTCATGGTCT CACCCAGTGC ACTGGAACCG GAAGAGCTTC GAGCACAGTA CGTCGACGAT GCGGTGTTGT CGAGATCCAC CGAGTTGGGA GCCGCATCAC AGACACGCAC ACCCCTGCAA GTTCCCGTTC TCCGTGTCTT TGGTCCCATC TTGCGTCGGG ACAACGACGA CAGCAATCCA GATGGAAATG GATCGTTGGA TCCGCCAACC CAGTCGGCCT GTTTGTACAT TCACGGTGCC TTTCCCTACT TGTTGTGTCG GCCAGTCGTG GCAGGCGCCG ACGGATCCTG GCACCGATCG TCACACAATC ACCACGGCCT GACGCCATCC GGACACCTCG ATTGGGACGA TGCCGCGGCG GTGGAACGCA TATTGCCCGT TTTGCACGAA CATCTCGAAG CCTCGCTGCA AGCGTCCCTC CAACAATCCT CCCTCGGTCT CGACAAGCAC AGCAACAGTA ACCGTGAAGC TACCGGGAAT TCACGACAAC CACCCAAGCC ACCGGCAACC AAAATTATTC GCCGACTGTC TCTCGTGGTC GGTCGTGGAT TTTACACCTA CTGCGCGGGT CCGCCCGCTC CCTTTGTTCG GGTCGAATAC TACAATCCCA AATCACGATG GAAAGTTAAA ATGCTCCTGG AGCGTGGCTT GGAGCTCCCC AGTTTGTACC ATCCCGATCC AATCCAGTAC GAACCGGCGG CTCGGGAAGA TCACGTCGAG CCAGACATTG ATGCGGCCAA TGCGGAAACC TTGTCCTTTC ACTGCTACGA AGCACACATT CCGTACACGA TGCAGTTCTT CAAGGACTAC AATCTGGCCG GCATGCGGTA CATTCATATT GGTAAGACCA AATTCCGACA ACCCTTGCCA CGAACCCGAC GCGCCCGATT TTGGCACAAG CATGACGTTC TGGTGGAACC ACACGTGCGG GACGAAACCA TGTTTCTCGA GTCCAACACC CCAGCCGTCT ATCGATGGAA TGACGACACC CAAACCAATG TGAGCCTATT ACACGTATCC CCGACCGCAC AAAACGATTT CGGCGTGGAC GATTCGGTGG CGCATTCGGG TTTGGGAGAC ACGTTTGCGT TGGCGCAACT ATCACAGACA GATATTCCGA GCAGTCCATG GTCGGACCGC CCCGACAATG CCAGAGCGGA AGAGGCTGTG TCCCACGCTG AAAGTACACC CCAACACTCC AATCCAGACG CGCGAGAATG GATTGTCTCG CCAGGAACGT ACGAATGGGA GAAACGCCTC GAAGCGCAAG CGCCACCCAC GAAAGAAACC ACCTGCGATT TGGAACTCGA TATTCACGTC GACGATATAC TAAACGTCCA CGAAGTAATT CGAGAAGTGC CAACCATTAC CAACAATTCG CGACAACCAG TGCATTGGAG GGCGGTACCC AGTCTACGAG AGATTTGGGC CGAAGAACGT ATCCGCATGG GCAAACTACT ACCTCCACAG GATGACTTTC TGAGTCCAGA CCGTGGAGCT AGTACCACCA CACCACCCTT CACGTTAAAT GTCCAGCTTC CAGATTCGGC TTTGCCGGGC ACCAGACTTG CAGTCACAGG AATGAAACGT CTACGAGACT TGACGCTCGG CTTGGACGAT AACTTTCGTC GGGTCATGAA AGATATAATT GCCCGCCACG ACGTGGCTGT GCAGCGAATT GACGAAGGTC TCGCACGCCG ACGATTAAAT TCCGATAGAT TGGCAGAGGC TCAGGAATGG GGGCTTTTGA ACCCCAACCG TTCGTCGGGT CCGAAAGGTT TGACCCCCTC CGATCAGGAA GCGACAGACA CTTTGGCAAT GCTTGGTTCT TTATTCAAAG AACCATCACC ACCACACGCA AACAACAATT TCTCGAGTGA TCAAGGACCG GGATCATCTT CGAAGTATAG TTGGTCATCC AGTCCGCAGA ATTTTTCTTC GAGCCAAGAC ATCACAGTCA ATGGCAAAGC GCATGGTATC GAAGAGGATC GACATTCTTC GCAGAGCGAG CGCTTTCAAA GCTTGTCTCA AACGTATTAC GATGGTCAAG CTGAAGCTGC TGCGGAAAGT GACTTTGAAT TGAGCCAAAG GATGGAGCGA GGAGATGGGA TCTTCGAGGG GCCGTTCGAA TATGTAGAAG ACTTTATTGA TCCGGAAACG CTGGCGCCTT TTGAGTCAAT CGACGAGGAT GGGGATGATT TGTTCGATCT AGACAGCGAT AGCGACGATG GCGCAATGGA TGAGTCTCGA ATAGAGGAAG AGCTAACGCA ACTTGCAACT CAAACGTACA ACAAATCAAT GGAAGATTAC ACCGACGATT CTAGATCAAA ACCGTTCGAT ATGAAGGGTA GCCAAGATGG TTATTGTGCC TCATCGGGCC GTGACCCAGT CAGCACAGAA AAGTCCATGG CAGAAGCCGA CTTATCGGAC CATGATCCAG CCAGCAGAGA AAAATTCATG GCGGTTGACA TGAAAGCCGA CAACCCAACG CCTATGTCTG GGAAGCAAAA TTTGCAAGGG TGTGGTAATG AGAGCGCTAG AGAAATTTCC GCTTTTACAA CAAGTCTTCT GAGCTCATCT GACTGCCATG CTCCTTCTAA GTGCTACGTT GAAATTTTGC GAAATCCTCC TACACGAAAG GCATCGAAGA ACAGCGGTAC ATCCGTCGGG TACCATCCGT TAGCAACCGT TGGTGATGTC CCGCCGTGGT TGTTTTTTGC AGAGTATCAA AAGCTTCGTG GTACTACCTC CAGCGCTGCA CCATTTTCTT CGTTTCCTTC CATTCCCGAT GGTGGATTAA GTGTCCTTCC GACGAAATCC CCTCCTACTC GGCGAGCTGT ACAAGGGTGG ATGCTGCGAG AACGCAAACG AAAGCAGCCT TTGGATTCGG GATGCGACCA GGAGCACGTC GCAGAGAAAA AGCGAATTGC AGGCGCTTCT GCGAGCCTTT CTGTTGTCGC TGAGATGAAT GTTGCCATCG ACAGAATTCC TCGTCAGCTT GAATTGCCAC AGACAGCAGA AATTCATTGC AAGGAATATG CTGTCAGGGA CAAAGATCAA GCTGGAGCTT TGACCGTTGA AGAAGTAGAC TGGTCCAAAA GCCAAAATCT TTCACAGTAT CAAGCTTCGC AAACCGGAGA CGAAATTGAG ACTGACCATT CAACAAGTAA CGGAGGGTTT CAAGTCGTTT CGACTGCTGT TCAAGAGAAG GTCAGAAGTA GCGGTGACAC TGCGAAAAAT CACTTACCCA CAGACTATGA GTCATTGAGT CAAAGCATGC CAATCAGCGA ATCAGGCGGA TCGTTTTTTG TTGCTCAGCC ACTGGATGGT ATTGGTAATC AAGGGGGTCG AATATGGGTG GAAGGTGGTG GCGTTTTGAA AGCAAACACA AGGTCCTCAG CCCGACAGTC AACAGCCGAC AAAAGCCCGT GCTCTACTAA GGCACATTCC GAGACTATCG GCGGGCCTCA CTTACCCTCA CCACTTTCAG TCATGATCAT CGACGTTCAT GTGCAGTGTC GGTCGGGTCG CGCTGGAACA TCTGATTCGA AAACGATTGC TTTGACCCCC GATTCTGATC GAGACAAAGT CGCTTCAGTG ATTTACTTGT ATGGAAAGGA TCCTGGCGGA GGAGAGTCTC TGGAGATTCT TGAAAGAGGA TGTATCTTTA TTCCTGTAGA GAAGGAACTA TCCAACACGT CCTTAGACTT CAACGAGAAA AAACTCCTGA AACGCTTCGC AGACGATCTT CGTCTGGCGA TGCCGACGAA AGTCTTAGGA TACGACGCGC CCTTTAGTGT AGACTGTGTG AGCGACGAGA GGCAGCTTTT ACTCAGGCTT TCTTCGGTAG TATATTCCAA GGATCCCGAC TTGCTGCTTA GTTGGGACAC TCAAGGTACA GGGCTCGGGT ACTTGATAGA AAGAGGCTCC AAAGTCTCGG GTACAACAAA TTCAGACTTA TCTCGAGAGT CGGA
|
Protein sequence | MSRPLEVPPP YSSSSSSPPG PNVTRGNQNL PPPGSTSSSL PDDNNGLDAS TTKPTPIHVC CSVLNTVIDH VMVSPSALEP EELRAQYVDD AVLSRSTELG AASQTRTPLQ VPVLRVFGPI LRRDNDDSNP DGNGSLDPPT QSACLYIHGA FPYLLCRPVV AGADGSWHRS SHNHHGLTPS GHLDWDDAAA VERILPVLHE HLEASLQASL QQSSLGLDKH SNSNREATGN SRQPPKPPAT KIIRRLSLVV GRGFYTYCAG PPAPFVRVEY YNPKSRWKVK MLLERGLELP SLYHPDPIQY EPAAREDHVE PDIDAANAET LSFHCYEAHI PYTMQFFKDY NLAGMRYIHI GKTKFRQPLP RTRRARFWHK HDVLVEPHVR DETMFLESNT PAVYRWNDDT QTNVSLLHVS PTAQNDFGVD DSVAHSGLGD TFALAQLSQT DIPSSPWSDR PDNARAEEAV SHAESTPQHS NPDAREWIVS PGTYEWEKRL EAQAPPTKET TCDLELDIHV DDILNVHEVI REVPTITNNS RQPVHWRAVP SLREIWAEER IRMGKLLPPQ DDFLSPDRGA STTTPPFTLN VQLPDSALPG TRLAVTGMKR LRDLTLGLDD NFRRVMKDII ARHDVAVQRI DEGLARRRLN SDRLAEAQEW GLLNPNRSSG PKGLTPSDQE ATDTLAMLGS LFKEPSPPHA NNNFSSDQGP GSSSKYSWSS SPQNFSSSQD ITVNGKAHGI EEDRHSSQSE RFQSLSQTYY DGQAEAAAES DFELSQRMER GDGIFEGPFE YVEDFIDPET LAPFESIDED GDDLFDLDSD SDDGAMDESR IEEELTQLAT QTYNKSMEDY TDDSRSKPFD MKGSQDGYCA SSGRDPVSTE KSMAEADLSD HDPASREKFM AVDMKADNPT PMSGKQNLQG CGNESAREIS AFTTSLLSSS DCHAPSKCYV EILRNPPTRK ASKNSGTSVG YHPLATVGDV PPWLFFAEYQ KLRGTTSSAA PFSSFPSIPD GGLSVLPTKS PPTRRAVQGW MLRERKRKQP LDSGCDQEHV AEKKRIAGAS ASLSVVAEMN VAIDRIPRQL ELPQTAEIHC KEYAVRDKDQ AGALTVEEVD WSKSQNLSQY QASQTGDEIE TDHSTSNGGF QVVSTAVQEK TMSH
|
| |