Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50482 |
Symbol | |
ID | 7199322 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 180208 |
End bp | 184144 |
Gene Length | 3937 bp |
Protein Length | 1188 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185442 |
Protein GI | 219130584 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCATCGA GGAGTAGATT TAACTGTCTG CAGCTTCACT AACCTTTATT GCACACCGCT TTGAGGCTCG TGTGCATATT GGAGGACATT TGCGAAATAA GTTTGCGGTA AGATAAAGAA ATCGTCCAAG CTTTTGGATT ACCATATTTC AACACCGATC GTCCTTTGGC TAACTTCATC TTCTACACGT TTACTACGGT AGCTCACGTT GAACTTTAGT TTTCTGACAG AGTTGGCCGG TAAAAAAGGA TATTTGTAAT TACAAACAGA TTCAGCCCCT CTCTTCTTAA CTACTTTTCC TAATTCGCTC AAAATGGATC CTGAAGAGCA AAAGATAAGC CGAGGCATTT TGCTATCCGA AACTAGGGGT ATCTCTAACG AATACGGTGG CGATAGCACT CGCGAGGACC TCGCTAGAAT CGACTCCAGC TCACCAGCGA CAGAGACACC GCCCGCCAAG AAGCCGAAGC GCGAATCTCC GTGCCCTGAA AGTAACGGCA TCGCCAACGC CGAGAGCAGC TCCAAACCAA GCAGTACAGT AGGCAACGAT GCTGCACCTA GCGTTGACTT TCCGCCAGAA GCAGGAGGTA CTCAGCCGGA TTCTCAAGGT CGAGATGAGA TTGTTAGCTT GTATATTTCT CTCGAGCCAA TCTCCACAAA AGTGCCACAA ATTCCAAAAT TGACCGAGTT TGAGGTCAAA CAACTGGAAG CAGTCTTAGA GTTCAAGAAT AGTTCAGAGT GGCGAGATGA TTGGATGGGT AACCTCGCCT TTGCCGATCT AGACGTGGGC AACCCAGCTA TTAGTAGTAA ATCACGGGAC AAACAACATA CTTTCCGCCA GCCTTTGATC CAATGGGCGC ACAATGATCA ATCGAACGCC AAATACGTGT GGTTTTTGAT TTGCCATGTC TATAACATTC CAAGTATTCC TCCGGCAGCC AGAAAAATTC TTGGGGCCGT GGATTTACGC TCCTCTGTTA ACATGGAAAA AACTTTACGG CGTGTTATGT ATGATCCCGA AGTCTTGCGC GAAGATGGTT GGACTACCGC CAAGTCAAAT GAGTACATAG GGGCGACCGG TGGCCCACAC AACATTGGAG AGCAGATCTA TTGGGATGGT AGCAATGCCG TCGTAATTGC CTACATTCAT GACCCAGGTA AGACCAAGGC AAAAAAGAAT GACAGAATTT AAATCAAGGT ATTCTCAACT CAAGGATTGC GCGTTTCAGA TATTGGCGAC CTTTGGAAAG CCATTTGGGC TGGCAACGAC GACGACGATG ATGTACTCAC GACCACCTTT GATCTCGAAG CTGAAGAGCT ACTAGAAGCG AAGCGAAAAT GGCAACGACG GCAAAGCTCA AGATCTGGAG GAGGTTTATC GAGCTCACGA CACCCGAAGA AGTCGAACGT AAGCGACGAT TTCAACGTTG CCGGGGTCGA ATTGGGGATA GTTCTTGGAG CGAGTTACAG TAAAGGTGCC CGAAACGGCG TTTACTGGCC GGCCCGTGTT ATGCACGCCT CTGAAAAGAT GGGCACGAAA TCTCAAACAA AAAGGCAAAG TAGCAAAAAC AAGATCGATC TTGTCTTCCT TGCCCCCTAC TGGAATTCAC TTGAGCAGTC TTTCGCGGCT AGGAAAGTGG AGGCGCTATC TGAAAATCGG AAATCATCGT TTCATTCGAA TCCTTTGTTT CAATTTGAAA CTGTTGAAGC CACTGACGAT ATGATTAAGG AGTATCTTTA TCGCCCCGAG TGTGAATTGG ATTTGCAACA GCTTCGCCTT TCGTTCCGAT TCACCGGTTT ACCTAAGGGA GCATTCTCTC GTTTCGTGGA TGCTCATCGT CTCGCTCTAG GCTTGCAGAA TTACGCCGTT CGGCATTTGA AGAAAAACGT ATCCGCCACT GATCGTGCTA CTGCTGGATT GTTCGAGGCC CACCCACTAG CGGTGAGAGC GCCGATATAT CCTTCAATCG TCCTTGAGCT TCCCTTCGCA TTTATCCTTT CGCAGTTACC AACTCTTTCG AGCCAGTCTG GTTTTGAGCA CGAAAGACAC GAACCTGTGT TGGAACTGAA GACGATTGTC GACTCTATGA AGCCACCATC ATGCTGGGGT AAGAACATCA TCGCAGCACT CCCTACGTCA GCAGACGAGG GTCGAAACAT TCACGAGGGA ACCGGCGATA TCACTCCATG GAAGATTACT ACTATGTCTA ACGGGAAAAG TTCGAAGAAT AGAGAGTATG AAATCGGGCA TTTTCTTTCA GGTTTGTTAG CTCTTCAGCA GGCGTTTGAT GATCATACTT CATCGCCTGC AGTTCTTGGA GTCATACACG AGTTCGACAA CCTTTTGGTA TTAGTGTCAC AGAAAAATAT TACAACCGCT GGAACTGAGT CTGAGCGCAC TTCCCGGCTT AAAGATTTGG TAAGAAACTG GGCCATCTTA AAAGGGCATG GGGAAGAAGG GCTGTCGATT GAGAAATCAA GCGGGGAAAG TTTAGTTGTC TCAGAGTGGC ACAGAGCAAC TGAGCGTCTT TTCAAGTATA TTGTGGAATC TTTCTCTGCA AGCATTAGTC GACGAGGAAT CTCGACTGTC TTTACCGACA CACGGTGCAA TGGGCACATT ACGTCAAACG ATTGCTTCGA ACGTGCAGTG AGACTTCCGG CGGCTTTGAA AGGTGCAAAA CTTGCGGGCG CTGGAAGTGA TGAAAACTGT CGACTTATCT CAGCAGTCGA CGAATCCTAC CTGAAGTATG TCGAACATAC ACTTTTACCG AAAGCTCACG ACAGTGCATA CCTGAAACGT ATGCGCGGAC GATGCGCAGC AGCGGTGAAT GAGACTGAAA TTCTTGTACT CACCGAAGAT TCGGAAGGCA ATGGAGGCAG TGACACTCAC GGATCAAAAG GAACATGGGC GGCAGCGGTT ACAGCGGTGG CAGCGGCCGT CGCTGCAGCG GATATGATAG TAGGTGGAGA GTCAACAAAC GCTTTCTGCG CTACGCGACC CCCAGGGCAT CACGCAGGTA AGGGTCTGCA TCCTATGAAA GCAGTCTCCA ATGGCTTTTG CGTTTTGAAT GCTGTTGCCT GTGCAGCTAT CCATGCGACT TCCTCAATAT TGGAAGGGGG CCTCGGACTC AAGAGAGTGT GCATTATAGA TTTCGACGTG CACCATGGAA ACGGCACTCA GGATATTCTC TGCTCGACTT TCAACCCACA TTTTCTCTAC GTCTCGATTC ATGCGGGAGG TCCGCATGTA AATGGAGTAG CTATTGACGA CGATCCAGAT CATGAGCTAC ATGAACTAGC AAGTAACCCA AAACAAGGCG GTGGCATTTA TCCGGGTCGC TGCGGTGACA CCTCTCCCCA CAAAGGAGTA TTGAATATTC CGTTAGGCTC TAAAGTTACT GCCCATGCCG TAGGGGCAGC TTTGTTGAGC ACTGTAACTC CTGCTGTCAA CAAATTCACA CCGGACCTCA TTATTCTATC CGCCGGCTTC GATGCACACA AAAGTGATCC AATGTGCTTG GGAAGTTTAA ACGCTGAAGA TTTTGGCCAC ATCACCGAGG TTTGCTGTCA ACTTGCATAC AAATCCTGCA GTGGTCGAGT ATTAAGCGTA CTGGAGGGAG GCTACGGTGT TCCCTGCTGC CGACCACAGA AGAATGTATT TATTCCTTCT CCCGGTCGAG GCGACCGGGA AATCGAGTCG ATTTCTCAAA AGCAAAAGGA CGGACCTGAA TTGCCTCCAT CCACTCCGAG TGCTTTACAA ATACCCCGTC CACAGCCATC GAGGTTATTG CAGTTGGGAG ATGACTTACC GGAATCAATG GACGATCAGG TTCCATTCGC TCTACAGCGT CGGCTCGAGA AGTGCCATGC CGAGGGTTTC GTCGAATGCG TCAAGGAGCA TGTCGCCTCG TTAATGCGAT GCAACAAACG CACGTAG
|
Protein sequence | MDPEEQKISR GILLSETRGI SNEYGGDSTR EDLARIDSSS PATETPPAKK PKRESPCPES NGIANAESSS KPSSTVGNDA APSVDFPPEA GGTQPDSQGR DEIVSLYISL EPISTKVPQI PKLTEFEVKQ LEAVLEFKNS SEWRDDWMGN LAFADLDVGN PAISSKSRDK QHTFRQPLIQ WAHNDQSNAK YVWFLICHVY NIPSIPPAAR KILGAVDLRS SVNMEKTLRR VMYDPEVLRE DGWTTAKSNE YIGATGGPHN IGEQIYWDGS NAVVIAYIHD PGLRVSDIGD LWKAIWAGND DDDDVLTTTF DLEAEELLEA KRKWQRRQSS RSGGGLSSSR HPKKSNVSDD FNVAGVELGI VLGASYSKGA RNGVYWPARV MHASEKMGTK SQTKRQSSKN KIDLVFLAPY WNSLEQSFAA RKVEALSENR KSSFHSNPLF QFETVEATDD MIKEYLYRPE CELDLQQLRL SFRFTGLPKG AFSRFVDAHR LALGLQNYAV RHLKKNVSAT DRATAGLFEA HPLAVRAPIY PSIVLELPFA FILSQLPTLS SQSGFEHERH EPVLELKTIV DSMKPPSCWG KNIIAALPTS ADEGRNIHEG TGDITPWKIT TMSNGKSSKN REYEIGHFLS GLLALQQAFD DHTSSPAVLG VIHEFDNLLV LVSQKNITTA GTESERTSRL KDLVRNWAIL KGHGEEGLSI EKSSGESLVV SEWHRATERL FKYIVESFSA SISRRGISTV FTDTRCNGHI TSNDCFERAV RLPAALKGAK LAGAGSDENC RLISAVDESY LKYVEHTLLP KAHDSAYLKR MRGRCAAAVN ETEILVLTED SEGNGGSDTH GSKGTWAAAV TAVAAAVAAA DMIVGGESTN AFCATRPPGH HAGKGLHPMK AVSNGFCVLN AVACAAIHAT SSILEGGLGL KRVCIIDFDV HHGNGTQDIL CSTFNPHFLY VSIHAGGPHV NGVAIDDDPD HELHELASNP KQGGGIYPGR CGDTSPHKGV LNIPLGSKVT AHAVGAALLS TVTPAVNKFT PDLIILSAGF DAHKSDPMCL GSLNAEDFGH ITEVCCQLAY KSCSGRVLSV LEGGYGVPCC RPQKNVFIPS PGRGDREIES ISQKQKDGPE LPPSTPSALQ IPRPQPSRLL QLGDDLPESM DDQVPFALQR RLEKCHAEGF VECVKEHVAS LMRCNKRT
|
| |