Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44794 |
Symbol | |
ID | 7199752 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 263939 |
End bp | 267218 |
Gene Length | 3280 bp |
Protein Length | 973 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178738 |
Protein GI | 219115886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTGGATAAT GCATCGCATA TCGCTGAAAA GAAAAAGCTC GGAATTTTCT ATTCTGTTAA CAAGGAGAAG TAACGACTCA TGCTTTCGTA CCAACCACCT TAGCAACCTC AACTTTGCTA CTTTCTCGCT CACGCGGATC TGTCAAAACC GAATCCTGGA AGGCTACTTT CAAGCTACGA AATCTTCCCA GCGAACTCTC AGAACGTATC AGTCGTCTTC TTACGCATCA CTAGGCTTGC ATGAGGATAG CCATCACTCG AATGCAGAGG ACGTGATCGG TGTTAGTTTC GGCAATTTGC CTCAACAAGG TGCCGATGCC GTCGAAAATA GCAAATTCAA ATTTCGAGCA AAAAATAGAG CTAGCCAAAG TATGACAGAA CAGATACCAA AGTCAATTGA AGAGATCATT GTGTTTCACG AAACTCTTCA CAATAGTCAC TTGCGAATAC ATTGGACCAA GGAGCGAGAG TCTCTCTTGC GGCGGGAGCG ACGGAAGCTG GGGACGACGA ATCAACCAGC TGAAGAATCG ATGGATAGAT TTCAATCTAC TGGCTCGATT CTGACTCACG CAACAGAACA CGAAGAGGAG TCTCCACAGG TTTCTTTACT ACCGGACCCT TCACTCTCGA CAACGTTTGT CCCTCTTACA AAAAAGGAAC CACCTGGAGA CGAGAAATGC GATAATTCCT TCTCTCATCA AGCTCTGAGC AAAAGCGACG CCTCTTTTGA TGGCTTGTAC GACTTGGAGA CGATCGCTTT TTTAGACTCC CTTGTTTCGT TTCGAGACGA AAATGAATAC GACGGAAATC TCTCTTCGAT GGCAGTCAAG CCTTCGTTGA TTGGTAGAAC AAGCGTTTCG AATGTCTGGC CACAGAAGAG GAATTTGTCT CCAGAAAGCT CACAGCCAGC TAAAAGAAAG AGTATTGAGC AGCATGAGCC TTTTGTTGCT AGCGCGAATA TTACAAGTAA GCCGGTTTCC CCCAATGTAG CCGCTCAAGA TGCTCGAGTG GAAGCTTCCC TCCTTCCCGA AACTGGGTCT ATCATTAACA ATGATGAAGG GACTGGGAAG CAGTTTCCTA AATTTTCTGA CGACGAGACT CCATCATTAA ATTTGCTATC TACCCAGCAA GAAATGCTGT TTGCTGCAGT GTCACTTTTA CGCGCAACTA GCGACCGAGA ATGGAAACGC TTTGACCAAT TTGTCGAATC AGATGATGAG GATGCAGATT CTGCCATCGA TAATGCGAAT GAGAATCTTC GAGATGATAC TATTTCATTC AATTCGATTG ATGACAGCGC TAAAGCAAAA GACAAATATC TACGTGGGTC GATAGAGAGC TACTCGATTG ACGATATCAC TGAGGATGTC CTCTTAGGAA ACTTGATGTT GTCTACATTG GAATACAATT TGCTTTTAGC AAATTTGGCA CTGTCGCCAA CACATTCCAC AGACGATGCT TTTAGTTTAC TCATGCGTCT ATATCGACAT GTGACGAAGC TGAGCGAAAT GGGTGTAGGT GCATGCACTC CTGACGGTCT CACATATGAA ATTCTTATTT TGACTTTTGG TCGAAGACTA CAAGCATACG CTGCAGGAAT GGATTTAATA AAGGAGATGA TGGATACTTC TCGATTTACC CCCCGAGCAC TGCTTGCAGC CTTTGAACTT TGCCGCCAGC GATCAGATTT GAATCTCACT AAGGAGATCC TCCAACAAGC AATCTCAGAC GAATCTAGAT CTTTTCCAAT TCCAAACAGT GTGTATTCCA CCTATCTCAG CATGTTGAAA CCAGAAGATG CAACCGAAGA GGCTTTGCAG GTGTTGCAAA CCTGTTTAAA AGTAAGTGTG TTACCAAAAA AAACATGTAG AAGTTCGCTG CAGTCAGGCT TGCTAACGTG CCTTTGATTT GAAAGGAAAA CAAGCGTACG AAGGATAAAT ATGTCGACGA AGTTTTCAAG ACAGCAATAG AGTGGCCTCA TCGAAACACT AAGGGGACGA CAACCGATAG TACTGCTTTC CTTGGCCACA TTATAAATCT ATTACAGGAA GAGGCCATAT ACCGACCAAG CATTCACGTT TGGACAAAGC TTGTCCACAC TTTATGCTGG GGATCGCATC AAAATGAGAA GAGGAGAAAT CTTTTAGGAA ATGTTTTTCG ATTTCTTCTG TCTAAATGGA CCGACTTCGT TCCAGATGCT CGGTTGCGCC GAATTGGATT GGATCTGAGT CAGCAGATTC CTGATCCCAA AATGGCTCAC GATTTGATTC AAACTGTCTT GAAGCACGAA GTGTTGAAGC AGCGTCACAC TTCGGCCTCT AGAAGGAGCT CCGGGGAGCA AATACGACAC GGCACATCAG CTAGCTCCTT GCTCTTATCA CCGACAGAGG ATGAACGAAA CAGGGGAAAG AATAGTTTCC GCGGGTACTC CGTGCCGTCG GCAGATGTGA CGAAGGCGAT GGAAATTTGC GCTCGCTGTG ACGAAATGGA CAAATGCGAA TCTATCCTTA AAAAGATTGA CAACCTTGGT TATGCAGTCG ATCCGGCCCT CCATAGTACA TTGTATAGTA TGGTCCTGAA AGGATATGCC AAAACCGGAA ACACAACAGC CGTGGTTCGA CTGTTGTCAC ACATGCGGGT ATCTGGGATG AAATTGAGGT AAGCTTCTGA AAACCAAGAC GCTTAGGGCA CCTATGTAAA CGTCGAAAGC ACTCTCACCA TTATATCTTT ATGTTTAACA GTGATGAGTT GTATGGCACT GCAATCCATT GCTACGCCGT TTCAAACCAG GCTGAAGAAG CGTGTGCGCT ATTGGAATGT ATGAAGTCAA ATTCGTTCAA TGACGGCGTA AGTCCAGGTG ATGCTTGCTA CAACGCACTA ATTCTTGCGT ACATCCAAGG TGAAGAATGG GACGCTGCGT TATCAATCTT TGCCGAAATG AAAAATTTGG GAATTTCTCC TGATCCAACT ACGTCACATG GCCTACTGTT GGCGTTCTTC AAATCAGGTG GGCTCTCGAG CGCAGCAGAG TTTGTCACCA CTCTGCTATC GACGAAGGCT GGTATCAACG GACAGACATG CACTCTTGCT CTTCGCTTCT TTATTCCTGA GCTACAAGCC TGCTCTGATA CAGCTTCCAT GCGCAAAAAG CTCCGCGAAC TCGGTATGGT AAGTTCTCGG GATGAGGATG CTATATTGTT AGACTTAGCA CGTTCTGTGC GCGTCGCAGA GCTGGAGGAA GGTCGGAGTA TTTCAAAAAG TCTTCCTGAG GAAGTATTGA
|
Protein sequence | MHRISLKRKS SEFSILLTRR SNDSCFRTNH LSNLNFATFS LTRICQNRIL EGYFQATKSS QRTLRTYQSS SYASLGLHED SHHSNAEDVI GVSFGNLPQQ GADAVENSKF KFRAKNRASQ SMTEQIPKSI EEIIVFHETL HNSHLRIHWT KERESLLRRE RRKLGTTNQP AEESMDRFQS TGSILTHATE HEEESPQVSL LPDPSLSTTF VPLTKKEPPG DEKCDNSFSH QALSKSDASF DGLYDLETIA FLDSLVSFRD ENEYDGNLSS MAVKPSLIGR TSVSNVWPQK RNLSPESSQP AKRKSIEQHE PFVASANITS KPVSPNVAAQ DARVEASLLP ETGSIINNDE GTGKQFPKFS DDETPSLNLL STQQEMLFAA VSLLRATSDR EWKRFDQFVE SDDEDADSAI DNANENLRDD TISFNSIDDS AKAKDKYLRG SIESYSIDDI TEDVLLGNLM LSTLEYNLLL ANLALSPTHS TDDAFSLLMR LYRHVTKLSE MGVGACTPDG LTYEILILTF GRRLQAYAAG MDLIKEMMDT SRFTPRALLA AFELCRQRSD LNLTKEILQQ AISDESRSFP IPNSVYSTYL SMLKPEDATE EALQVLQTCL KEEAIYRPSI HVWTKLVHTL CWGSHQNEKR RNLLGNVFRF LLSKWTDFVP DARLRRIGLD LSQQIPDPKM AHDLIQTVLK HEVLKQRHTS ASRRSSGEQI RHGTSASSLL LSPTEDERNR GKNSFRGYSV PSADVTKAME ICARCDEMDK CESILKKIDN LGYAVDPALH STLYSMVLKG YAKTGNTTAV VRLLSHMRVS GMKLSDELYG TAIHCYAVSN QAEEACALLE CMKSNSFNDG VSPGDACYNA LILAYIQGEE WDAALSIFAE MKNLGISPDP TTSHGLLLAF FKSGGLSSAA EFVTTLLSTK AGINGQTCTL ALRFFIPELQ ACSDTASMRK KLRELGMSWR KVGVFQKVFL RKY
|
| |