Gene Information |
Locus tag | PHATRDRAFT_49408 |
Symbol | |
ID | 7195902 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 176601 |
End bp | 180390 |
Gene Length | 3790 bp |
Protein Length | 1026 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184075 |
Protein GI | 219127714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
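The Start bp, End bp and Gene Length values above agree if the coordinates are read as 1-based and inclusive (180390 - 176601 + 1 = 3790). The sketch below, which assumes that convention, shows how the length and a GC percentage could be recomputed from a record like this one. The function names `gene_length` and `gc_percent` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace is ignored, so the 10-base blocks used in the
    Sequence section can be pasted in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp and End bp rows of this record.
print(gene_length(176601, 180390))

# First two 10-base blocks of the gene sequence, as a small check.
print(gc_percent("AACAACCTGT CTTGACCGTG"))
```

Because the gene is marked as spliced, the gene length covers introns as well as coding sequence, so it is not expected to equal three times the protein length.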
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.612989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACAACCTGT CTTGACCGTG AAAACCAAAA TCGTTCAGTA GCCTATGGTG ATTGGTGGGG CTTGGAAGAA CGATGAATAG TTCAGGTCAC AGACGGTTGT TCCGTTGGAA TCGTGACAAA CACGGTGATA GTTTTGACGC TTCAACCGAA GGGGGAGGAA GCAGCCATGC TCTTCCGGTA TCGAAATCTG GCTCACCGGT TCCCAGTGAT AGTATTCCCA AAAGTTTTGC ACAAAGCCGA ATCAATTGGG AGTCCGGAAA GAAGTACGAC GAGTCATCGC AGGATTTTTC CGGACATCGC AGTCGTGATG ATTCTGTAGT GGAAACAAAG CTTTGGAGGA ACAAGCTGAC AAAAAATCTC CGTGGTAAAT TCAGTCCACG TTCAGGTCTC CAGATAAGAA AGCGTAGAGA TAAGAAGGAT TTAAAAGAAG AGGCCGAATC TCGCTTCCTC CGGACCACAA GTTTATCCGA CGTTGATGAC ACCTTGACTC GAAACGACTC TCGTGGATGG CACAAGCGAA ATGGCCATCC CAGAAATTTA GCTCATGCAA AGAAAAGAGG CGATGACTCG TTGGAATTCG CACGAAAACA ATCCCCGCCG TCTAGCACGG TATTCTTTGA ACAAGGAATG TACTCTGCTG CATCGGTAGA GGAGCTTGGA TATGAAACTA TGACTGGCTT CACCGACGCT CCGAAAGAGT CAGACCAACA GTTGGAGATG AAACGGGCGA ACTATGGAAC AGGGCATTCG AAACGCAAGT TTCGCTTACG ACCCCACCAT TGTTTTGAGA AGGCTACGTA CATGACAGAG GAAGACATTT ACTCGGATAG TATAGAGCCG TCCCAGTCTT TTGAGTTCTT GAAGTCTTAT CTGGCACCGA CTGCTTTCAC AGAGCCGAAA GGCGAACGTG ATGGAAAAAT TGAAGAAAGT TATGGTTCGC CGGAAACCGA TGGCCGTGTG GGAGCGCTGC GCGTGGAAGT TCTTGGATGC GTGAGCCTGG CGCGACAAAA ACCTGATGTC GCCACGTATT TGGTTTGTGG TGACAGTGCT TTTTGTACGG ATGTGATAAG CGGTTATCGA TCCCCAATGT GGCCAAGCGC TTCAAAGCGT GCCGCAGTTT TTCCTATTCA TCATGCATTC TCCAGATTGT ACATTGGTGT TTTCGATGTA AAAGCACGAA AGAGTAACAA GCAAACAGAC GCATTTTGTG GAAGAGTCAC CGTTGATATT TGTTCTTTAA GACCCGGGAC TGAGTATGAT ATAACATTGG CCCTTCGGGC CTCTACATTT GTTTACGATC GTCGTCGACG AGGAGTGGTC CGCTTGCGCT TTTCTCTTCA TTGGTTTAGC GAGCGCGGCG CTATTCTTTC TTACTTCAAC CAACCGCGAA GCCTTGTCCA GGCCTGTCCT CTTGTCAGTG GCTCGCCTGT GATTCCGTGT GCCGAGCCCA GAACATTCAG AAACCTTGCT CTTACTGTGC ACGGTCAAGA TCTTCCTGGA AAGTATTCCC GGAGCGCGTT TCGGGCAACC ATGCGTGAAT TCAATTTGTA CCAGGTGAAC ATCCGGCACT TGCTCAAGAC AAGCGTAATT GAGGCTGTTC TCTACGAGAA ACCAATAGTC TCTTTTTGTT TGTTCCTCGC TGGAATGCAT TGCGTTGTCT CAAACTCAGT ACATATGGTT CCACCATATT TTCTTGCTTA TCTTTTGATT CAACTGCACG AGAGCTACCA ACACTATGTC CAAAGTTCGG TATACAATTG TGGATACAAA CCCCTCACTT TCTTTGAGGT ATTTCAAGCT 
CTGATATTCA ATACGACAGG ACGAGATCGA ATGTTTGAGT CCATATCCGT GGCCAAGCAA GCAAAGCAAA GAGGAAATAT TAGGCAACAT AAGCAGCTTG CCGATATGGA TTCTGATAGA GGGGAAGGGG GTTTGGTAAC TCACCCTCCG GATCATCAAG AATTTCCTTT TTCAGATAGG GATGCCTACC CCAATTTTGG TGTTGACGAC GCACTCGCAC CGAGTTTGAA GAAAGGACGA GGTGAGTTGG TGGTGTGTCT ACACTACAAT GAACTGGCGA GTGACTGAAA TCTCAAATAA TTGTTCTCTG CAGGAGGGGA AAACAGTCTC CATGGAAGAC TCTCTGTGTA CTACACAGCT ATACCAAAGA CGGATCCAAG TGGATATGTT GACGGCATCT CAAGTGATGA AAGCGGGGAC GCAGACAATG ACGACGAAAC AATTACGACA GAGACGGGTA TGGATGACAG CATGTTCCAC CTTGATATGG AGGGGGAGGA TGATGGCTTG GAACCGGATG ACCATTTGGA GCTGGCTTCA CAATCCAACC TTCCTTTGGC CGCTAGCAAC AGACGACGGA TTAAGCTTGG GCCGGCTCAG AATACAGACA CATCTGCAGC AGTCAGAGTT CCACCTCAGA TCCACTTGAA AAAAATGGAA AACATTTTTC ATAAACTTTC TAAAAAGCTG TCTGTTGAGC TTGTTGCAGC ACCTATGGAT CGAAATCTGG GTGGAACTGT GGACACGTCT CACCTTGCGA TGCGTGGGTC GCTGGCTTTA ACTACGGAGG CTTTGCAGGC GCAAGAAAAA CTGGGCAAGA AGGCAATTTA CGATGAATTC GATAGACTCT TGGGTCTAGA GACGAGGACT GCTAATCCAG TTCTTAGAAT TGCATCGGCT TTTTTAGGTC CTTTGATGCG GATTATCCGA GTCTTTGTGT ATCTGGTCCG CGTTTTATTC AACCTTACAA CATGGCGCGA TCCGTATATG TCTTTCTTCC TCTTCATTTG CTTGTCCGTG TTGTGCCTCA TTCTCGCCAT TTTTCCATGG AGAGTGTTTT TCTTTCTGAC TACATTCTTA TTCCTGGGTC CTCAGGTAAT TTCTCGACCG GGTATAGGAA AGCATACGCT GGTTTCGTCT AACTTTATAA TGTGCGTTGT TTTTTTTTGC AGAACATTTT TCTTCGCACG CATCTCGAGA ACAAAGCAGC AAAAACACCC AAGGAGCACA GGGAGAAAAC CGGTAACAAA GATGAAGCGT ACCCGACGGA GCAGAGAAGA GACCACAAGA CTAGCGAATT ACAATTGGGT GTTTCGAGCA ATAGTCGCTT AGATCATCGT AAGGTTTTTT TTCGGAAGAA CACAAGTCGT GAACTAAGTA CTGACAAGCC TCACTTGATT GGACCTTTAT ACCAAGATGA ACGCCCAGTA TTCAGTGTTC AGCACGGTTC GGTGGCGAGT CGTAAACTTC GGCCTAGATC TGTGGCCATA CCGTACAATC GTTTGCGGAA GGAGCGATTT TATGATTGGC CCCCAGATCC GACAGTTTCC CGTGCAACTC CTTTGCACTT TGCCTACGGC GAAGAAACGG AACGCAGACG ACGAGACAAC ATTTTTTCTT CTGACTTGGT TATGTCGCTT CAGGAAAGGG ATAAGAACGA AAATCGAATC TTCCGACAAC GATTGCTAGA AGCCTCGTTG GGAGAGAATT GAAGTGGTCG GAATCATCAA TCGGAATAGG ATGATGAAGG TATGAAAGCG ACTATGTATC GTAGCAATTC GTAGCGGAGA GGAGAAACGG AGCGAACTGT 
ATGTGCTACT CGGAAAGATC TACTACTATC ATTTAAAATT CAGTCTCAAA ACGCCTTTGA GAGTGCTTGT CTTCTACAAA AGGCCTTGTG AAGTCAGAGG TGCAGTTGCG GAAGCGGCGC AGTAGTTCAC GTGCGTCAAC TGAGACATCG AGAAGATGAC GGAACGATTA GAACATGGGC
|
Protein sequence | MNSSGHRRLF RWNRDKHGDS FDASTEGGGS SHALPVSKSG SPVPSDSIPK SFAQSRINWE SGKKYDESSQ DFSGHRSRDD SVVETKLWRN KLTKNLRGKF SPRSGLQIRK RRDKKDLKEE AESRFLRTTS LSDVDDTLTR NDSRGWHKRN GHPRNLAHAK KRGDDSLEFA RKQSPPSSTV FFEQGMYSAA SVEELGYETM TGFTDAPKES DQQLEMKRAN YGTGHSKRKF RLRPHHCFEK ATYMTEEDIY SDSIEPSQSF EFLKSYLAPT AFTEPKGERD GKIEESYGSP ETDGRVGALR VEVLGCVSLA RQKPDVATYL VCGDSAFCTD VISGYRSPMW PSASKRAAVF PIHHAFSRLY IGVFDVKARK SNKQTDAFCG RVTVDICSLR PGTEYDITLA LRASTFVYDR RRRGVVRLRF SLHWFSERGA ILSYFNQPRS LVQACPLVSG SPVIPCAEPR TFRNLALTVH GQDLPGKYSR SAFRATMREF NLYQVNIRHL LKTSVIEAVL YEKPIVSFCL FLAGMHCVVS NSVHMVPPYF LAYLLIQLHE SYQHYVQSSV YNCGYKPLTF FEVFQALIFN TTGRDRMFES ISVAKQAKQR GNIRQHKQLA DMDSDRGEGG LVTHPPDHQE FPFSDRDAYP NFGVDDALAP SLKKGRAIPK TDPSGYVDGI SSDESGDADN DDETITTETG MDDSMFHLDM EGEDDGLEPD DHLELASQSN LPLAASNRRR IKLGPAQNTD TSAAVRVPPQ IHLKKMENIF HKLSKKLSVE LVAAPMDRNL GGTVDTSHLA MRGSLALTTE ALQAQEKLGK KAIYDEFDRL LGLETRTANP VLRIASAFLG PLMRIIRNIF LRTHLENKAA KTPKEHREKT GNKDEAYPTE QRRDHKTSEL QLGVSSNSRL DHRKVFFRKN TSRELSTDKP HLIGPLYQDE RPVFSVQHGS VASRKLRPRS VAIPYNRLRK ERFYDWPPDP TVSRATPLHF AYGEETERRR RDNIFSSDLV MSLQERDKNE NRIFRQRLLE ASLGEN