Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42527 |
Symbol | |
ID | 7196077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 308705 |
End bp | 310421 |
Gene Length | 1717 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176562 |
Protein GI | 219109615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000294856 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTCATGTA CCAAAGCAAA TCAGCACCGT AAACATAGGG CGAGACCTTC GAAAACCTGC GTTATTAAAA CTGTGTGTGA TAGGTTGTTT GGTCCCCACC CACTCTTGTC TCCCGCAATA ATGGCAGAGA TGTCGGATGC CCCGGGCGAC TGCTACTTGG GTATTGACAA TGGCACCCAA GGTTTATCGG TGGTGTTGAC GGATGCCAAC CTCAAGCGTT TGGTTACCGG AGAAAGCAGC TATGAGTTTG TCCCAGATAC TGACGCGGGA TGCTACGAAC AACTCACGGA TGATTGGGAT CTAGCCCTGA CCGACGCTAT GAAATCCGTC CATTGCTATT TGTCAGAACA CAAGTCCTTA CAAATAAAAG CTATAGGTAT CTCGGGGCAA ATGCACGGCG AAGTCTTGGT GAATCATGAA GGCCAACCTT TGTCGTCGGT CCGTCTCTGG TGTGACGGTC GCAACGAGGA CGAAGGCGAG GAACTCACAC TCGCGTTTCA ATTCAAGGTT CCCAAACGAG CAACCTGTGC AAGATTTCTT TGGACAGCTC GCAATCGACC CGAATTGGCC AGTCGCGTTG CGCACATAAC GACACCGGCA GGCTGGATGG CTTACCGACT GACCGGAGAT GTTGTACTCG GAATCGGTGA TGCAAGTGGC ATTTTTCCTA TTGACGCGGA TACTCTTGTT TACGACGAAA ACATGCTGCA AACGTTTGAT AATTTGGTCG GAAATGCTGA TATCCCATCT ATGCGAGACA TTCTCCCGAC AGTACGACGA GCTGGTCAAG ATTCAGGAGT GCTGACGCAA CAAGGCGCCG CTTTGTTGGG CTTCCAATTG CCGCCAGACT ATCCAATCGC AATAGCTGCG GCCGAAGGCG ACCAAGTGGC CGCTCTGGCA GGTAGTCTTA TTGGCCGGGA TGGTATCGTC TCATGCTCGT TCGGAACCTC GGTTTGCGCG AATGTAGTTA GCAAACACGG CGCAGTAGAA TTGCCTGTGG AGCCTTCCGT TGATCACTTT TGCGGAGCCG ACGGCAAAAA CATACACATG GTTTGGCTGC GCAACGGTAC AACGTTTTTC AATACAATGG TGGCTTCGTA TGGAATCCTG TCGGACAAAA ACGATGCATT TTCTGCTGTC ATGCCCCAAA TGCTCAACGC CGCCCCTGAT TGCGGCGGAC TGCTTGCGCT GCCTTTTATG GATGATGAGC CGGGCTTGCA AGTTTCTAGG GGTGGTACGG CTCTACTCCT TGGTCTCAAT GGGAACAACG CAACACCAGG AAATATTGCC AAGGCGGCCC TGCTTTCCAC CATGTTCAAT TTGAAACTGG GTTGTAAAAT TCTACAAGAG AATGGTGTTG TTATGAAAGA ATTGGTTTTG ACCGGTGGTC TGTCAAAGTC CCCCGCCTGC GGACAAATTT TGGCCAATGT TTTTGGCTTG CCAACCCAAC TGCTGGAGGC GGCAGATGAG GGAAGCTGCT GGGGTGCTGC TGTGCTGGCG AAGTACCGTC ATTTGAGCAT TGGAGACAAT GGAAACGACT GGACACTTTT TTTGGAGTCA ATCATGAAAG AAAAACGTAT TGAACAAACC AGGTTTGAAC CCGATTTCAA TGCTGTGCAC GAATATTCCA AGGTGTTCGA CCGCTACCAG ATCCTTGTAA AATTGCAAAC ACAACTAGCC AATGTTTAAA TTATAGATTA CACCCGTCTG TTTCCGG
|
Protein sequence | MAEMSDAPGD CYLGIDNGTQ GLSVVLTDAN LKRLVTGESS YEFVPDTDAG CYEQLTDDWD LALTDAMKSV HCYLSEHKSL QIKAIGISGQ MHGEVLVNHE GQPLSSVRLW CDGRNEDEGE ELTLAFQFKV PKRATCARFL WTARNRPELA SRVAHITTPA GWMAYRLTGD VVLGIGDASG IFPIDADTLV YDENMLQTFD NLVGNADIPS MRDILPTVRR AGQDSGVLTQ QGAALLGFQL PPDYPIAIAA AEGDQVAALA GSLIGRDGIV SCSFGTSVCA NVVSKHGAVE LPVEPSVDHF CGADGKNIHM VWLRNGTTFF NTMVASYGIL SDKNDAFSAV MPQMLNAAPD CGGLLALPFM DDEPGLQVSR GGTALLLGLN GNNATPGNIA KAALLSTMFN LKLGCKILQE NGVVMKELVL TGGLSKSPAC GQILANVFGL PTQLLEAADE GSCWGAAVLA KYRHLSIGDN GNDWTLFLES IMKEKRIEQT RFEPDFNAVH EYSKVFDRYQ ILVKLQTQLA NV
|
| |