Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44574 |
Symbol | |
ID | 7198085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 944634 |
End bp | 947670 |
Gene Length | 3037 bp |
Protein Length | 556 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178609 |
Protein GI | 219115627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACCACAAG GATTCACAGT CAATCACTGT TCGAAATGGC TCCATCTGTC TCCGTCTACC ATGGGGCTGC TCCTATTTAT GTAATATGGC GAGTTATGAT GGCCAATCGC ATGCCGATTA TGGACTGCGA CGAAGTCTAT AATTATTGGG AACCCCTGCA CTTTATTCTC TACGGATCGG GTCAACAAAC GTGGGAGTAC GCGCACGAAT ACGCGCTGCG AACGTACGCC TACGTGGTTC CGTTGCAATG GCTTGCCCCG GTTCTCAGTC GATTTGTGTC GCCCTACGCA TGGTTGTTAT CGGATTACGT TGTTGATACC ACCCTTGATA CCAGCAAACT ATCCTTGTTT CTTTCGTTTC GGGCTTTTCT TGCAGGTACC ATGGCGCTTT GCGAACTGGC TTGGCTCTAC GCATTGCACT CCCGACTCTC CAAAGACAGT TCGCTTGTTG TAGGCTGGAC GGGAGTGCTG TTGTTGACTG CTGCAGGCAT GAACCACGCC GCCGGAGCAT ACTTACCAAG CTCGACCTTT ATGATGGCGT GGCTCGCTGC GAGTGCGGTT TTTCTGTTGG AACGTCATTT TTTATTCGCA GCAATTGCGA TTATATGCAC ACTTTCAACT GGATGGCCCT TTGGAGTCGT TGTGCTCGTG CCCTTGGGCA TTCGTGTTCT AAGAAAGGAA TACAGAGCAC GTGGCGTGTG GAGGTTGTTG CTCTGGAGCG TTGGCGTTAC GGCTGCGGTC GAAGCTGCCG TCTTGATGAT CGACCACAAA TTTTACGGGG TGTGGCTATC TCCGACGTGG AACATATTCA AGTACAACGC CGCGGGAGGC GGCGACGAAC TCTACGGAAT TGAACCAACA TCCTACTACA TCAAAAATCT GTTTCTGAAT CTGAACCTCC TAGCCCCGAT GGGTATCATC GGCCTTCCCG TTCTCGTCTT TTCGCGAAAT CAACCTGCCA AAGCAGACCT AGTGACAATG ATAGTCACGC TCTACACATG GCTTGCTATT ACTGTCCCTC GTCCGCACAA AGAAGAACGG TTCTTGTTCC CAATTTATCC GGTGCTGGTA CTATCTTCTG TTTTGACGGT TGACCACACT CTCAATTTCA TTGGCCGAAT TGTTGCGGGA TTTTCTCGTC ACAAGACTCT CGTGCGTAAC CAGCGGATAG CTTTGCACTG CCTCGTATGG CTTCCCGTCG TCGCGATGAG TTTGTGTCGC GTGGCTGCCT TACACAAGTA TTATACAGCT CCGTTGCAAG TGTATGCAGC ACTAGTTTCC AGAATGGACC CACTCTCCAA CCAACTGGTC TGTAGCTGTG GTGAATGGTA TCGATTTCCG AGCTCATTCT ATTTACCAAA GAACCATGAC CTCGGCTTTC TCCCTTCGTC CTTCGGTGGG CAGCTACCAC AAGCATTTTC TGTACACGGA TCGCTACCTA AAAGTCTGAA TCTTTTGCAG CCTTTCAACG ATCAAAACCA GCAAGAAATG TCGCGGTACG CTACCTTGGA TCAGTGCAAT TATATTGTGG ATCTAGAAGG AAGCGATTGT GCTCCTTCCG GCGCCGAGGT CGTTGCTCGT GCTCCGTTTT TGGATGGTGG ACGATCCTCA ATGATTCATC GTATGTGGTA CCTTCCGATA TTGCACGATG CCGCCATCAA ATCCGGGAGC GTGCAATACG AACACTATGT TTTGTACAAG ACTTGAGGTA TGTTGGTTTC CGACGAGACT AGTCTACAGC ACTTGTTTCG AAAAGTAGGA AAGCTCGGTT GCTACGGTTT TTGGAAGAAT TTCAAGTATT GAAATTCTCC ATCCGAGTGG TTCTCTTTCG AAAAAAGCCG TGGTGTGTTT TTTGGATGAG AAATCTGTCG GTAGAGACGA TGATGAAAAC ACTTTATCTA TTGGAGCTGT CTAAATGAAT GCGTGTTGAA AAGCAAAGGG GTTTATCTGA GTCTCACTTG TTTTAGTACA ATATCTCCTG TAAACGAGCA TTTTCCCTTG CTGACAAATC GCCTTTCTTC GCAGGCTGTT ATTGAACGCG ACGACTTATA TTACGGAACG AGTTTCTTTT CCAAGTAGGT CAAAGGTACG TGAGGAAAGC AGATGCCACC CCTTCGCATG TCCGGAAAGT GATGCAAAAA TGATGAAGTC CAATTGCTGA CCGAACACAT TTTGTGGAAA TAAATTCACA GCTATTTCAA ATTGCCTGAT TCCTTTTATG CTATTAATAC GCACGGCACG CTTTTTCCTT TCTCTCACTC GTGCTTCTCT GTTTAGATTT GCGTAAAGGG CCAAACGATC CGAACGCATC GCACACTTTC CACACTTGAG CTTGGAGCCT AATGTGGAAT GGTATCCAAT GCTTTTGTTA AACCTTCGGC AGCGTCTGAA AACACGTAGG CATTACAAGA TTTTTAAAGC GCTTCGCACA CGAGAATTAT TCCTATCAGG AAAGGGTTTA TTCAATTGGC ACGACTTCCC TAGCGAGATC TTTGAGGGAA GTCATATCTG TCTCGTACAA ATGTACTTTC GGTTCTGTTT TGTTGTCGAG TACATCTTCT TGGTACAAAC CGACCACACG AACGCCCTTG GTCATTCCAG GAACCAGAGT AATCATTAAC TTGTGTAGTG ATGGTACGGG ACGTTTTTCC TCTTTTGCCT TTTCGATCTC AAACTGTTCG TCGTAATCAG CATACCCCAG GACTAAACCG TCTTGGTCCA CTGCGTAGGC TACTTTCAAG CTCATTCCAT CCTTCGTTAC CGGTATTATG CTCGACGTGT TCACAATGAA GGGCTTCTTC TCTAAGATAA CAAATCGTTT CCAGTAGGGA ACGAACTTGT CCAACGACCT CCACCCAGCG ACGTCTTCAA TAGCCTTTTT AAATGAACCT TCGTCCTTCA AAAAATCAAA CCACTCCTTC CAATCATTTT TCGAATTCAA GGCTTTTGTT TCACTGAAAC GGAAATCAAT TTGAATATTG CGTACGTTGA CTTGATTGAC ATCAAGAGCG ATAAGGACAT AAAGCTCGGC ATTCTCC
|
Protein sequence | MAPSVSVYHG AAPIYVIWRV MMANRMPIMD CDEVYNYWEP LHFILYGSGQ QTWEYAHEYA LRTYAYVVPL QWLAPVLSRF VSPYAWLLSD YVVDTTLDTS KLSLFLSFRA FLAGTMALCE LAWLYALHSR LSKDSSLVVG WTGVLLLTAA GMNHAAGAYL PSSTFMMAWL AASAVFLLER HFLFAAIAII CTLSTGWPFG VVVLVPLGIR VLRKEYRARG VWRLLLWSVG VTAAVEAAVL MIDHKFYGVW LSPTWNIFKY NAAGGGDELY GIEPTSYYIK NLFLNLNLLA PMGIIGLPVL VFSRNQPAKA DLVTMIVTLY TWLAITVPRP HKEERFLFPI YPVLVLSSVL TVDHTLNFIG RIVAGFSRHK TLVRNQRIAL HCLVWLPVVA MSLCRVAALH KYYTAPLQVY AALVSRMDPL SNQLVCSCGE WYRFPSSFYL PKNHDLGFLP SSFGGQLPQA FSVHGSLPKS LNLLQPFNDQ NQQEMSRYAT LDQCNYIVDL EGSDCAPSGA EVVARAPFLD GGRSSMIHRM WYLPILHDAA IKSGSVQYEH YVLYKT
|
| |