Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18217 |
Symbol | |
ID | 7197066 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 830843 |
End bp | 832515 |
Gene Length | 1673 bp |
Protein Length | 326 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178014 |
Protein GI | 219112525 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00902772 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAAAAGGC GCTGCGAACC CTCTCCGAAG GCACAATTAA GTGTTATGAT GTTGCCGTAT GATAGCATCC AATAGAGACG TGGGATTGGC ATTGCCTAAG ACAGCCGTTG CTCGATTGAT CGGACACGAT GGACCTGTTC AGACGCTGCG TTTTACCGGT AAGTTATTGT TTTCATGTTT TCTTTTCTGC TTCCCAAAAG ATGAAAATAT TTCTCGCGCT TGGACTGAAC GACTGCATTC AGTGGCCATT GATGACACCG ATAGAAATAT ATTCCTTCAC TTTGAGCTGA TGGGATAAAA CATAGTACTA TATCGAGGCA TTTTCACGCG ATTTAATCTC AACCACTTGT TTTAATGTTT TTAGGCGACG GAAAGTACTG TTTGACAGGT GGACACGATC GTTCCATTCG ACTATGGAAT CCGACTCGTC TTGACCCAGC ATTTCCTCCC CTGCCATCGG GCACCTCAAA ACATCGGGAT CAATCCATTG CCGTGGAAGC TCTTCCACGG GCTCTGCCAA TTCAGGCTTA CACGGACGGA TATACCCACC CCATATCGTC TCTTGCGTTG GACCAGAAGT CCAGCACTTT AGTGGCAGCT TCTGGAAATA CCCTAATAGT TACCGATGTC GTAACCAAAC GTGTGAAACG GCGATACCAA GGCCACGCTG GGCGTATTAA CGCGGTTGCT ATCAGCGATA ACTGTGAGAC TTTCTTGTCG GCCTCTTATG ATGCAACTGT TTGTGTTTGG GATGGACGTG CCAGTCGCAG TCATACTCCC ATTCAAATTT TTAAGGAAGC CAAAGATTCT GTGTCATGCA TGCATCTAGA CCAGACAGAT GGAAATGCCG TCATCATGGC GGGTTCAGTC GACGGCGCAG TGAGATCCTA CGACTTGAGA AAAGGACAAA TCCGGTGCGA TCAAGTTGGG GGTGCGATCA CTTGTATGGC ACCTACTTAC GACGACAAAT GTCTCGCAGT GAGCTGCCTT GATGGAACTA TCCGTTTGAT TGAGCTGGAT ACGGGTGAGT TGCTCAACAC CTATTCTTCT CACCACAAGG CAGGCATGTA TGGACTGGAA TGCTGCTTGA CTGCTGATGA CGCAACCATT GTCTCGGGGT CAGAGAATGG AGATGCTGTT TTGTATGATT TAGTTCGTGC GACGCCCATA CAGATTCTTC GGGGGCATCG CAAGCCCACC TGTACGATCG CTGCGCATCC TTCTGTTGCC CAAAATTCCG TCATACTTAC GGGAAGCTAC GATGGAAACG CTGTAGTATG GGCGCACGAA TCTTCATTCA TGAGATGGGA AGAGTGATTG TTTCTGTGTC ATGCGGGGAC TCAGGACCTA TTCTGTCAAA AATTTGGGTC TGGAGACAAT CATACATATG ATGAGACAAG CCTCATCCAT GACCACAGTT CTCCCACCAT CGCTTCGAAT TGGCATGCTG CTCCGCTTGC TGGTTGAATT AGCTGGTAAC ACGTCTCGCC TTTGTGCATC AAACCTGATG GACAGGATGC CCCAGAGATT TTCTAGGAGA TTATAGTTCT TGCTTCTCCG TCAAAAATTA ATGTGTATGC TTGCTATGTC AGTAAGCTAG ATGAACAAAG GCAGTGCCTT CGTGGATTGA TGTAGAGTTA CGGTAAATTG AAAGAACACA AAATGAAGAC AAA
|
Protein sequence | MIASNRDVGL ALPKTAVARL IGHDGPVQTL RFTGDGKYCL TGGHDRSIRL WNPTPLPRAL PIQAYTDGYT HPISSLALDQ KSSTLVAASG NTLIVTDVVT KRVKRRYQGH AGRINAVAIS DNCETFLSAS YDATVCVWDG RASRSHTPIQ IFKEAKDSVS CMHLDQTDGN AVIMAGSVDG AVRSYDLRKG QIRCDQVGGA ITCMAPTYDD KCLAVSCLDG TIRLIELDTG ELLNTYSSHH KAGMYGLECC LTADDATIVS GSENGDAVLY DLVRATPIQI LRGHRKPTCT IAAHPSVAQN SVILTGSYDG NAVVWAHESS FMRWEE
|
| |