Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38821 |
Symbol | |
ID | 7203581 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 284203 |
End bp | 285720 |
Gene Length | 1518 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182936 |
Protein GI | 219125330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTC CCTGTGGACA CCAATGGAAA AGAACACTAT TGTGTCCGAG ACTGCTTGCG ATGCGTTTAC TTCTCGTGAT CGGACTGCTG CAAGAAGCAG CCTCCTTTTC CACTGCTGGC TGTAGCAATC CGGCAAGTTA CAGAGCAAGA CCAGCACTTG GGCTTGTACC GTTGCTGCTG CCAACGACAG TACGGCTCAA CACCAAGGCT CGACCCGTTC TCGCTAAAAC TTTCCCGACC ATGCCTTGGT CAACACTCGC CGCCGGTTCG TCTGATGGCA TCGCTCTGAG CGATACTCGC CCTGAGAAAC TGTGGCGGTT CTTCTGGCGC AGCTTATGGA AGGGTATGAC CTTGCCATTT CCGGCCCTAC GTAGCATGGT ACTCGACGTA TCCCAGCCCC AGAACGCTTC GGTGGGTTTC CGGATCCGCG AAAGCCTAGC GGCCATCGTG GCGTATCTCG GCATCGGTGT ACTCGCCTAC TATTGCGTCC TGGAACCGAC CTGGACCGTC GTAGACGCCT TGTACTTTAC CGTCACGTGC TTCACCACGG TCGGGTACGG GGATCTATGC CCTTCCACGC CGCAGTCACA GACCTTTACT GCGCTCTTTG GGATCTTGGG TGTCGCCTTT TTGGGAGCAG CCCTCGCTAC ACTTTCGTCC AAGCTCGTAC AGACCCAAGT CGAAGTACTC CAGGCTGTCC GAGAAACTTC CAAACAACGC ATCAAGGCCC TGTTTGAACA GGTGTCGCCT CTACCGATGT CTGCAGCGAC AACATCGACA GCGACAACGT CACATGTTTC CCAAAGCAAT TGGCAAAGCC CCGACTCTAC CCGTACCGCG GATACCGTGC TACTCTGGCG TCGCGTAAAT GCTCTCGTGT GGACACTAGT TAGGCAAATA CTTCCGCCCT TGCTGATAAT TGTTGGTGGA GCGTGGCTCG TCCACCACCT GGATGCTCCC ACGATGATGA CTCGACCATG GCGTGATGTC GTGTACTATG CCGTTGTCAC TGGTACGTAT AATTTGGGTC AAGAGGCAGT GCATTAATTT GCCGTGAGTT TTTGTTTTAC TGTCACACAC AAAACCCGAC GCTAAAGCAA TTGCTTTGTC ACCATTCCAG CTTCCACAAT TGGCTTTGGG GACATTTGCC CCGTTTCCCA ACGCGCAAAA CTGGCGGCGG TCGTGTATAT TCCGCTGGCT GTCGCTGCCG CGGGAGAGCT ACTGTCTGGT GTGGCCACAC GCATACTGGA ACGCCGGCAA AAACTCGTCT ACCGACAACA GCTCCTCGCC GATCTGACGA TTGACAATCT GAAAGCAATG GATGCGAACG GGGACGAGAA AATATCGCGA CATGAGTACA TCCAATTTAT GCTGATTGAA ATGGGTATCG CGGATCAGCA AGAGTTTAAC GAGCTGCATC AGCAATTTGA AAAGCTTGAT GTGGACGGAT CGGGCTTCCT CGATAAGAGA GATCTGGTAA AGATGGCAAG ATCCCGTGGT GCCAATGTCA AAGACTAA
|
Protein sequence | MTTPCGHQWK RTLLCPRLLA MRLLLVIGLL QEAASFSTAG CSNPASYRAR PALGLVPLLL PTTVRLNTKA RPVLAKTFPT MPWSTLAAGS SDGIALSDTR PEKLWRFFWR SLWKGMTLPF PALRSMVLDV SQPQNASVGF RIRESLAAIV AYLGIGVLAY YCVLEPTWTV VDALYFTVTC FTTVGYGDLC PSTPQSQTFT ALFGILGVAF LGAALATLSS KLVQTQVEVL QAVRETSKQR IKALFEQVSP LPMSAATTST ATTSHVSQSN WQSPDSTRTA DTVLLWRRVN ALVWTLVRQI LPPLLIIVGG AWLVHHLDAP TMMTRPWRDV VYYAVVTAST IGFGDICPVS QRAKLAAVVY IPLAVAAAGE LLSGVATRIL ERRQKLVYRQ QLLADLTIDN LKAMDANGDE KISRHEYIQF MLIEMGIADQ QEFNELHQQF EKLDVDGSGF LDKRDLVKMA RSRGANVKD
|
| |