Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44584 |
Symbol | |
ID | 7198090 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 973374 |
End bp | 975109 |
Gene Length | 1736 bp |
Protein Length | 533 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178614 |
Protein GI | 219115637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACAATGAA CACTGGTAAT GGATGGCCGA ACGGTGGAAC AATGAACGGA AGCGGCTCAG GCGTCGTTCG CTCAAGTAGC AGCGCAGGAG ATCTAGGTTC CAGCCCCGTC GTCAGGGCAA CCGAAGGGTG GCTCGAGCTG TTTTTGACTC CCCCCGAAAA ATACTTCTTC AGTCGATGGG AAGAAATCAT TCGGGATCGG TCCCTAGTTC ACAACAACAC ACGATGGATA TTTCAGTATT TGGCAACCTT TCTTCCAGAC AACCTGGCTC CCAATGTCAT TACACTAGCT GGTTTCCTAC TTTTGGGGAA TGCTTGGTAT ATTACGAACA CATATGGGGA ATATTATCCT ATCGGGTGTA AGTGGAAATC ATGCATTACT CTTGAATCTA TCTTTTAATT ACATTGCTAA TGTATTAACT TTTGCTACGC AGGTACGGTA CTGTGCTGTA TTCATATTTT CCTCTTTTTC ATTTTCAACT CGTTGACCGG GCTGCATGCC GATCGAATTC GCCAACACAC TGCATTGAGC GATTTTTTCA AATATGCTTG CGATTCTGCA TCTACTGTCT GGCTGCTTAT GTTGACTACG TACTGCTTAG GTGCATCATT GGAAACACAG TGGTACGCAG TGCAGGCCAG TCAGCTGGTC CTGCTCCTCA AGCACTTGAG CGCCTTCAAA CGGCACGCTG GCTTGCGTTA TTCGCCGACA GCAGGACCTG GTGAAGTCCT GATGACATGT AACACTTTGC TTGTTGCCCG GGTGGTGTTG GGTTTAGACC ATTTGCAGCA AATATACGAC AAAGTCATGT TCTCGGTCAT GGGTATTTTG GATATGGAGT ACGAACTGAC TGGGGGGGAA CTGGTACGAT TGACATACTA TACCTTGTTT GTCGCATCCG TCGTACAAGC ATTGCGCTTG CAAGAACCCC ACGGATGGAC CAGATTTGGC CTCACTTCCA GTTTAATGAT GCGCCTGGTA CCAGCTCTTC TTCTTCAATG GGATGTGTCG TTTCCTTTGA TCGTCATGGA TGTTGTTTGC GACGGCTTCT TCCTGGCGGT CCTCACGAGC GATCTTATCC TTGCAAAAAT GTCGAATCGT GAATTGCATG CATGGGTAGT CTTGATGAGT TTGGCTGCGG TTCTGAGTTA CTCAACTATT CTAATTCTTG TGGCGGTGTA TTATGTCGCT GTTTTTTCAG ACCTGTGCTT CTATCTGAAT CTTCCGCTAC TGGCAACATG TCGGAATGTT TACTGCGATG GCATTTACGA CTTGTGTCAT ATTGGACACA AACGAGCATT CCAAAATGCA CTAAGTTTGG GGAATCGCCT ACTTGTGGGA GTGGTGGGAG ATGAGGATGC GTCGCATTAT AAGCGCCCAC CGGTCATGTC TCACGCCGAG CGCTGTGCGG AGGTGGAAGC ATGCAAAGCT GTTACCAAGG TAATTCAGAA TGCTCCTTGT TTTGGACTGA CTCAAGCGTT TCTAGACGAG CATCAAATTC ATGTCGTTGC CTTTGGAGAG GAATATTTGA CGAAGTACAA GAACCCTGAC GATGATCCCT ACTACGGATA CGTAAGAAAG ATAGGAATTG CCTACCCACT ACCAAGAACA AATACCCTGA GCACAACAGA TCTTATTGAA CGCATTCACA AAGCCTCTCT GGAGAAAAAC TCGCCGACTT GAATATACAC TGATGCAGAA TCTTGTCGAT TAATGTAAAT GTGTTCATGT GCAGTC
|
Protein sequence | MNTGNGWPNG GTMNGSGSGV VRSSSSAGDL GSSPVVRATE GWLELFLTPP EKYFFSRWEE IIRDRSLVHN NTRWIFQYLA TFLPDNLAPN VITLAGFLLL GNAWYITNTY GEYYPIGCTV LCCIHIFLFF IFNSLTGLHA DRIRQHTALS DFFKYACDSA STVWLLMLTT YCLGASLETQ WYAVQASQLV LLLKHLSAFK RHAGLRYSPT AGPGEVLMTC NTLLVARVVL GLDHLQQIYD KVMFSVMGIL DMEYELTGGE LVRLTYYTLF VASVVQALRL QEPHGWTRFG LTSSLMMRLV PALLLQWDVS FPLIVMDVVC DGFFLAVLTS DLILAKMSNR ELHAWVVLMS LAAVLSYSTI LILVAVYYVA VFSDLCFYLN LPLLATCRNV YCDGIYDLCH IGHKRAFQNA LSLGNRLLVG VVGDEDASHY KRPPVMSHAE RCAEVEACKA VTKVIQNAPC FGLTQAFLDE HQIHVVAFGE EYLTKYKNPD DDPYYGYVRK IGIAYPLPRT NTLSTTDLIE RIHKASLEKN SPT
|
| |