Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46385 |
Symbol | |
ID | 7201764 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 176925 |
End bp | 178836 |
Gene Length | 1912 bp |
Protein Length | 476 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180959 |
Protein GI | 219120442 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGAGATTCG AACTTTTGTC GGAAACATCG AATCTGCTAT CCGCCATCTT TCAAAAAGAA CCGAATCTAC TCTATCTTGC TAGAAAAGAG GCGTACGCAG TTATTACGTA CTTGCAGCAA ACCTCTCCTG CGGCGTAGAA TAGCGGGCGA CTCATAATCA GCGATTGAAA CATCAAAGGA CAAAAAGTCA CTTCATTCTG TCAGTTTCGG CGGAGCTCGA AGTCCTCTCA TTGCATCCTA ACAAAATTCA GTCATGTGGA TACTTTACTT CCTTTCAATA GCTGCATTCG TTGTCCGCCA GACTTCAGCT TTTTCTCCGT CCCCTACAAG ATTGGCCTTT CGGTCTCCTC ATACCAGCGG TAAGGCGGTA CCGCCGATTC TTCCGAATGA GGTTTTCACT TCCGTGTCCA ACGGATTGTC TTCCACTTAT GATGATATCC CAGTCAACAA TCGCCACGCA GCTTCCGACT GGTTATACAA CGTCCGATCC TTTCCACAGT CGAAGGTTTT ACGGGAAATC CGCAACCCTC TTTTTTGTGT CGCTGGATGG TCTTTTGCAG TTTCACTAAT CCAACGTATT TTTAGCACTT CGAGCTCAGC GCCTCTTCGA ATCCTCGGGG AAAGCATTTG CATTCCTACA GCAGCCCACT CATTTTTGGT GTCATCTTTG GGATTGCTCT TGGTGTTTCG CACCAACAGT GCTTACCAGA GATTTTACGT AAGTCCTCTT TTACTCCGAT CGAAAGCATC CTTCTTTTGA AAGCGCACCT TAGAATTTCC TCTTACACTC GAATCCTCCG TCACTATAGG AAGGGCGCAA AATTTGGGAG AATATTCTTA GCGTGTCCCG AAATTTTTCT CGTATAACAA GGCTATACGC GAAAGAGGTG GGAATGGACC GCAAGGTGCG AATGATGAAT CTGGTAGCTG CATACCCTTA TCTCTTGCGC CATCATATTC GGCACGGTTG CCTCTGTGAG GAAGCCGGGG AACGTATTCC CGAGGAGCAT CGCCTCTTGC TAGAAGACCC CCTAAAGACT CTCGAGACTC GCTTCGAGGG TGATAAGATT GACGGGCTGA CTAGGTCCAA TGCTTCGCCT TCACTCCCGA GAGACAAATG CTATGTCGAC AAGCGTAAGC TGCCATGGAA TCTGTTTGAT TACTTCTCTA CACAGCGCTT GGCGCGAACA CAGAATCGTC CTTTATGGGC ATGCGACCGT ATCGGACGTG AGATCATGGC AATTCCTTAC GGTCCGAACT TTAGCAGCCG GGAGCGTCTA GTAATGTTGA CAGCTGTCGA CAAGCTCACG AATGCGATTG GTGAATGTGA GCGCATCCAT CAAACAGCTG TCCCGCTGAA CTATGCTCGT CACTCGCTAC GCTCGCTGAC TTTGTTTCTG TTCACGCTCC CTTTCGCCCT GGTTAAGGAC ATGGGGTTTT TGACTGCACC AGTGACAGCC GGAATCGCTT GGCTAATGTT TGGAGTGTAC CAAATCGGAT ACAGCATTGA AGACCCTTTT CAAGGATCCC TACGACTTTC GAATCTATGT GATGCTATTC GAAAGGACGT GGTCGGTTCG ATTGCGGAAG ATATGGAAGA TAGCTACAGT TCGAATCAGC TAGGCTTGTG GGATGAAGGT GTTGACGAAG TTACTTTAGC AGAACGCGCT GAAGACTTTC TGAAGACCCC GATGATCATC CCTACTTTGC TCGATCAAGG AATTTCGAAC CTAACCTCAT CAGCTTCGGT TGATGCCAGT AAGCCTCCCT ACTGATGAAG GTCGCATGCC CTTGAAGCCT TTCCCTAGCA TCATATTCTC AACACGATTT CTCTTGCACA AAGCCAAAAC AAAATAGAAT AGACCTATAT TAGATGGATA TAGAAGCATA AAATAAAACT ACAAGTTATT CG
|
Protein sequence | MWILYFLSIA AFVVRQTSAF SPSPTRLAFR SPHTSGKAVP PILPNEVFTS VSNGLSSTYD DIPVNNRHAA SDWLYNVRSF PQSKVLREIR NPLFCVAGWS FAVSLIQRIF STSSSAPLRI LGESICIPTA AHSFLVSSLG LLLVFRTNSA YQRFYEGRKI WENILSVSRN FSRITRLYAK EVGMDRKVRM MNLVAAYPYL LRHHIRHGCL CEEAGERIPE EHRLLLEDPL KTLETRFEGD KIDGLTRSNA SPSLPRDKCY VDKRKLPWNL FDYFSTQRLA RTQNRPLWAC DRIGREIMAI PYGPNFSSRE RLVMLTAVDK LTNAIGECER IHQTAVPLNY ARHSLRSLTL FLFTLPFALV KDMGFLTAPV TAGIAWLMFG VYQIGYSIED PFQGSLRLSN LCDAIRKDVV GSIAEDMEDS YSSNQLGLWD EGVDEVTLAE RAEDFLKTPM IIPTLLDQGI SNLTSSASVD ASKPPY
|
| |