Gene Information |
Locus tag | PHATRDRAFT_45068 |
Symbol | |
ID | 7200079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 107320 |
End bp | 108928 |
Gene Length | 1609 bp |
Protein Length | 479 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179139 |
Protein GI | 219116689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGCAGC CCACCGCTAC GTTAACCATG AATTTCGCTC TCGCTTGCCA AGTCTTCCTT GCTTCTTTTG CTTTGGTCTC TGCCCATGAC ACCCGGGGAC GAAGGCTACG TTTGCCGAAG CGAGTCAAGA TAACGAAGAT GGACTTGCCA GAATTCATGC CAACTGAGGA TTTGCCCGAA ATCACGCCAA CCGAGTATTT GCCCACCGAA TTTCCCTCAC TCACTCCTGA TAGCGAGTCA CTCGAGTTAG CAAGCGAGTC TACTTCCCTA GTCCCCAGCG ATGCCCCTTC AATGGTCCCG TCGGATGCCC CTTCAATGGT TCCCTCGGAT GCACCTTCAA TGGTTCCCTC GGATGCACCA TCCATGATCC CCTCAGACGC TCCTTCCATG GTTCCAAGCC CTTCTCCGAC CCCTTGGCCA GATGTCAAGT GGGCGCCCAC ACTGAACGAC ATTAACAGTC CTGGTGAGGC TCCTGATTTT CTGTTTGGCT CTACTCTTTC CATGTCCAAG GATGGGTCTA TCGTCGCTGT CCTTACTGGA GGATTCAATG GAACCGTGAG TGTATACGAG ATTAATTCGG ACGGCTGGGC CCAAATCGGA GAGCCCATTC CAGCTATCAA TAACCAAGGT ATACAATTGA CGCCGGATGG CACTGACTTA CTTGTGCACG GTCTAGATGC GCAAGTATTT CAATTTGTAG ACAACGCGTG GATTCAGGTT GGTGAAGACA TACCATCGCA ATCAACTGAC TCCAGCAGTT CTATTTCCGA CGATGGACTG ACAGTGGCTA TCGGCCTTCC TGGGGCCACC ACTCTCTTTT CATCGGTGGG CTCGGTTGAG GTTTATTCGT TCGATGGCTC CGCCTGGAAC CTAATACGCA CAATTGATGG AGATGAAAGC GACGATCGAA ACTTCGGGCG TCACTTGTCT CTTTCCGGCG ACGGCAACAC TCTCTCCGTT TCTGGTCGCG GAACGTCTTC CATTGCCAAT GTGTACGTCA AAACTTACCA TATTGCTGAC ACTGTCTATC TGCTGGGCAG ACAAGACGCT GCACCGAACA ATGGCAACTA TATAGTTCCC TCGCTTTCCA GAGATGGAAG CCGCTTGGCA ATTCTCGATC CGGCAGGAAA GGTGCGGTCG TTCAAGTTTG AAGCAGCCGA GTGGAAAGAA GTGAGCAATG GGTTGCCCGA GAGTTTTGAT CCAAGGGTCG TCCGTGGTTC CACGGATGGT ACCTCCCTGA TCATGTGCAA TCCCAATTTG GGAACCGTCA AGGTTTTCGC CCTGGATGAA GAAACCTGGG TTGAACGAGG CGAAGACATT GGTGATGCCG AGATAGTCCT GTTGGGACGC GACTGCGCAC TTTCGGGAGA CGGTTCCATG GCCGTCGCTA GTGGATGGAT TATGAAAGGA ACGGCGGATG GTGTCATTGG CTTTTGGCAA GCGGAGGTTG TGGCAGACGC AGAATAGTGC ACGTCGCCCA GAGTTGATTC GGCAAAGGCA ATGACACGTT CCAATCATAG TTAGGTGCAT TTGCGCTACT CTTAGAGGTT ACAGCGTCAC GTAGTGATGC ATTCTATACA TTTCCTATTA CTCAAGATTG ATTTGTGAC
|
Protein sequence | MNFALACQVF LASFALVSAH DTRGRRLRLP KRVKITKMDL PEFMPTEDLP EITPTEYLPT EFPSLTPDSE SLELASESTS LVPSDAPSMV PSDAPSMVPS DAPSMVPSDA PSMIPSDAPS MVPSPSPTPW PDVKWAPTLN DINSPGEAPD FLFGSTLSMS KDGSIVAVLT GGFNGTVSVY EINSDGWAQI GEPIPAINNQ GIQLTPDGTD LLVHGLDAQV FQFVDNAWIQ VGEDIPSQST DSSSSISDDG LTVAIGLPGA TTLFSSVGSV EVYSFDGSAW NLIRTIDGDE SDDRNFGRHL SLSGDGNTLS VSGRGTSSIA NVYVKTYHIA DTVYLLGRQD AAPNNGNYIV PSLSRDGSRL AILDPAGKVR SFKFEAAEWK EVSNGLPESF DPRVVRGSTD GTSLIMCNPN LGTVKVFALD EETWVERGED IGDAEIVLLG RDCALSGDGS MAVASGWIMK GTADGVIGFW QAEVVADAE
|
| |
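The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not the database's own method: the function names `gene_length`, `gc_content`, and `cds_length` are hypothetical, coordinates are assumed to be 1-based and inclusive, and the short sequence shown is only the first 30 bases of the Gene sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def cds_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


if __name__ == "__main__":
    # Values copied from the record above.
    print("Gene length:", gene_length(107320, 108928), "bp")
    print("Coding length for 479 aa:", cds_length(479), "nt")
    # Only the first 30 bases of the Gene sequence field, for illustration.
    prefix = "TTGCAGCAGC CCACCGCTAC GTTAACCATG"
    print("GC content of prefix: %.1f%%" % gc_content(prefix))
```

To check the reported GC content, pass the full Gene sequence field to `gc_content` in place of the 30-base prefix. The coding length for 479 aa is shorter than the gene length, which fits the listed Gene sequence containing bases before the first ATG and after the stop codon.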