Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35493 |
Symbol | |
ID | 7200877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 38152 |
End bp | 39504 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180161 |
Protein GI | 219118789 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0224729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCTG TGTTCGTTAT AGTTCCAAGT TTTACTTTCG TCAATCGCGA CAGAGACGAT GCCGAAGCTT ATACTGAAAG CGTTTCTCTC CCTGGTTCCG TTTCGTCAAC TCCGTCCCCA AGTGACAAAG ATCAGGAGCA TGAAGTGACA ATGCCCGTCA CGCCTTTTTC CTTTATACCC AATTTACCGC TGGCAATAAG TCGTTTCCAG CCATCACCTT CACCAACGAC AGCGATCCCC GAGGGATCCG TAACACCGCA GAACAATCCC TGGCATCAAT CACAAGCACC GACAACGCAC AACCCGACAG CAGAACCGAA CAAGGAGCTT CCTACAATCC CGGCCACCAT AGAGCCTGCA TTATCATCAT CTGTTACACC TGCTTCGTTG ACAGCGACGC CAAGTACATC ACCTACGTTT GTGCCTTCGA CAGCTTATGT TCACGAAGTT GTCTCACGCC TGAGCCGGCA GACATTCAAA GAAAACATTG AAACGCTTTC GAACTTTGGA GACCGAATAC AAGGTTCCAC GAGCTACAAC AATGCTGCTA ATTGGGTAAA ATTGCAGCTG CAAGACTATG GATACACTGT TCAAGAGCAC ACCTACACGT ACCGTGGATC ACCGCGCACA AATATTTTTG TTACCAAAGT AGGGTCTTCG CAGCCGGATC AAATGTACAT TGTGTCGGCG CATTTGGATG GCCGCGGCGG TGGTGGGGCA GCCAATGACA ACGGTTCGGG CTGCTCGCTT GTACTAGAAC TGGCGCGTGT CTTGGTGGCT AGAAGTGTCC AGCTCGATGT CTCGGTTCGT TTCATCTTTT GGAATAACGA AGAGACTGGT CTCAACGGCG CCCATGCGTA TGTAGCTGAT CGTTTGCCTT TGCAAGGAAT TGCTGTACCA TCAGGCGCAA ACGTTTATCC GGAGCCAACA TGGCTCGGTG TAATAACTCA CGATCAAATC CTTTTCGACC ATGGACTTCC CGTTGAACCC AATCAATCCC CTACCGCCGA CGTGGATATC GAATACCAAG CTAACTCGGT ATTTGCCCTA CAATCGCGTG CTCTTGCTAT TTCATTGAAG TCAGGCAACA ATCAGTTTGC CTCAGACTAT CCATCACAAG TTAGCAGCGA TATGTGTTGC ACTGACTCGG TGCCCTTCCA AGATTTGGCA CCTTCTGTCA GTATTCGAGA AAACCAGCGG CGGGCAGAGA TAGGGAATGG ATCCCAGCCG CATTGGCACC AGCCAACTGA CCTGTACACG ACCTTTGACG AACTAGATTT TCTGTTTGGT TTCAACATTG TGCAGACGAC GACTGGAACA ATCTGCCAAT TGGCTCGTTT ACATGGAATA TGA
|
Protein sequence | MGAVFVIVPS FTFVNRDRDD AEAYTESVSL PGSVSSTPSP SDKDQEHEVT MPVTPFSFIP NLPLAISRFQ PSPSPTTAIP EGSVTPQNNP WHQSQAPTTH NPTAEPNKEL PTIPATIEPA LSSSVTPASL TATPSTSPTF VPSTAYVHEV VSRLSRQTFK ENIETLSNFG DRIQGSTSYN NAANWVKLQL QDYGYTVQEH TYTYRGSPRT NIFVTKVGSS QPDQMYIVSA HLDGRGGGGA ANDNGSGCSL VLELARVLVA RSVQLDVSVR FIFWNNEETG LNGAHAYVAD RLPLQGIAVP SGANVYPEPT WLGVITHDQI LFDHGLPVEP NQSPTADVDI EYQANSVFAL QSRALAISLK SGNNQFASDY PSQVSSDMCC TDSVPFQDLA PSVSIRENQR RAEIGNGSQP HWHQPTDLYT TFDELDFLFG FNIVQTTTGT ICQLARLHGI
|
| |