Gene Information |
Locus tag | PHATRDRAFT_39115 |
Symbol | |
ID | 7194876 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 366606 |
End bp | 367983 |
Gene Length | 1378 bp |
Protein Length | 409 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183082 |
Protein GI | 219125637 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.219892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCCTC CCGTCGTTGT CGAGTTTCGG GTCCCCCTTG CCAGCGAGAA CGACAGCATG ACGCATCCGC ACTTTCGGAC AACCCCGCAT CCGACCAAGC AATGCAAGCC CGAGGGGTGC AAAAACCGCA GTGTCACCGA TACGGAGGCC CTGACTACGT TGGAGTCACA CCGGACCGAT AGTCCGAGTC GGGATCCGCG AAATACGCTG ATATCTATGG TACGTCTGTC GTCCCCCCAA GAATCTACGA AAAGAGCGCG GGATGTGTTG CGACTACGCA ACAATGGTTT CCTGTGCACG AGCTGTTGGA AACACACGAG AGATTCACAT CTCACCGGAC CTGACTTTAT AAATAATCTG ACTGCAGAGC GTCCATGGAG CGAGTTCGCA GCCTCAAGGT GTCGCCGATT CCGACGTCAA CGATCCTCTC GATACGGAAA GTCATGCAAC CGACAACGAG CAGGTCACGG CGGCCGATAC CCAATGGCTC GAAATTTGCG CTGAAATTCG CCAAGACAAT CAATCTGTCT CCTTAATACA ACTTGAGTCC TTTCAAGATT GCTTAGAGCG TCGCGAACGA GCCGCTTACA ATATTAGCCA AGCTTTTGAC CATTGTCAGC AAGGACTGGA AACTGCCGTT ACTGGCATGA TCCACAATGT GGCCGTTCCC GTACACGACA CTTGGCAAGA AAGTCTCGAG ACGTTGGAAT CCGATATAAC GCGCACCCTT GTGTCCAATC ACGAACGACG TCAACAGTTG CTGCTAGCTT TGGAAGCTTC CCACGCGGCT TGGCAAATGC AGTATTCTTC GTTGGTTGGA AAGGTTTTGC CGCAACCTCC ACAGGTACAC CCATCGTCAG ACATACCGGA TTCATGCACG ACCTCGTGCC GTGAGCTCGA AAGCCCTAAA GCGATCGTAA AACATGTTGA CGACGGTGAC AAGAAGGAAC CGAACTGGAC AGAGCTCGCC AAGTATCAAC CCACCCGTAC CAACCTAGAA CTCTTTTTGC AAGGCCGAGA TCGTTGGCAG CATGCGCACG GCCGATTTGC GCAAGCACTG GACGAGATCT ATGCCGACCT CCAATCGGAC AATGAGCGAA TTCTCCAAAC CGCCTTGGAT ACGCATGATT CGTGGCAAAA GACATTGGAC GAACAGCAGC ACGACATTCA ACTACAGTTG GCCTCCAACG TTGTGCGCCG ACGAGAGCTT CGGAGGGCTT TGCAAGACTC AGCCAAGCAT ACCCAGGGAA TGTTCGCGTC CCTAATGGCA CGCGTAATGT CTTGTTTGCC TTCGCAATCC GACGTCTCAA CGGATAAGGC ATCAGTCAAC AAGAAGCGTA AACTAGAATC TCCCGCCAGA CGTTCGCTCA CACGTTGA
|
Protein sequence | MSPPVVVEFR VPLASENDSM THPHFRTTPH PTKQCKPEGC KNRSVTDTEA LTTLESHRTD SPSRDPRNTL ISMSVHGASS QPQGVADSDV NDPLDTESHA TDNEQVTAAD TQWLEICAEI RQDNQSVSLI QLESFQDCLE RRERAAYNIS QAFDHCQQGL ETAVTGMIHN VAVPVHDTWQ ESLETLESDI TRTLVSNHER RQQLLLALEA SHAAWQMQYS SLVGKVLPQP PQVHPSSDIP DSCTTSCREL ESPKAIVKHV DDGDKKEPNW TELAKYQPTR TNLELFLQGR DRWQHAHGRF AQALDEIYAD LQSDNERILQ TALDTHDSWQ KTLDEQQHDI QLQLASNVVR RRELRRALQD SAKHTQGMFA SLMARVMSCL PSQSDVSTDK ASVNKKRKLE SPARRSLTR
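The derived fields in this record (Gene Length, Protein Length, GC content) can be rechecked from the coordinates and sequences. The sketch below is a minimal illustration, not the database's own method: it assumes Start bp and End bp are 1-based inclusive coordinates (which matches 367983 - 366606 + 1 = 1378) and that the spaces in the sequence fields only separate 10-character display blocks. The helper names `gene_length`, `strip_blocks`, and `gc_percent` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Coordinates taken from the record above.
print(gene_length(366606, 367983))  # 1378

# First two display blocks of the protein sequence, used as a small example.
print(len(strip_blocks("MSPPVVVEFR VPLASENDSM")))  # 20
```

Passing the full Gene sequence field to `gc_percent` and the full Protein sequence field to `len(strip_blocks(...))` gives values to compare against the GC content and Protein Length rows. Because the gene is spliced, the genomic length is expected to exceed three times the protein length.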