Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50093 |
Symbol | |
ID | 7198688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 418834 |
End bp | 420591 |
Gene Length | 1758 bp |
Protein Length | 479 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184874 |
Protein GI | 219129392 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.333665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTAGATGAC GACAGTGCAG GATTCATTTT CTCCGGCGCC CCCACCTTCC TGGAAGTGCA GTTCACGCTA CTGGGACCTT TTTGAGATGC CAGCAGAGAG TGAATGGAAA CGAAAGCGAA AAGCTCAACC TTTGGTGGGC CATTCTAAGA ATTGGAAAAC AACACAACTC CTAGCGTAGA TCACAGCTGT CTCTGGCCAC TGTTGCCTGC CAGAAATATT TTGGGCCCAC TTAGGACGGT TGGCGCAAAC AAATGACAGC GTCGAAGAAT CCAGACCATG GCCAGTCTCC GAACCTCCCC GATCACCAAC CAGCCGATGC TGACCCCCGT CCTCCTTCCG ACGCACCTCC GGATCGATCG ACTCTTCCGA AACCTACGAT ATCGCTGGTA GAAGCTTCTC CGAGGGACGT GCTGTTAGGA CGTGGGCGAG GAAATGGGGG TAATAGCGGG AACCAAATGT TTCAGGCTCT GATAATTGAG AATCTCGCTC GCTACCACAG CGCCGCAAAC CGCCCCGAAA AGACCCAAAT CATCAAGGAA ATCCTGGTAG CAATCAGCAA GGCCGGTGGT CGCTTTCTGA AGCCGGACAA AGTTCGCGGT GGCCTTGTTA AAATCTCTGA CGCTGAAGCA CGGACTAAAG TAGGACAAGC GATCCGCTAC CGTACTGTGA ATCAAAAGGA AGTCCCCGGG AGAAGCACAG CCAGCGTTCC ATCGACATCC GAACAGAGGC GAGAGTACTT ACTCGAGTTT GGCCATTCTG ACGGCGACAT TGTAGATCGA AAGGGGTCGG GGCTTGGACT CCGCAGCCGT GGACGACAAG AGAACGTATT GGCACAGTTC CCCAGCAGGC GCTTCCATCA ACAACATCTT CCGAACCCAC TTACACATCA TTTACACCCA TTCGGTAGAA ACATCGCAGA TTTCGACTCG TTCCGTGATC CCGATGGACA AACGCTCCCC CCACAAGCAC TCAATCTAGC AGAGCGGAGT GTGGATGTTG TTGGGGATAT CGAGCCTCGG CCGCTACCTC CTTCTCCAGA GTTCCTTCAC CAACTTGGAG CTCTTCCAAC AACTCAACGG TCAAGATACA TATTTGATAT TGACTCCTCC TCGGAACCGA CTCCAAGGGC GTCGTCACAC CACATTCAAG AAGGTTCCTC ATTTGTACGA GAAACAATAT CCAATGTGGA CATTCTTTCG GTCCCTAGAC AGTTCTGCTC TCCATTTGAA AGGCTCAATC CAACTTCACA ATCTCATCTC CCCCATTCGC TACCGTCAGC GAATATCCTG TCGTTTCCTG ATTCCTCCAT GGGTATGGCC CTCAGGTCTA CCTTCGGGCA ACCCTCAGCG CAGTCGCAAT ATCAACGCTT TGCATCTTTG CAACCAGCTG CCTTAGATAC CTCGAATACG AGATTTAATC AAGCTGAGAG CCTCTCACCA TCTCGGCAAC AACTCTTGAT GATCCAGCAA CATCAAGAAG AGAGACTTTG GAACTCGCAG GAATTTGCAG ATCCGTCAAA CATCTATCGT GCATCTATGC ATCCGAACTC TGGTAGAATA CCTGACGTGG ACACAACCGT CCCGCAAGAT TTGTCCCGCT TTTATCGACC CAGCCTCCCA AACATGCAGT CCGAGGCTGG CCCCGAAACC GACTACCACT CTTTGTTTAC AGAATTCGAC AGGAAGAATT GAAGCCGTCC ATTGTGAGTT CTGGCAATTA TGCTGATTCT AAACACAGTA GCAACTGTTG TTTGTGGC
|
Protein sequence | MTASKNPDHG QSPNLPDHQP ADADPRPPSD APPDRSTLPK PTISLVEASP RDVLLGRGRG NGGNSGNQMF QALIIENLAR YHSAANRPEK TQIIKEILVA ISKAGGRFLK PDKVRGGLVK ISDAEARTKV GQAIRYRTVN QKEVPGRSTA SVPSTSEQRR EYLLEFGHSD GDIVDRKGSG LGLRSRGRQE NVLAQFPSRR FHQQHLPNPL THHLHPFGRN IADFDSFRDP DGQTLPPQAL NLAERSVDVV GDIEPRPLPP SPEFLHQLGA LPTTQRSRYI FDIDSSSEPT PRASSHHIQE GSSFVRETIS NVDILSVPRQ FCSPFERLNP TSQSHLPHSL PSANILSFPD SSMGMALRST FGQPSAQSQY QRFASLQPAA LDTSNTRFNQ AESLSPSRQQ LLMIQQHQEE RLWNSQEFAD PSNIYRASMH PNSGRIPDVD TTVPQDLSRF YRPSLPNMQS EAGPETDYHS LFTEFDRKN
|
| |