Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48597 |
Symbol | |
ID | 7194752 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 323325 |
End bp | 325268 |
Gene Length | 1944 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183073 |
Protein GI | 219125619 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.100939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATACATACAC ACGTTCGCAT CCATCGGTGG CCAGCACTCA TTGTGGACTG CGCGACAAAT GATGAAAGGC ATCGTACCGT CCCAGAGCAC AACGACTCTC CACCGTAGCG ACAACGAAAA GTATCTTTAC CTCATGAGCA GCGACAACGA TCACGACGAA GGACACAATG ACGAGGGCAT CGGCATTGGC GGTGGTACGG AACAATCGAC GTCATCCACC ATGTCACCTA ACAGTCAGAA AGCAGGCCAT GCCCAGCAAA TGTTTGAAGA CGCCAACAGC GACAGAAACG TAGCGAACAC TTTATTGCCC ACAGCGTCTC TGGCGGAACG CACTTTGTCG CAATCCGGCG TTTTCGCTAC CAACATAACC ACCAACGATA ACACCAACCA AGAAGAGCAG GATCACGTGG CGTTGGCCGC AGCGGCAGCA ACAGTAAACG CAGCCCTCCA TCCAGAACAA CCGCATCCTC CCCAACCCGA AGTCGAATTG GCCGTGGCGG CGGCGCACGC CGTGACAGTC GACGGACAAC ACCATTCCTT GCTCGACGGT GCTACGACGG AATCATCACT TCCTGATCAA GCGGAGAAAC GACAAAGTGC AGGGAAGAAA CGGGTCCGAC GTGTCGGAAA CGTCGCTGAG CCTCGGAAAG AACACAAAGG ATTCCATTCG TCCGAGCCGC CACAAACGTC TCGACCATCG TCCCTACCAC ACCAGCCCAC ATTCGACTCA CAGACGGCCT TGCCACAGCG GGCCCTACAC AATACTCAAA GCGTGGCAGC GGCGGCCTCT GCGGTTGCTA TTGAAGCGAC TGGCAATTGC AAGGGGCAGT CCAAGCACGA CGAAAAGTGG AACGCCATGC TAGAAAAGCT CATAGCTTAC GAACGGGAAC ATAAATCCAC CATGGTACCT CAGTGCTATC ATCTAGATCC GCGACTTGGA AGGTGGGTTC ACTATCAACG AGGTACGTTA CGGTGTGTTT GACAGGGATA CTCGGTACAG CACGCGGATT TTGCTCGATT CTCGTTCCGA CTTACTTTTG AAAAATGTGT GTTCGATTTA CGATAGTGGA GTACTGGTTG TACCAATCCT CGGGCAAGGG AAAAATCAAT ACCGATCGCA TTGCGCGTTT GGAAGCGATG GGCTTTGAAT GGGATCCACA GCGGGCGCAA TGGAACCTCA TGTACGACAA ACTGCTAGCT TTTGCGGCCG AAAACGGACA CTGCAAGGTC CCCAAGGGGT ATCAGAAAGA TCCGGAGTTG GCCAACTGGG TTCGGAATCA GCGCCTCGAA TTTGCCAACC TGCAGCGCGG ACGTAAAACA CGCATGAACC AACTTCGTCT CGACAAACTC AATGCAATTG GTTTCAAGTG GAGTACGTCC ATGCCGTTTC GAGTGAAGGG TGATTCGGCA TCGAGTAGGG TTGACAACCG TACCGACCCT GCCAGCGATG GCGACGCCAC GGACGCTAGT ACGAATGCGG CCACATCCCA CGTTACCGCG AAGACGCACC AACAACTTGC GTCAACAAAT GCGGCAGGGT CTAGTGTTGT TCTTGCCGAG CATGGAAGAG TCGAAACGTC CGAAATCTTG CCACAAACCC AATCTATGCA GGAGCAAACG GTTTCGACTA TATCACACAA CCCTGTGGAC GAGCCCGTCA CTACAATCGA ACAGAAGAAC TTACGCGGTT TCGCATCCGA ACGGCAGCAG CCACATGATA CACGGGAGGA AACGAAGCAT CCGCGGAACG AGGATTTGGA TGGTACCGGT TTGGAAGACG AGCCGTCGCT CAACTTTGAC TTGATTTAAT AGCAAACACC GCGAGGGCTA TGTCGAAATA ACCGCTATCA TGCTCAAAAA TCTTTCTCTT TAAAGCGAAG AGTTGATTGG TAAAGTAGCA TCTATGGAGA CAATACAACA CAACTTTTTG ATATAGATTG TAGC
|
Protein sequence | MMKGIVPSQS TTTLHRSDNE KYLYLMSSDN DHDEGHNDEG IGIGGGTEQS TSSTMSPNSQ KAGHAQQMFE DANSDRNVAN TLLPTASLAE RTLSQSGVFA TNITTNDNTN QEEQDHVALA AAAATVNAAL HPEQPHPPQP EVELAVAAAH AVTVDGQHHS LLDGATTESS LPDQAEKRQS AGKKRVRRVG NVAEPRKEHK GFHSSEPPQT SRPSSLPHQP TFDSQTALPQ RALHNTQSVA AAASAVAIEA TGNCKGQSKH DEKWNAMLEK LIAYEREHKS TMVPQCYHLD PRLGRWVHYQ RVEYWLYQSS GKGKINTDRI ARLEAMGFEW DPQRAQWNLM YDKLLAFAAE NGHCKVPKGY QKDPELANWV RNQRLEFANL QRGRKTRMNQ LRLDKLNAIG FKWSTSMPFR VKGDSASSRV DNRTDPASDG DATDASTNAA TSHVTAKTHQ QLASTNAAGS SVVLAEHGRV ETSEILPQTQ SMQEQTVSTI SHNPVDEPVT TIEQKNLRGF ASERQQPHDT REETKHPRNE DLDGTGLEDE PSLNFDLI
|
| |