Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49993 |
Symbol | |
ID | 7198777 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 67856 |
End bp | 70742 |
Gene Length | 2887 bp |
Protein Length | 756 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184822 |
Protein GI | 219129284 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00340671 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTGAGCCG CATACCATCG GCAAAAATTC CAATCATGAA CTGATGCTAC TCTTGTACGA TTTTAGTCGC TCAATGCTAC AGTTTGGGAG ACACAGCTCC TCCTCGATAC TCCGTCCTCC CACGGATCAC GATGGTACAA TGTCGAGTTG CACCACATAA AAAAACGGTT GTATTCTTTA CTATTAGGAC CAGGCAGGGA CCTAGACACG AAGGATCGAA AGTAACCATC CCTTGAGGAA AAACGCAAGA AAGCTCTTTA CTACTGTCCG GAGTCAAAGA ACGGTCCAAT TTGTGCAAGA TATTAACTGA AATGAAGGCC TTGATTCGCA AGCGGATTAT GTGGAGCAAG AGGACCGCCA CGACGGAGCA AAAGGCTTCC GCTTCTCTTT CGTTTGCCTC ACTGTCAGCA CTCCCTTCGC TGCGAGACAG AAAGGAAGCA ACGGTTTCAA CCCCCAATAC ATCAATCGGG AGTGTGGGTG AGGAAGAGGC TTGGACGTCG AAACGATCTT TTCGCTTGTT TGGGCGTAGG AATAAACGCA AGCTGAGCCC TCAGTCTCGT TCTGCCCCAT CCAGCATTCC AAGAACAACA AGGAAAATCG AGGCTTTCAG TAACGAGACA AACTCAGTTC ACAGTCGATT TCTTTCCCAA ATAAGGCGTC GTGCATGTAG CACTTTGAAC AACAAGGGCC GGAACGGACT AATAAGCAAA ACACATGATC TTCGGGACCC AGGCCAGTGT AAGGCCTTGA CGACACCGCG CGAAACTACC GTTCCCGCTA CGCATTAATC GCAACAGCGA TTGAAATGTA GTCGCGAAAT CGACGTCCTG TCCAATTGTA CCGGACTGGC TAAAGCATCA CAATCCGAGT CAAATTCGTT CTTGCACGAG GCAAGTGCGT TTGACGATGC AGCTAGAGCA GGAGGAGAGG AACTCCTTGT GGACGTGCCA GTGTCGAATG GTTATTCGGT CATGGCTAGT TTGCAACATT TGTACTGTGA ATATCTGTGT GGAGGCAGCA ATTCCGAAAG AGAGCATAGT TTTGATACGC AGTCCGTGTA CGACTTGATT ACCTCATATC TTGGTACAGC GCAATCTCAA GAACAACTCT ATGCTGGTCG TGCGAAGCGA TTGTTAGAAA ATCCAGCCGT TCTAGAAAAT TTTGAACGTG TTTTTGAGCG ACACCTGAAG GCGGGGTTAG CGGCCAAGCT CATGGAAGAA GACGAGGGGG AATTGGAGCA GTCAGGACCG CAATCTAGTC TTGCTGCGCC GTCTCAACTT CCGCAATCCC TACTGCGCTC ACGTTTCTTT ACTTATAAAG CTGGTTCTAA GCATCACACG CCTCGCCAAT CTCTCCGAAA ACCCCGCAAG GTTCAGCACC ACGTAAGTCT CCACCGCATC AATGCGGTAA ACGAGGCGGC GTCATCTGTG ACAGCTCCCG TCGCTCAAGC CACCAAGGGG CATCCGGCAA CTTGCCATTG TCAAAAGCAT ACCGCGCCGA TCGTCCCACC ACAGCTTTGG CCCCAAGGTC AACTTTTGAT GCGACCTACA CCTGGATCTG GCGTTCGTAT CAGAGGCATT CGTTTTGCCA AGGCAACATC GTCGTCGTAT TTGTGGCAAG CGTCCGACTA TCCGTCTGGT CAATCCCCTG TGACCTGGCC ACAAGCTTTG CAGGACCACT GGAAAGAGGC GAACGTGCAT ATATCTGCAA CTGCCATGCC AGAAGGTCAT GACGAGCTCC TGCAGCGTCT GCAGTCCTAT CGCATGTGCC CAAACTGTAT GATCCTTCCC GTGAATAATG GAAATGAGCC CGACGGAGAA TCGCTAGTGA CAGATTTTGA ATCGGACTTA TTTGTCGGCA CTATGCTGGT CCGGTTACGT CACACACAAG GTACTACACC GGAACCATAC AATGATCAGA TCGGGTACTT TGCCAATGTG AAACGACGTT ACCAAGTGGT GATGCGCGGT CGTTTCAAGC AAAGTATTCC GTGGACACGT TGTGTGGCGG GTTTGGAGCT GACTCGCAAG ACGGCGCGTT TGCCCGCAAA ATGCGTCGTC AAGGGGGCCA TGAAAGTCAT AAGCTTCCTT GCGCCACAAC TAGATGCGCA ATTGGAAGGT TCCCATCCCC ATAGTTTGAC GCCACTCGGT AGCACGGCGC AATCCCTCAG GGTGCAGCGT AATACGCCAT CAAGTATTGA GAATAACTAC GATGAATTCG ATCTTGAAGC CAAGTTAGAA GAACCAACTC GTGACGAAGA AACTTTGCTA GGAAAAGCAA GTACGGCAAG TAGTACAACG TCACGAGCTC GCTTTCGGAA AAAGGGCTTT GATAAAGTAT TTGGGAGCAA GGAGGCAAGT CTGCAAACCA ATCCGGAGGA CATATATACG TTTGAGTTTC TGCAGCACTT GTTCAACTTT CAGGAGTTTT CCATTGAGCT CAGTTCCCTC TTTGGCAGTA TCCATTTGCA GGACTGCCTT GATGGGCAAC CACTACAAAT AGTGGCGAAG CACAAAGACT CAAACAACAC GCGACTATGG AGCTTTGACG TTTGGCACGA GTGTTTGTAC CGAGAAGCTG TTGCGTTTGA CGAAAGAAAT TGCGACTAGT CGTCAATGAC CACTACCAAA GAATATTTGT TGGAGATAAT GCTACTGACA GTCAGCAGAT ATACTGCAAC AGAAAGATTC AGGAAGGAAT AGAATTTGTA TTCGCATGCG ATAGGCTTTT ATAATATACC TTGTGTGTAT GGATGTGTAG CTCATGTGAC ACTTTGGCAT GTCATTACAC ATCACAGACG GTGCATTTGT CGTTTGTCAC TACTTTTCTT TGTCAAGGAA TCTGTGGGCG TTTGTTTGGT TTAAGTAACA TAAACAGGAT TGCTACT
|
Protein sequence | MKALIRKRIM WSKRTATTEQ KASASLSFAS LSALPSLRDR KEATVSTPNT SIGSVGEEEA WTSKRSFRLF GRRNKRKLSP QSRSAPSSIP RTTRKIEAFS NETNSVHSRF LSQIRRRACS TLNNKGRNGL ISKTHDLRDP GQCKALTTPR ETTRLKCSRE IDVLSNCTGL AKASQSESNS FLHEASAFDD AARAGGEELL VDVPVSNGYS VMASLQHLYC EYLCGGSNSE REHSFDTQSV YDLITSYLGT AQSQEQLYAG RAKRLLENPA VLENFERVFE RHLKAGLAAK LMEEDEGELE QSGPQSSLAA PSQLPQSLLR SRFFTYKAGS KHHTPRQSLR KPRKVQHHVS LHRINAVNEA ASSVTAPVAQ ATKGHPATCH CQKHTAPIVP PQLWPQGQLL MRPTPGSGVR IRGIRFAKAT SSSYLWQASD YPSGQSPVTW PQALQDHWKE ANVHISATAM PEGHDELLQR LQSYRMCPNC MILPVNNGNE PDGESLVTDF ESDLFVGTML VRLRHTQGTT PEPYNDQIGY FANVKRRYQV VMRGRFKQSI PWTRCVAGLE LTRKTARLPA KCVVKGAMKV ISFLAPQLDA QLEGSHPHSL TPLGSTAQSL RVQRNTPSSI ENNYDEFDLE AKLEEPTRDE ETLLGKASTA SSTTSRARFR KKGFDKVFGS KEASLQTNPE DIYTFEFLQH LFNFQEFSIE LSSLFGSIHL QDCLDGQPLQ IVAKHKDSNN TRLWSFDVWH ECLYREAVAF DERNCD
|
| |