Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34367 |
Symbol | |
ID | 7199782 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 398061 |
End bp | 401660 |
Gene Length | 3600 bp |
Protein Length | 837 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178993 |
Protein GI | 219116396 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00490098 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAT CGACACGCCG CTCTTCTCGC CGCCGCGAGA AAACTGTGTA CAGCGTCGGA GATCTTGTCG AGGTGAGTTG GTGGTGCGAT ACTTTTTCAT AAACCTTAGC CGCGCTCTTA TTTCACTCCA TTTGAATGCA TTCTTCAGGT GACCCGCGAC GAGTCAATTG CTACGGGTAG ACTTGCATCG AAACAGACCG ACACTGCTAA ACCCCGTTGG CTTGTAAAGT TTGACGAATC TTCATGGCCA AGCATGGAGC TATTGGAGAC TGAACTTGGG CCTATTCTTG ATAGAAGCGA CGACAACGCG TCTCAAAAAG AAAAGCAAGT AAGGCAAAAA TCATCGTACG AGTTAGGAAC CACTTTGGGT GGGCGCGGCT CCCCAAGTAA GCGATCTTCC TCGCCGATGG TATCAGGCAA GAGCGAAAAC GGAAAGGCGT CCCCGATATC TACGAACAGT TCAGATTCCA AGAAGAAAGT AGAGTTCATC GCTCTTCAGG AAGAGTCTGA CATGTCTGGG TCCAAGAAGA GACCCGGTTC TTTATCCAGG GAGGAGCGAA GCAAACGTCG TCAGGCCATG ATTGAGCAGG ACAAACTGAA TTGGTCGAAG CCAGTTATGT CGCGACCCCC GAAAAAGAAA AAGGCGCAGC GCGATGAAGA AGTTGTTCGG GTACCAATGC TTACGGGTAC ACTTTTGCTA TATCGCGGTG CTCATCGCCG GGCCGAATTT GTGCGTAAGT TTTGAATGAC ACTGAGTTAA AAGCTTACAT ACGCAGATCA TATGGTAAAT TTGATTTGTA GGCGGATTTA GATCAACGAA ACATATTTTA CATATGATGG CAATTGAATA GTAGTGTGCT AGAGACGCTA TTCCTTAGAC TTTATATTCG CGAATATAGC GCATTCGGAT AGGCGCGTTG CAATAGAAGT ATTGGAGACA AAGAAACTGC AGTAACGTTC TTAACTCCCA TGTCTTGTCA CGAGCGCTGT GTTCGGAGGT TTTCCTCGGA AGAATGTCGA TGCGAAGGCT GTAGAGGAGG AGTGTGTGAG ACGTTCGTTC ACAGTCATGT CAAGGAAATA TGTGAAACGA AAACACAACA TTTTTCAGTT TTTTGCATTC ATCATTTTTC TAAAGGACGT TTTAAAGCGT TAATTTGAAA AACTGTCGAG ACATATACAT AGAAGAAATC CCAAGGAAGA GCTAGGTAAA GTAAAGACTT TTCACTTTTA CATTCTTTCG ATGTCGTATC TTGTAGTTAC AGATAGTGAT ATCAACCCAC CTGCGAGGCG AGTCGCTTTC ATCAGTTTTG GCGAAAATAG CGGCGGAGTG GCAGTGATTG TGCCGTCTGT TTTAAATTTC GCCCACATGA ATCTACAGAA GTAAGCTTGA GACTTGGCAT GCATGCCCGT TTCCGGACCA TTCAACAAGT GCTTTATTTC GATTTTTTGA CTTATGGTAC TAAGCAACAG CAAGGACGAA CACGGAGACT GGGTGGCATT CCGTGACACG CCAGCTTCTT TCGAAGCCAG GAGTGTAAAC GAAAAGAGAA GTGTACTCAA TGTTGCTTCC AAAAATATCT CGCGCAGAAA CTCCGGTACA GGTTCTTGGA TGGGAAAACC CATCAATGCT GATGCATCGT GGGTCCCGGT GGCCTCCCCA GCACCAGGAA CGTATTCGCC TCGTCCGTCC AGTCCACAAG GTTCTCGCAT ATTGTTTAGA GAGTTCGTGG GGAATCGCAA GAAAGAGCGT TCTGATATTT TGACCCCATC TCAACCATTT ATGTGTCGAA GCAGCGCAAC ACAGTCATCC AGCTTACAAG CTTGCAAGAG TGACATCTTG TCAGGAAAGA AACTCAAGAA AGGTAGATCG GGCTCGTCCC CAGTGCCTCG TATTGTAATC CCTCTGAAGC CGGACCCTTC GATACTTCGT ACCCGTACAC GTAGTACCAC AGTGTCAAGC GGGGAAGACG AAGATCAGAA TCGATGTTCC CAAGCTCAAA ATGGTATTAG GGCTGCTAGT GCACATGAAA GGCGAGGACG GTCCAAAAGC CGCAGTCGGC ATTCGCGATC ACCGTCGCCT AGAACTAGGT CGAAGTCAGT TTCTGGGAAT AGTCAATCAA ACAACCGACG TGGGCGATCC AGTAGCCGCC CACCTGTAAC ACGGCCGAAC ACTGTGCGTG AGTCTCGCGT TAAATCTCGA TCAAGCAGCC TGACTAGGAT CACAAATCAC AGGGGACCGA CAGTGGTTCC ATCACCTTCA GCAAGGTCAG TAATTAACTT TCCCCATGCA TCTCTGTCGA CCACGTGTCA TCGTAGAGAA AATGGCAGTA CCGGACCAGG TATGCTCCCC TGTTCCAAAC AGAGACGGGA CCCTAAGATC GGACGAGATA TCAGCTTTGG AACGGTGCAT TTATCTGACT CATCATCGGT ATCAATCAGA AGTGAAAAGA GTGGACTGTT CGAGAAAGTT TTTGGCTTCC CAGGCGGACA AGCTTTGCAG AAACCTCAAG TAAAGCACTC CATTTCGACA CGACCCCGTA TTCTTCTAGC TGCAACAGTG TACCACAATA CAGCGACTGG TCTATGGATC ACAACAATCA ATACAAATCA ACGAGGAGTA TCAAAAAATC CTGCACAAGC GAATAAATTT CTAAAAGCAT TCTCGTTTCC TACAGAAAAG GAAGCTCGAG AGTCAGCTAT CGCAAACGCC CCACCGAAAA TGGTCTCCTT TCAAGAGTCA GCTAAATGCT TCCATTGCAG GAAACTTTTC GCAGTTTTCA AGCGCGCCTG TCATTGTCGA AACTGCGGTG TGTGTATTTG TGCCAGCTGC TCAATATCTT GGCCTGCTAA GATGCTTCCA GAAACATACA ACCTGAAAAA TGAAGCTTCC TTGAAAGTTT GTACAAGTTG TGATACTCTT AGTTCTCTCT TCAAAAAAGC GCTCTTGGAA GCGAAATATG AAGAAGCGAT AGCAATATAT GAGACTGGTA ACGTCAACCT GCGTACTCCT TTTCCTCCTG CTAACAAAAA GGATGAAGTT CTTTATCCCA TCCATGCTGC TATTGAGGGC GGCAACCTTA AGCTTGCGCG TTGGCTTGTC GAAGACCGCT TCTGCCCTCT AAAGCAAATC AGAGCCGGGC GATCGAAGTC AGATAAAAAC GCACTTATTC AGACGTCGAA AGGGCGAACT GTCTTGAGTA TTGCTATGGA GTTCCTTCGC ATCGGTATTC TACGATTTTT GGTTGTTGAA AGAGGAATTT CCGTATTCGA AGCTACGGAT ACAAGAAGCG CCCTTCGGAC TATTGAGGCG GCCTTGGTTG CTCTGCCCTG CTCTTCAGAA GGAGATGGAA TTCGAGAAGA CGGGGCTTCC ATAGCGCGGT GGGACCAAGC CTACTTCGAC GATATGTCGG AACCGAGTAG CCTCGGAGAC GATGATAATG TCACAATTGT AAGCCGATCG GTTCGAACAA GAACGAACAC GGGCGACTGT TGCATAATTT GCATGGATCA CAAAATTAAT TGTGTTGCGA CTCCCTGCGG ACATCAGGTA TGCTGTTTGG GTTGCAGTGC GAGCCTTTCG GCATGCCCAG TTTGCAATAA
|
Protein sequence | MTESTRRSSR RREKTVYSVG DLVEVTRDES IATGRLASKQ TDTAKPRWLV KFDESSWPSM ELLETELGPI LDRSDDNASQ KEKQVRQKSS YELGTTLGGR GSPSKRSSSP MVSGKSENGK ASPISTNSSD SKKKVEFIAL QEESDMSGSK KRPGSLSREE RSKRRQAMIE QDKLNWSKPV MSRPPKKKKA QRDEEVVRVP MLTGTLLLYR GAHRRAEFVL TDSDINPPAR RVAFISFGEN SGGVAVIVPS VLNFAHMNLQ NKDEHGDWVA FRDTPASFEA RSVNEKRSVL NVASKNISRR NSGTGSWMGK PINADASWVP VASPAPGTYS PRPSSPQGSR ILFREFVGNR KKERSDILTP SQPFMCRSSA TQSSSLQACK SDILSGKKLK KGRSGSSPVP RIVIPLKPDP SILRTRTRST TGTDSGSITF SKRRDPKIGR DISFGTVHLS DSSSVSIRSE KSGLFEKVFG FPGGQALQKP QVKHSISTRP RILLAATVYH NTATGLWITT INTNQRGVSK NPAQANKFLK AFSFPTEKEA RESAIANAPP KMVSFQESAK CFHCRKLFAV FKRACHCRNC GVCICASCSI SWPAKMLPET YNLKNEASLK VCTSCDTLSS LFKKALLEAK YEEAIAIYET GNVNLRTPFP PANKKDEVLY PIHAAIEGGN LKLARWLVED RFCPLKQIRA GRSKSDKNAL IQTSKGRTVL SIAMEFLRIG ILRFLVVERG ISVFEATDTR SALRTIEAAL VALPCSSEGD GIREDGASIA RWDQAYFDDM SEPSSLGDDD NVTIVSRSVR TRTNTGDCCI ICMDHKINCV ATPCGHQCEP FGMPSLQ
|
| |