Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49612 |
Symbol | |
ID | 7198256 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 202233 |
End bp | 203923 |
Gene Length | 1691 bp |
Protein Length | 439 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184419 |
Protein GI | 219128436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00322088 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGATATTTG TATATAGGCG ACTGTACTGA CAGTGGTCCT CATTATCATC TTGACATAAC AGTGAAACCT CGTGGAGCTC TCGCCGTGTT TAAATTTGAA TGATACACAC GTCAAACGAA TTCCAAACGC CAAAGTTGAT GGCTCCACGA GAAAACAGTT GTCGAATGGA GGCGTTTTCA CCCAATGCTG TCGGATTACA ATGCAGCTGA CTTCTCTCTC TCTACAGAGT AGCGTATACC TTACCACATT CCCTCAGACG CTGTGTCCAA ATAAAGAATG TCCGCTCCAA GGAAAGAGAT CCATAGTAAA CCAAGAAAGA ATGAAGCGAA CCCCTGGCAT CACAACACTG ATGCGCTTGC GACCTTGGAC CACTTTGGAA AGGTTCGTTC TCACGAATTT TCGCTGTGAA AGAAGCCTTT ACAGAACCAA AACCAATTTA TTCCCTCAAA TACTTTGACA TGCGTTTGTA ATACAGAGAG CTCTCGAGCG TGAGGCTGCC GTCCAGCAAA TGTTCGTCGA GTACGTAGAA AAATGGTCAG CCAGCGTCAA CGCCGGAGAG ATCCCGAACC ATAGCCGTAA AAGCGATACG GTGCGATTTC TACCACCAGA CTTTCATTTG GAGGATAGCA CTTGGGTGTC TTTCTCAAAT TTTGTGAGAG CTCATGGCGA CGGCTATTGG TCAGCGAAAC GAACTCGGTT GACTTTGATA GAACTGAACA ACCATCCAAA ACTCAGAAAT CACCGAGGGC CTTTCTACTT TGTCGATCTT TCACATAGCG AGCCACAAAC CCAATCATCT ATTCATCCAG CAGGGGAAAA TCAAGAGGAG GACAGTCAGA CTGTTTCAAA ACGAAAAGAG GGTTCTCGCG AATTCTACGA TTCAAAATTT GTGTCGAATA ATCAGAGAAG TCAGCAACCT AAGTGGCCCA AACAGGCACA AGAGGTCGAG ACTGCATACA GCCAATTTGT TGAAAGCAAG ATGTCGACGG GCCTGAAACA GCTTACTCAT TCTCAATCAA GAAATGTGGG CGATCCCACT TGGATCCTTC CAGCGCCCGT TCCCTTCGAC ACGGTGCAAC TACCTCAGCA CGTCCTCTAT CGGCAGCCGA CTCAGAGTAA AAAAATGTCT GTGGGGGATA TCGATTCAAG TTCACCTCCA ACATTGCGCC TTTTGGGAAA GCATGCTGTT TCCCCTGGCA ACCATGATTT TTCTCGTCGC TCAAAAGTCT TACCTAGGTC AATGGCTGAC AGCAAAAATG GATGTCTCGA TTTTACCGAG TTTCACAGCA GGAACGGCTG CTTTTTGCCG ACGGTACCGC TACTCCACCA TGATCAGCAT CTGTATAGCG CTACACGAGA TGACAACGAA CGATTGCCAG TCCTACCATC CAATACTGAG ATGCTTGTAT ACTCCCAGCT CTCGGTCAAT ATGTGCGATA TCGAGACGCC TTTTAACGAG ACGACGGAGG CCGAGACTAC AGAGTACACA CGCGCAGACC AGATCATGCT TATGGCTCCA CTCATCCCGA CGCCACCCAA ACCTACTCCT CCCCGCATGC AAAGGAACCT TGCCAACCAC TGTCTTCCTA GACTATCACC GTTCGTAAGC CCATTTCAAT CGTTGGAGAA TTCGGTCACG TCCACGAAAA TAAACGACAC CTGCAAAGGC TTACCACTCC AACATGATTA G
|
Protein sequence | MSAPRKEIHS KPRKNEANPW HHNTDALATL DHFGKRALER EAAVQQMFVE YVEKWSASVN AGEIPNHSRK SDTVRFLPPD FHLEDSTWVS FSNFVRAHGD GYWSAKRTRL TLIELNNHPK LRNHRGPFYF VDLSHSEPQT QSSIHPAGEN QEEDSQTVSK RKEGSREFYD SKFVSNNQRS QQPKWPKQAQ EVETAYSQFV ESKMSTGLKQ LTHSQSRNVG DPTWILPAPV PFDTVQLPQH VLYRQPTQSK KMSVGDIDSS SPPTLRLLGK HAVSPGNHDF SRRSKVLPRS MADSKNGCLD FTEFHSRNGC FLPTVPLLHH DQHLYSATRD DNERLPVLPS NTEMLVYSQL SVNMCDIETP FNETTEAETT EYTRADQIML MAPLIPTPPK PTPPRMQRNL ANHCLPRLSP FVSPFQSLEN SVTSTKINDT CKGLPLQHD
|
| |