Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44285 |
Symbol | |
ID | 7198002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 125015 |
End bp | 126356 |
Gene Length | 1342 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178169 |
Protein GI | 219114747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.303201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCACCAGT GCAAACATCG AAACCTGAGA AGCAACATCA TTTGCTTTTC CCGTTAGACT TTACGAGGCC CAACCCTAAA CTGAACAATT TGAGAAATAA AGAACGGGGT TTCCCTAGTT ATCAGGTCCG AAATGTTACC GTCCAGTATG AGTCCGTCAT CAAAAGAAGC TGTGGAATAC ATCACCACCC ATGATGTGGT GTTTGGGAGA GGGAGAGGAT ACGAAAAGCA TCCAGGAAAC GTGCGATTTC ACGAAATCCT TGAAAGCTAC ATGGACCAAT ACATGGCTGC AAAAACACGA GCGCAGAAGG TTGCTATCTC TGTCTCAATA ATTTCTAGCG TCAGTATGTC GGGACGCTTT CTAAGATATG ATGGCACTCC CCAAGGATGG GTTCCAGCCC ATGGGGACTC TGTCTGCAAT AAGGTAAGCC AAGCTCTTCG GCATCGTGCA AGAATAAAGA AGAAAGAAGC ACAATATGAG AAAACGTATA GGGCTACTCT TAATTGCGAC AAATCGCTAG CTGACCGACA GACAAATGTA ATGCCGGATA TATATGAATG CAGTCGACCG GGAATTGAGT TTCCATCAAA CAACAGAGCT AAGCTGGTCG AGCACCAGCA CGGCCTTTCG CCCATTCCTT TTCACGATAG TGCAACAATC GACACAATTG GGCTAGACCA AGTAACATTT GATGACATTG ACCTACTGAA CAGAATCCTA TTCAATAGCA CTCCCTGCAG CACTAACTTC GAGTTTCAAG CTTCCCGCGA CTTTCCCCCC AGCTGGTATC CAACAATCAA GTCAAAGCAA CATTCCAATG TATGGCGACA TGACAATTTC AGCTATTCTG TGGCCATCGA TCCTTCTGAG ATCAGAGCGC GGTATATGTT TGGTTGCGAT TTCAATTCAT TTTGCCACAA GCCAATCACT CAGTTGTCGA TAATGACCGA AGATATGAAC AGTTGTAGTC ATTTGCAGAC CTTACAACAT CAGCAGGCTT TTCCAGATTT GAACCCAGAG CTCAAATTAC AAACAAAAGA GCGCCTGCGC TCGCCATCGA GCTCAATTGA AGAACATGAA ATTCATTGCC ATCCAAAGCT TGGCGATCAT CCAGCCGAAG TCGAATTAAC TTCTATCGAG GAGGATCTCG ATGAGCTTTT TCAAGAATCA TGGAATTTAT TCGACTTTGA CTCTTGCTTG TAAATTTGTT GACCGTCTTC GCGGATCCTG GCTTTTGCCA ACAAAGAGTC AGGAGATTTA GTCATAGATA TTCGTAAGAT GGCTCTTTGT GCCATACAAC AACAATGGCA GAAGTTCATT AAAACAATAC TACGGGCTTT CC
|
Protein sequence | MLPSSMSPSS KEAVEYITTH DVVFGRGRGY EKHPGNVRFH EILESYMDQY MAAKTRAQKV AISVSIISSV SMSGRFLRYD GTPQGWVPAH GDSVCNKVSQ ALRHRARIKK KEAQYEKTYR ATLNCDKSLA DRQTNVMPDI YECSRPGIEF PSNNRAKLVE HQHGLSPIPF HDSATIDTIG LDQVTFDDID LLNRILFNST PCSTNFEFQA SRDFPPSWYP TIKSKQHSNV WRHDNFSYSV AIDPSEIRAR YMFGCDFNSF CHKPITQLSI MTEDMNSCSH LQTLQHQQAF PDLNPELKLQ TKERLRSPSS SIEEHEIHCH PKLGDHPAEV ELTSIEEDLD ELFQESWNLF DFDSCL
|
| |