Gene Information |
Locus tag | PHATR_43938 |
Symbol | |
ID | 7204369 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 480081 |
End bp | 482001 |
Gene Length | 1921 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186355 |
Protein GI | 219113543 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0338655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTTGAA ACGCTTCAAG GGTACTCTAC CGATACTATT TACCAGGTTG TTTTGACAAA AGCTTTCTCA AGGCTGGATT CAGACTCGGT TGAAGTCCCT TTGATCGAAA GGAACCATGG TGTGGTCAAG GTCACCGTCC CGAGGACGTA TATGGTTGGC TACGTTTATG GCCTTGACGT CCATGACGGG AAGTGGCTTT GTTTTACTGA CAAACGCGCA TACGGGACCT CATCATCATC AGCCTCGAGG AGGTAGCAAT ACGAGCTTTC GACAAATCTC CGGGCCACCA TCGGCAACTT CGACTTCTGT GTCGGCGCCT GCAGACGATG GAGCAGAAGA GATTGCCAAC AACATTTGCA CCGACGAGAC GCCGAGTCGG CCGATTCCAC TGCAAGCCAC TAACGAGGCC GTTGCGAGCG CCAAGATGCC TAAGGTGTGT TGGAAGCCCC CGATACCTTT TCGTGCTGTA CTCACCACCT GCTGTTGTCT TTTGTATGGA CTATTTTCGT TCTGTGTCTA CCCTCGTCTA ACTAACGTAT CTTCCTTTCG ACTCAACAGA ATCAAGGGGC GCTCAAAGTA TTGTTTTTGT CCGCCGATAC CGGTGGTGGC CACCGCGCCT CGGCCGAATC ATTGGCAAAA CAGTTCCTCA TTCACTATCC GGGCAGTACC TATGACCTAC TCGATGTTTG GACGGAAGAC GGGGTATATC CGTACAAAAC ACTCGTGGAA TCATACAAAC ACCTATCAGC ACATCCGCAG CAGTGGAAAA TGCTCTATCA TCTTTCCAAC ACGAGACCGT GGGAAGTGTT AATGGATTGG CACAGTGCGT TCATGTGCGA AGCCAAAATT AGGGCACGGA TCGCCTCCTA TAATCCGGAC GTAATTGTAT CGGTCCACCC CGCTATGCAA TACGTACCCA TGAAAAGCGT CCGGCATCTC TCTCGAGAGC GAGGCCGTCA CATTCCTTTC TTCACTGTCG TGACGGATTT GGGATCTGGT CACTGTACTT GGTTTCAAAA GCATCCTGAC AAGATATACA TTGCCTCGGA ACGTATTCGA CGCCTTACCA AACGACGGGG TGGTACCGAG GATTGCAAAA TTGTTAGTAC AGGTTTGCCC ATTCGCCACG ACTTTGCTGT TCACTCGAAG GCCATGGGCG ATCGCACAAC GCCATCGGGA CAGGCCTACG TACAAAAGAT GAAGCTCGAT TTGGGATTGC CGGGCGACAA ACCTATGGTG CTACTCATGG GCGGTGGAGA AGGAGTTGGG TCATTGTCAG AAATTGTTGA GCAAGTATAT CGATCACTAG TGTCGGAGGG CGTGGACGCA ACTATTTGCG TCGTCTGTGG CCGAAACGAA AATTTGCGTC TCAGCTTGGA ACAACGAGAT TGGGATGCTG TTTTAGAGGC ACGTCCCAAG TTCTCCAAAC GGAGATTTTT CTCACGTATT CTTTGGCGTC GGCGACGCAG TCGTCGGTTG CAGGAATCAT TAGATCGAGC TGAAGCGTAT CAACACGACA GACCAGATTT GGTCAACGCG AGAGCCACAG TGGATGTGAT AGGCCTCGGC TTTGTTACGC GCATGGCTGA ATACATGGTG GCTGCTGATG TCTTGGTCAC TAAGGCCGGC CCAGGAAGCA TTGCCGAAGC TGCATCGGTT GGCTTGCCTA TTATGTTGAC GAGCTTCTTG CCAGGACAAG AAGCTGGCAA CGTTGACTTC GTCCTCGACG CGGGCTTTGG GGACTACAAT GGCGACCCTG TTGAAATTGC CCAAGAACTC ACAATATGGC TGAAAGATCG 
CAAGTTACTA GTCGCTATGA GCAAAAGCGC ACAAGGCTCT GGACACCCTA CCGCCGCGGA AGACATTGTT CTTGATATTG GTGAGACAAC GCGTGCTTGG ATGCTTCTCA ACAATAAGTG A
|
Protein sequence | MVWSRSPSRG RIWLATFMAL TSMTGSGFVL LTNAHTGPHH HQPRGGSNTS FRQISGPPSA TSTSVSAPAD DGAEEIANNI CTDETPSRPI PLQATNEAVA SAKMPKNQGA LKVLFLSADT GGGHRASAES LAKQFLIHYP GSTYDLLDVW TEDGVYPYKT LVESYKHLSA HPQQWKMLYH LSNTRPWEVL MDWHSAFMCE AKIRARIASY NPDVIVSVHP AMQYVPMKSV RHLSRERGRH IPFFTVVTDL GSGHCTWFQK HPDKIYIASE RIRRLTKRRG GTEDCKIVST GLPIRHDFAV HSKAMGDRTT PSGQAYVQKM KLDLGLPGDK PMVLLMGGGE GVGSLSEIVE QVYRSLVSEG VDATICVVCG RNENLRLSLE QRDWDAVLEA RPKFSKRRFF SRILWRRRRS RRLQESLDRA EAYQHDRPDL VNARATVDVI GLGFVTRMAE YMVAADVLVT KAGPGSIAEA ASVGLPIMLT SFLPGQEAGN VDFVLDAGFG DYNGDPVEIA QELTIWLKDR KLLVAMSKSA QGSGHPTAAE DIVLDIGETT RAWMLLNNK
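The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the coding length assumes a single stop codon. GC content is computed only over A/C/G/T characters, so the space-separated 10-base groups used in the Sequence section can be passed in unchanged.

```python
# Values copied from the Gene Information table in this record.
START_BP = 480081
END_BP = 482001
PROTEIN_LENGTH_AA = 559


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein, assuming one stop codon."""
    return 3 * (protein_length_aa + 1)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks, and any other characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    span = gene_length(START_BP, END_BP)
    coding = coding_length(PROTEIN_LENGTH_AA)
    print(span)           # 1921, matching the Gene Length field
    print(coding)         # 1680 bases of codons, including the stop codon
    print(span - coding)  # 241 bases of the span not accounted for by codons
```

Under these assumptions, the span exceeds the codon count, which is consistent with the record marking the gene as spliced. Passing the Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field.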