Gene Information |
Locus tag | PHATRDRAFT_23694 |
Symbol | |
ID | 7198735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 228949 |
End bp | 230665 |
Gene Length | 1717 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184921 |
Protein GI | 219129491 |
COG category | |
COG ID | |
TIGRFAM ID | |
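The listed gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That convention is an assumption here, although it is consistent with the values in this record. The helper name `gene_length` below is hypothetical and only serves as a minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
length = gene_length(228949, 230665)
print(length)  # 1717

# Nucleotides needed to encode 521 residues plus one stop codon.
coding_nt = 521 * 3 + 3
print(coding_nt)  # 1566
```

Since 1566 is smaller than 1717, the gene span appears to include some sequence on either side of the coding region.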
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.728376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGGAGATGC TGGTGTATCT CCACAACCCA AACACGGCTT CACCTACCAA ACAGGTACGT GCGCGCTGGA GTTTTGGGAG CCCATAAAGG AGCTGGGCGA AGGCAGCATT TCTAGCATTC ACATGGTGAG GCGACGAGAA AAACGAATCC ACGTTCCGTA CAAAGAGCGC GTTGATATTA TGAGCTTGGC GAGTAACGGC TCGACCATGA AGAATGACGA CGAACTCTTG ACCCGAAGAA GCAGCTTTTC TAACAAAGAA TCATTCGCAC TGAAGAGTAT TATCAAAGCC TACGTACAGA ATGATAAGTT TTTACAAGAA ATGCGCGATG AAATTTACAC CATGAGTCAT TTAGACCATC CCAACATTGT CAAAGTCTAC GAAGCTTACG AACGAAAGCG CCATATTTAT CTTATTATGG AGTACTGCCG CGGCGGTGAT TTGTGGGCCC GTCAGTTGAA TGAAACAGCT ACGGCCGCGG TAGTACGTAA AATCTTGCTA GCCGTATCGT TTCTACACGA TCACAACGTT GTTCACCGAG ATCTCAAACT AGAAAATATT ATGTTCGATC AACCTGGCCC CAATGCCGAA GTCAAAATTA TCGATTTTGG ATTAGCCACC CGATACTTGT CGAACGAGCA CAAACAAATG ACTGATCGAG TTGGGACCTT GTACAGCATG GCGCCACAAG TTCTACAAGG TGTTTATGAT GCCAAATGCG ATTTGTGGAG TATCGGCGTG ATAACGTACG TTTTGCTGTC AGGAGGTACT CAACCATTTT GGGGCCCACC GCGGGAAATT CCATGGGATA AACGGAAAAA AATTATGGTA GATCGAATAA TGCGATGCGA GTACATGCGC ATGAAAGGCA CAACTTGGGA CGGTGTATCA GAGGAAGCTA AGCGGTTTGT GAAGTCTTTG TTGCAAATCG ATCCCGCCAA ACGACCGTCG GCCAAGGAAG CGTTGGCTTC GAAGTGGATG AAACTGCACG AAGAGGACAA GCCAGCACTG ATTGCTTGCA CACCCAAATC TCTTCAACGT GATCAACTGC ACCGATTCGA ACGGCAGCTT CGTATTGTGC TGACCAACAA ACTCTCCGAA GAGGCATTAA TGAGTTTGAA AGCAGGTTTA GAAAAGCATG ACGAGACCGA CGAAGGCCGT GTTTCGCTTG AGGTGATGCT TCGGTATTTA TTAGAAAATG GTTTGGAGCA AATTTCGCTT GCCGCACTGC ACGAGCTGGC CAATGGCGCA GGGAAAGATG CGATGATAGG CTACACCGAA GTAATTTTCG CTTCCTTAGA ATCCAAAGGA CGTCGTGAAT CAGAACGTAT GGCAGAAGCT TTGGCGGAAA TGGACATCAA TTCTTCGGGA ATGGTTGCGA AAGCGCGAGC GTTGTTAATT TTGTACCGCG TCGTACCTGA CCACACCCTC GACATTGTGA AGGAAACTTT CAACGAGGAC GACACAGACC AGATCTCATG CCAGGTAGTC CTTGATCTGA TTAGCAAGCA AATGGCTGAT CGAATTCACA CCCTTTCGAG CCATCACGAC TCCTCCGATG AAGTGAAGAA TCTGGTTGAC GCCAAAAACG CTGTCATCCC AGGTGGGAGA AATGATCCGT CTGAAAGACC TGAGTATGTC TTTGACGCAT CCACCAATTC GGTTCGAAAA TATGCCGAAA AACAATGATT TGATCTCGAT GCTTCGGAAG GGTCCAA
|
Protein sequence | MVRRREKRIH VPYKERVDIM SLASNGSTMK NDDELLTRRS SFSNKESFAL KSIIKAYVQN DKFLQEMRDE IYTMSHLDHP NIVKVYEAYE RKRHIYLIME YCRGGDLWAR QLNETATAAV VRKILLAVSF LHDHNVVHRD LKLENIMFDQ PGPNAEVKII DFGLATRYLS NEHKQMTDRV GTLYSMAPQV LQGVYDAKCD LWSIGVITYV LLSGGTQPFW GPPREIPWDK RKKIMVDRIM RCEYMRMKGT TWDGVSEEAK RFVKSLLQID PAKRPSAKEA LASKWMKLHE EDKPALIACT PKSLQRDQLH RFERQLRIVL TNKLSEEALM SLKAGLEKHD ETDEGRVSLE VMLRYLLENG LEQISLAALH ELANGAGKDA MIGYTEVIFA SLESKGRRES ERMAEALAEM DINSSGMVAK ARALLILYRV VPDHTLDIVK ETFNEDDTDQ ISCQVVLDLI SKQMADRIHT LSSHHDSSDE VKNLVDAKNA VIPGGRNDPS ERPEYVFDAS TNSVRKYAEK Q
|
| |
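The relation between the gene sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The function name `translate` and the short excerpts are illustrative only; the excerpts are copied from the gene sequence above, starting at the ATG that matches the first residue of the listed protein and ending just past the stop codon.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a translation table).
# Codons are enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0 until the first stop codon or the end.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, taken from the gene sequence above.
cds_start = "ATGGTGAGGC GACGAGAAAA ACGAATCCAC"
print(translate(cds_start))  # MVRRREKRIH

# Last six codons plus the stop codon, taken from the gene sequence above.
cds_end = "AAATATGCCG AAAAACAATG ATT"
print(translate(cds_end))  # KYAEKQ
```

Both outputs match the first ten and last six residues of the protein sequence in the record. Note that the gene sequence contains an earlier ATG upstream of this one, so simply translating from the first ATG would not reproduce the annotated protein.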