Gene Information |
Locus tag | PHATRDRAFT_49898 |
Symbol | |
ID | 7198602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 232208 |
End bp | 234640 |
Gene Length | 2433 bp |
Protein Length | 810 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184756 |
Protein GI | 219129144 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
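The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene length includes the stop codon. The function name `cds_lengths` is illustrative and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive interval
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(232208, 234640)
print(gene_len, protein_len)  # 2433 810
```

Under these assumptions the result agrees with the Gene Length (2433 bp) and Protein Length (810 aa) rows.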
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.412602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCAC AGTCTGAGCG GTCCCGTCGT TTACGGCAAG CTACAGGAAC GCCGCCGTCA ATATCTGCAC TGACAAAGGA ACACTCTCGT ATTGGAGTTC GACGCAGCAA GAATACGGGC TCTTCCCCCA GCGCGTGGAT ATTCATGCTC TGTTTCGGTG GTTGCTTAAT TCTGCTACTT CACTCTACAC AGCAGCTTCC GTCTTCGTCC ACCTACTACG AACCTTTAGA ATCAATGCTG CCTTCATTGC AGTCCACCGT CGGCTGGGAC GAACAAAACC CGTTGGCCAT CCCCAAGGGC CAGGCTCCAA TTCTACCATC GCTACGCACC AATGGTGTCG ACAACCAGCG GAAAGGGTAC GGTGGACACG GAGACCAGAA GCACCTCGGT GGATTTACGG AATACGACGG CATGGGCGTC AGTCCTCACA CTTGGAAACA CATGATCCAA GACTACGGGG TGCATTCGCT TTTGGATGTG GGATGCGGAC GCGGTACCTC CACGGCGTGG TTTCTCATGC ACGGCGTCGA TGTTCTGTGT GTAGAAGGGT CACACGACGC GATCGAACGG TCCGTACTTC CGGATCCCGC AACCCTAGTG GTGGAACACG ACTTTTCGCG AGGACCTTGG TGGCCCGCCA AGACCTACGA TGCTGCCTGG GCCGTCGAGT TTCTGGAACA CGTCAACGTC CAGTATCATT TCAACTACGT TACGGCCTTC CGCAAGGCGG CACTGATTTT TGTTTCCACT TCTCAGTGGG GAGGCTGGCA TCATGTCGAA GTACATGGAG ACGAGTGGTG GATTCGAAAG TATGAAGCCT ACGGGTTCAA ATACGATGCA AGCCTTACCA CGCAAGTCCG GAAATGGGCC AGAGAAGAAA AATCTTGGGC GAACAACACC GGTCCAGACG GAAAGCCCCA CAATGCTCAG CATATTTGGC TGTCTATGAA AGTGTTTGTA AATCCCATTG TCGCGGCTCT TCCGCAGCAT GCTCATTTGT TTCCGGAGGA CGGCTGTTTT CTGCGAAGGG GCGACGATGG GGAAATCCTT CACAAGGAAT GCGGAACTGG CAAAGATGCA GGATTGGAGA CACCATTGGC GCAGGGATTT CGTCCCCTCG CTCTGAATCC CGCGATGGAC CAGAGATGGC TACGGCACAT TCAAAAGCAT GCCTCTCGTC TTCATGAGGG GAAAGAGAAG GATGAAGCCA GTCCTGATGA TGACGAGACA AGCGCCATAC AAAGTGAGCC GACCAGGCCG GCAGATTTGC TCCGAAAAAT TGATGAGAAA AACATTACTG ATATACTTCC ACTTCATGTC GTCGCGTGGC CGTACTTGGA ACACGGCATC AGGACGGCTG AACATCAGCA CATCGAGGAA AACGGTATTA GCGAGTCGTC GTATTTGAGG CTTAGCGAAG ATATGTTGGA TTTTCATCCC AACGTTGTTT GGCTTGGGGA CATTGGTTGG GGCTTTCCTT GGAAAGAATG GTGTCAGGAA TACACCAAAC ACATCAAAAA GGCCAAAAAT ATGCGGCGCG AGAAAGGGCT GCCGGAGCAG TGGCCAATCT TCATTGCCGC CTTCACCGAT GGACCATCTC TACCGAGATG CCAAAATGTA GAAGCTGAGG TTGGTAAAGC GAACGTTCGC TACACAAGTC GGTCAATAGT GCACAATCGG CGCTGGAATG AAGCAAAGAA ATGGGTGGAA ACGGGCGAAA AGTTGAATAT GATGAAGAGC GGTATCATCT ACCGGCACAC GCCTCTGGTT GTTCGAACTG ATACGGTGAA GTTTTTGGAA 
GAAGCCCTGC GGAAGCGCAA CATGACGCTG GCTGATCCTA TAGAGCGTTT GCAACGCGAC GTAGATGTAG CTCATTACTG GCCTCATCAA AGAGACCTAG ACAAGGTTGG TACAGTTGGA TCGCTTTTAC GCCAGGAGAT CAGTAAGCTT CTTTTTGCTT TTGGGAAAAA TACAAATTTT AACGTTTTTG TTGGACTGAA GGGCGAAGCG GTTCGCAAAG GTCGTCGTGG TGTCGCATCT GATTATATTG AGTCCTTGTT GGAGACCAAG ATTGTTGTTG TCTCACAAAG GGATCGATGG GAAGACCACT ACCGACTTAT GGAAGCTCTG GTTGGTGGCG CTTTGGTTTT GACGGATCGC GTTCTGGGAA TGCCGGCAGG TCTAGAGAAT GGCACTTCGG TTGTTGAATA TGAGAGCGCG GATAGTTTAT TGTCTTTGAT CCAGTACTAC CTTACGCACA CGGAAGAGCG GCTCTCCATT GCCCGCAAAG GAAGGGAAGC TGCAATGAAG AAACACAGAA CATGGCATCG TATCGAAGAG ATCATTTTTG GCGAGAGCTT GTCGGATTGT AGGTTTCAAG GCTTGGATAG CCCTTGTCCG TATGTTGTGC ATGGCGTCGA GTCAAAGCGC TGA
|
Protein sequence | MESQSERSRR LRQATGTPPS ISALTKEHSR IGVRRSKNTG SSPSAWIFML CFGGCLILLL HSTQQLPSSS TYYEPLESML PSLQSTVGWD EQNPLAIPKG QAPILPSLRT NGVDNQRKGY GGHGDQKHLG GFTEYDGMGV SPHTWKHMIQ DYGVHSLLDV GCGRGTSTAW FLMHGVDVLC VEGSHDAIER SVLPDPATLV VEHDFSRGPW WPAKTYDAAW AVEFLEHVNV QYHFNYVTAF RKAALIFVST SQWGGWHHVE VHGDEWWIRK YEAYGFKYDA SLTTQVRKWA REEKSWANNT GPDGKPHNAQ HIWLSMKVFV NPIVAALPQH AHLFPEDGCF LRRGDDGEIL HKECGTGKDA GLETPLAQGF RPLALNPAMD QRWLRHIQKH ASRLHEGKEK DEASPDDDET SAIQSEPTRP ADLLRKIDEK NITDILPLHV VAWPYLEHGI RTAEHQHIEE NGISESSYLR LSEDMLDFHP NVVWLGDIGW GFPWKEWCQE YTKHIKKAKN MRREKGLPEQ WPIFIAAFTD GPSLPRCQNV EAEVGKANVR YTSRSIVHNR RWNEAKKWVE TGEKLNMMKS GIIYRHTPLV VRTDTVKFLE EALRKRNMTL ADPIERLQRD VDVAHYWPHQ RDLDKVGTVG SLLRQEISKL LFAFGKNTNF NVFVGLKGEA VRKGRRGVAS DYIESLLETK IVVVSQRDRW EDHYRLMEAL VGGALVLTDR VLGMPAGLEN GTSVVEYESA DSLLSLIQYY LTHTEERLSI ARKGREAAMK KHRTWHRIEE IIFGESLSDC RFQGLDSPCP YVVHGVESKR
|
| |
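The gene and protein sequences can be cross-checked by translating the nucleotide sequence and by recomputing its GC content. The sketch below is a minimal illustration, not the database's own method. It assumes the standard genetic code, because the Translation table field in this record is blank, and it assumes the sequence is given on the coding strand, as displayed. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard genetic code (an assumption: the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGGAGTCAC AGTCTGAGCG GTCCCGTCGT"
print(translate(prefix))  # MESQSERSRR
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content row.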