Gene Information |
Locus tag | PHATR_46912 |
Symbol | |
ID | 7204743 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 789641 |
End bp | 791979 |
Gene Length | 2339 bp |
Protein Length | 693 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185953 |
Protein GI | 219121459 |
COG category | |
COG ID | |
TIGRFAM ID | |
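
The length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 789641
end_bp = 791979
protein_length_aa = 693

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein: one codon per residue plus a stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Remaining bases are not translated (intron or untranslated sequence),
# which is consistent with "Is gene spliced | Yes".
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 2339 2082 257
```

Under these assumptions the computed span matches the listed Gene Length of 2339 bp, and 257 bp of the span are not accounted for by translated codons.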
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.744308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTCAAGTA CGACACGTGC ATGTAATTGT GCTTTACTAG CTGGTACGGA GATCATCATC ATCGTCGTCT TGGCGAACCT TACGCGTCAT GTCGAAACGC AAAGGCGAAG ATCTCCCGGT GTCGCCCGAT GACTTTTCAC CCCCGGGTGT GAAAACACGT TCGATTCGGA AAAGTGGGAA ACGTCAACGG CGGATCCAAG TAGCCGAAGA GTCGGACGCC GAACCCTCGT CTCCGTCCGT CGCGGCTTTT CAAAACCCAC CATCTTCGTC GTCATCGATA CCAACTGATC GACAATCCAA CAACAACAAC AACAACAACA ACAAGAAGGA TGACGACGAC AGGGACGATT CGTCGGATCC CGAAGACCCC ATGGAAGACT CGAATCGACG ACGACGAACC AACCCGAGTC GAACGTTGCT GCAGCCGTGA AACCTTCGGC CAACGATTCG ATACCGGTGC CACAGTCGTC CGCACCATCC TCCTCTACGG CTCGTCCACA AACCACCAAG CCTGTTAACA ACGGTAAAGG TAAAACACAC GACACTGCAC GGCGCAAGGC TCCACCCGCC GCATCCGGCA GCCCTAAGGA AGCGGGAGAG AACGTTCCAC TTTCCCGTCG ACTAGACTAT CGAGAAACCG ATACTCGCAA GGAATCTCCA ACAACGTTGC CTACCGAATC GGTTTTGCGG GAACGGGACG TGTTGGAAAA CGGACACGAC AAAACCACCA CAGACGACGT TGATGCGGAG GAGCATCGCC GCAACAACGA CCACGACGAC GTTCGTCCAC ACGTTTCGGT ACACATTCGC GTTTACCAGT TTGTGACGGA ACTCGTTGCG GGACCGACCA GCGCACCCCA GGTCGAGACG GAAATGGACG TACCGGTAAA TACGGATACT GCGTCTGCGG GATCACGCAC ATGGATGCAG CTGTCCTGGG TTTGGGTCCT ACTCATCCTC GCGTGGCACA TCATTTGCTT TCCATTGGCC ATCAATCCAG GCTTGGTCAC GGTATCGTCT ACGGGCAATT ACCTGACAAC GGTCTATCGA GGTATACTGC AACCGGTCCG ACCGTTTCTC TGGAACAATC GTCCGAAGCA CAGGCCAAGC TGGCGCAAGC GGTGCAAGGT CTGCAAACCG TACAACATCA GCAGCTGGCG CGATTGCGCA CCGCGCGGAT CGCGCTGGAA CAAGCCGAGA ACGCGTTTCG GTCACAACAA CTCGCCGCGG AAGAGAATCT GGCGAGACTG GAACAGAGTT GGGAGACCGA GGCGTCTTCT GTGATGGAGC GATTGGAGAA AGAGGAAGCA ACGGCTCGGA CATTGAATCA CTGGATCGAA CAAGTCTTGG TGGAAGTTCC GGACGACGAA GAGGAAGAAG AATACGTGGA GGCGGTATCT GTGCCGCCCG AAATTCGCAA TGTTCTCGGC CCCACACAAG ACGCGCTGTT GGACTCTTCG TTCATCACGC TATGGGATGT TCCCGAACCA GTGATTTGCG AAACACCCGA TGTCAGCGGT CTAGCTGGTG GGCTATACAA GGAAGATGTG GAGCAAGCCA TTTCGGACCT AATAACAGAT ATTGTACAAG TTGATGAGGA AATGGAAGAA ATGGTCCGAA AATGGGTGGA AAACTATCTG GACACCAAAG CAGGAGATGC AATGACGACG ACGACCACCG CCGATGCTAA TATACCGCCA TTGGATGGTG TCGTCGACGC TGATGCATTG AAGAAGCTCC GCGCGTTCAT TGACGGCCGT ATGGAGGTTG AACGTGCCGA CCAAACTGGC TTAATTGACT 
ACGCATCGTT ACTGAATGGT GCTCGGATTA TAAGGGTGGG CGACCGGTCA ACTAGTATGT CGCTCGTAGA CCAACTACCG GTCTTTAATC GGTTGGCCGC TCTACTTTCG CTTCGCTTTT ACGGACACGG ACCGGAAGCG GCTCTGCTAC CCACGTATCC ACCCAATGCT CTCGGGCAAT GTTGGTCCTT TGAGCCACCA GCCAGTAGGC GAAGCGGTCC CTTTGGTGTA TTGACGGTGC AGCTTTCACG CCCAATTCAC GTGCAGAGTG TTTCTATTGA ACACCCACCA CCCGAACTGA CGGACAAATC CCAGACAGCA ATCCGGTCCT TTCGTATCGA GGGCTTTGAA GATACGCAAA CACACGGCAA AGCGCACTCC CTCGGCTCGT TCGAGTACGA CGGCCAGAAA GGTCTGCGTC AAGATTTCGA TGTAGACCGA AACGTTCCGC GGTTGCAATC CATCAGTTTA GTCGTTGACA CGAACTGGGG CGAACCGTAC GCTTGTCTCT ACCGGTTCCG AGTACACGGT CAGGAGTAG
|
Protein sequence | MSKRKGEDLP VSPDDFSPPG VKTRSIRKSG KRQRRIQVAE ESDAEPSSPS VAAFQNPPSS SSSIPTDRQS NNNNNNNNKK DDDDRDDSSD PEDPMEDSNR RRRTNPSKTH DTARRKAPPA ASGSPKEAGE NVPLSRRLDY RETDTRKESP TTLPTESVLR ERDVLENGHD KTTTDDVDAE EHRRNNDHDD VRPHVSVHIR VYQFVTELVA GPTSAPQVET EMDVPVNTDT ASAGSRTWMQ LSWVWVLLIL AWHIICFPLA INPGLVTVSS TGNYLTTVYR AQAKLAQAVQ GLQTVQHQQL ARLRTARIAL EQAENAFRSQ QLAAEENLAR LEQSWETEAS SVMERLEKEE ATARTLNHWI EQVLVEVPDD EEEEEYVEAV SVPPEIRNVL GPTQDALLDS SFITLWDVPE PVICETPDVS GLAGGLYKED VEQAISDLIT DIVQVDEEME EMVRKWVENY LDTKAGDAMT TTTTADANIP PLDGVVDADA LKKLRAFIDG RMEVERADQT GLIDYASLLN GARIIRVGDR STSMSLVDQL PVFNRLAALL SLRFYGHGPE AALLPTYPPN ALGQCWSFEP PASRRSGPFG VLTVQLSRPI HVQSVSIEHP PPELTDKSQT AIRSFRIEGF EDTQTHGKAH SLGSFEYDGQ KGLRQDFDVD RNVPRLQSIS LVVDTNWGEP YACLYRFRVH GQE
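
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned and its GC content recomputed follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC content figure was calculated.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same ten-per-block layout (not taken from the record).
example = "GGCCAATTGG CCAATT"
print(len(clean_sequence(example)), gc_percent(example))
```

Pasting the full Gene sequence field into `gc_percent` would give a value to compare with the listed GC content, and `len(clean_sequence(...))` a value to compare with the listed Gene Length.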
|
| |