Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44593 |
Symbol | |
ID | 7197612 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 994077 |
End bp | 995742 |
Gene Length | 1666 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178338 |
Protein GI | 219115085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCC TGTTTGGGCG GACGAAGAAG TTGGGCGCAG AACAAGAACA GCCAACTCCT TTGGCAGGCT CAGGTGTTGT GGGCACCACT CGGCCGCGTA GAGATCGACA GCAATCATTG TTGAAGGAAC CTGAACAAGG GTCTATGGTA GCTTGTCTGG AACACAAATC TCAGCGGGGA GATGTGGAAC CTAGCTCGTC ATATTCTCTC ATGAAATCGA ACAAGAGCGA CAATATGCCA TCCGCAGTTA ATGACGTCGA AGCCGGTGCA GGGGGAAATG AAGTTACGTC AAAGATTCCT GCTTCCAATG GTCTGTCCGA TATTTTTCAT CGATTACGCT TCAGTGCTAC ATCATTCACG AATGCAACAA AAGACAAAGT CGCAGCGTCG GGGACAGTCG CAAATGAGAC TGTCGCCCCG ATGCAAGGCC CCACGTCCAA GCAGCCAACG TACGAAGGCT TTCAAGGGAA GATGCGGTAT CTGGCAACTG TTGTATGTGG GGGATGGTTC GTGTATGTGC TTCTTTTCGC CATTATGGCA ACAGGGCTTG CCTATGGTGC TCTCAATGAT GGAAATTTTC TTGACTTTGT TGATTGGGAC AAAAACATAA ACGTTGCTTT CGTTGGAAAC TCATATTTCT TCCTGAATGA CATTCCGCGA TTGGTGGAAA CCATAGCCGA CGGACACGTA TACCAAAACT CAGTGTTGCA TTCGGGAGGG TCTTTGGGCG GCCTCCTTGT CACTGGCAAC GGCATGTATC CTCGGTGGCA AACCAACGAA GCAATTTTAG ATAGCAGTTA CAAGAACAGC TACGGAAATT CCATGACGCT CTACGACTAC GGTCTTTGTA CCGTCCAGCA AATGCTGAAC GGGTCGGATT CAAATGTGAC GTACTTGAAC CAAAATTGTA AGTTGGATTC CATAAGTTTG GTCTTATTGT CGCCCAATGA TTGAATCTGA CCGTTCCCTT TTCGAAGACG CTTACTACAA CGATGGTCAC AATCCATGCT TTGCAGATCA AAACTATCTA ACATATACAA CAAATCAACT CAGCCAGAAC AAAGTAAAGT GGGATTATGT TGTATTGGTG GACCAGACCA AGCGTATGGC GATTGAGTCG GCAAGAGAGG ACACAACGTA TGCTCTGGCC AATTTCTACG CGCCGCTGCT GAAAAAGACC GGCGCGGTCC CAGTGGTTGT AGATACACAT GCATTTTGGT CATCCAATAC CAATATGACG GGCCTGGGAG ACATTCCAAA TTTTCAAAAG ATGATCTACA ACGGTGTTAG GGATTACGTT TCAGTTCTAT CCAAGCATCT GCCGAGGAAG CAGACTCCCA TTGTAGCTCC AATTGGCATT GCATATTTGA CAATCTACGA TGAAAAGCCA GAGCTCTACC AACAATTGTT CATGGATGAT GGAGTGCATG CTTCATTCTA TGGCTCATAC TTGTTGTCAA TCGTTATATA CACAACAGTT TTCGGGCATT TGCCTTCGGA TCAAGTTTCT ATCCCTGAAC GGATCGAAAA TTTATTCGCT AACTCCCGGA AGCTTGTCTA CGGGGAGTCC ACGGCCGCCT TCCCTAGTTA CGATGAGGCA TATTATCTTC GAAATGTTGC GCGGCGTGTT GTTTTGGGAC ATCATCGACC AAAGTCGTTT CAATAA
|
Protein sequence | MKRLFGRTKK LGAEQEQPTP LAGSGVVGTT RPRRDRQQSL LKEPEQGSMV ACLEHKSQRG DVEPSSSYSL MKSNKSDNMP SAVNDVEAGA GGNEVTSKIP ASNGLSDIFH RLRFSATSFT NATKDKVAAS GTVANETVAP MQGPTSKQPT YEGFQGKMRY LATVVCGGWF VYVLLFAIMA TGLAYGALND GNFLDFVDWD KNINVAFVGN SYFFLNDIPR LVETIADGHV YQNSVLHSGG SLGGLLVTGN GMYPRWQTNE AILDSSYKNS YGNSMTLYDY GLCTVQQMLN GSDSNVTYLN QNYQNYLTYT TNQLSQNKVK WDYVVLVDQT KRMAIESARE DTTYALANFY APLLKKTGAV PVVVDTHAFW SSNTNMTGLG DIPNFQKMIY NGVRDYVSVL SKHLPRKQTP IVAPIGIAYL TIYDEKPELY QQLFMDDGVH ASFYGSYLLS IVIYTTVFGH LPSDQVSIPE RIENLFANSR KLVYGESTAA FPSYDEAYYL RNVARRVVLG HHRPKSFQ
|
| |