Gene Information |
Locus tag | PHATRDRAFT_39735 |
Symbol | |
ID | 7195450 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 575863 |
End bp | 577153 |
Gene Length | 1291 bp |
Protein Length | 311 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183636 |
Protein GI | 219126798 |
COG category | |
COG ID | |
TIGRFAM ID | |
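The Gene Length and GC content fields above can be cross-checked against the coordinates and the gene sequence in this record. The following is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based and inclusive (which is consistent with the reported 1291 bp), and the helper names `gene_length` and `gc_fraction` are hypothetical, introduced here only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace is removed first, so the space-separated 10-base
    blocks used in the Gene sequence field can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp from this record.
print(gene_length(575863, 577153))  # 1291, matching the Gene Length field
```

Passing the full Gene sequence string to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.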
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.326114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGT TTTTGAAGTC TGCTCCAGCG TTGCTGGAAG ACCCCCGGGA GCTCCCACAA CCAGCTTCTG AAAAGCGGGG CTTCGGAGGA GCCAAAATTG CCACACGAAT CGAACCATCC CGCTCCGAGC AGTTCCTATC GTGCCTCGTG AATTGCCAGC ATTTTCACCA GCTACTAGAG CTGCTTGACT CCGAACCCCT TAGATTTTTC CACGAAATCA TGGAAGTTAT AAACACAAAT GAAAGGGGAG AAATACAGAA TAGTATTGGT CTACACCATT CTGACGACAA CGAAAATTCG ATTCAGATGC TGAGCTCTTT GCTCTCAAAA CGTGAGGACA AGGTCCCAAC GTATCGTGTC AAGAAGGCAG CAGTCGAGAT CAATCATTCA AGTGTTGAAG ATGAAACTAC GGTTGCTCAT ATTTCTTTTT CCAAGTTCCC CACTGATCTC CTGCAGCAAA TTGTTATAAC CTCGGAAGAA AACCAAGACA GCTCGGATCA AGCGAAAGGT GTTAAGCAGC TTTCTCTCGA GTATCTTGTC GAGCTAGAGA AGAACCCATG GCGCTACCAG AAGCTTTTTA ATTTTGGCGA TGGTGGGAAA GCAGCCGAGT CTTTTGTATC AATTCCTGAT AAAGTCAAAC TTCGCGAGGC TATTGGTGAG CGTCGTTACA ATGTTTGGAA ACTTTCTCTC GAACGCGATG GCGACGACAA TAGCACTATT TCGGAAGGTG GCAGAACTAG AACGGGAAGC CAGTCATTGA ATTTGGGAAG TCTTTTAAAC AACGGTACTG ATGAGATGAC ATCGTCTTTC TCTGCGTCGG CTTCGGTGAC TGCACCGCAG GATTCCAGTC GCGATGCGTC TCTTAGAAAG GTATCTGATG CAAAACATAG AGACGTTGTT CGACGTTGTC TTGAGCTTGC AAACTTCGCT ACCGGTAAAA GTGCAGGCAA AATTAGCGAG AAAGAACTGT CAGTGATTTC GGAAGCGGAG ATTGCGCTTC GTAGCCCTTC AGCACGACAA TATTTACTCA CAATCCTCGG AAAGCGGTGG CAAGGAGATG GGACCAAGAA CAACAACCTG AAACCAAACC TTCGCTCTTC GAATACTGGC GAGAGGCTAG ATCGTCATGC CTTTGAAGTG CTGGTTCGGA TTGGATGCAC TATGCTGGAT GCGTGCCTGG ACTTCAAGGA GTATGAGTCG GCATATACTT TACTGAAGTA CACAGCTGGG CTATACACGA GCGCAAGCAC AGAAACTGCA GTCGCAACCA ACTACGTCAC TGCTCGACTA A
|
Protein sequence | MEKFLKSAPA LLEDPRELPQ PASEKRGFGG AKIATRIEPS RSEQFLSCLV NCQHFHQLLE LLDSEPLRFF HEIMEVINTN ERGEIQNSIG LHHSDDNENS IQMLSSLLSK REDKVPTYRV KKAAVEINHS SVEDETTVAH ISFSKFPTDL LQQIVITSEE NQDSSDQAKG VKQLSLEYLV ELEKNPWRYQ KLFNFGDGGK AAESFVSIPD KVKLREAIGE RRYNVWKLSL ERDGDDNSTI SEGGRTRTGS QSLNLGSLLN NGTDEMTSSF SASASVTAPQ DSSRDASLRK LGYTRAQAQK LQSQPTTSLL D