Gene Information |
Locus tag | PHATRDRAFT_47359 |
Symbol | |
ID | 7202511 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 386426 |
End bp | 388002 |
Gene Length | 1577 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181547 |
Protein GI | 219122428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGCCA AGGCCATGCC CAAGTTACGT ATGGACGGCT CTCCTTTAAA ACGGCCACGA TCCATTGATT TACCAAAGAT CGATGTAAAT GAAGCGATTG CACATCTGCA AAGGGCCTTG ACGACAAAGC AACGCCTGAT GGCCTTGCAT CACGTTCAAA ATCTGATCGA GGGCGATCCG ACGGCAATTT CTTGGATGGT AGATTCTGGT CTGATACGCA TTCTGCAACT GCAGCTTAGT TATGCTCTTC AACGGCATGG ATCAACAAGT CAAGAGCTCG GAACACTTTG TCAAGTTTTT GATCTTGCGC TCCGGACTTC GCCCGCTGGT TCGCTGGAGA GTGCTCTCGA CAAAGAAGCA GGGCGATCCC TGGTAAACCT TGTTGCCGAT GCTTTCCCAT GGGGATTCCA CCATGTCATC GTGTCAATTT TGCACACTAT TTCGCAAACG AGCTCTGGAG CTTTTCTGAT CCTTCACTGC AGCAAAGCAA TGCATTGTGT TACTGAACTT TTCCGATGCT GTCGTGCGTC ATCCACCAGC AAAGAAGCTG TCTTCGAGGC CCTGGGATTG CTTAAGAACC TAACCTACTT TTCGGAAGAA TCACGTAATA TATTGCTTGA CTTACCAGGA ATCGTCGGGT CACTAGCGAA CGTGGCTGTA TTTGTTGATC AAAAGGGTCA CGAGAGATTG TCAGCTATCT GGCGCAACCT CTCCGTATCA ATGGAGACAC GACGGCGTTT GGCGCAGGAT CCTGATGTAT TAAACGGCCT TCTAGAGCTG GCTGATTGTA CCTGCTCTTA CGCCCTACGA AATTTGCTAA ACACAACAAT CAGTCTTTCA ATGGACCCAG AATCATGCGT GATACTTGTG TTGCACGGAG ACGGAATCTT TGTAAACGTC TTGCGGAGAC TGCTAGTCAC CGAGACAGAT GCGCTCATTC GAAAGCGTGC AGCACGCGTC ATTAAGCTTT GGGCTTCAAA CGATTTTGTC GGTCCAATAC TAGTCAAAGA CAGAGCGTTG ATGGACGTTC TGTCTCAGCA GGCTTTGCAA GACCAAAACG TAGATGTTCG CCACGAAGCA GCTGACGCAT TCTGCCGGTG TTCTCAAAGA ATCCAGTCAC CAATGCCTCA GCACCAACTG GTGCTAGATG CGATCATGTT TCTGGCAGAG CAGTCAAGAT TACCCGCTGA AGTGCTAGCA CGAACTCTGA AAGCGCAAGC ACTTCATCCC AGAAATCGCA TTCCAATGGC TGAGCGCAAT TCGCTACTTT CTGCACTAGC TCGCATTGCT CAGCAAGAAG GTGTCCCCAA TTCAGCTCGC GAAGATGCAT GCTGTGCTTT GGCTTATCTG TCAGACGAAG CCGCCAACCT GCCAAAGCTA TCAACCGCTG GTATCGTTGA AGCAGTCACG GTCAATGCGA TTGGCGGTCG TGGCCTCAGA AGATCTTATG CAGTGCAAAC TATTGTAAAT CTCACCAGTA CAGCAGAGAA TCTTCCCAAG CTAGCTACAC ATACAAATCT TCTTCAGGCT CTGATACAAT TTGCGGCTAC TTCAATAGAA GACCAACTCA AGTCTAA
|
Protein sequence | MVAKAMPKLR MDGSPLKRPR SIDLPKIDVN EAIAHLQRAL TTKQRLMALH HVQNLIEGDP TAISWMVDSG LIRILQLQLS YALQRHGSTS QELGTLCQVF DLALRTSPAG SLESALDKEA GRSLVNLVAD AFPWGFHHVI VSILHTISQT SSGAFLILHC SKAMHCVTEL FRCCRASSTS KEAVFEALGL LKNLTYFSEE SRNILLDLPG IVGSLANVAV FVDQKGHERL SAIWRNLSVS METRRRLAQD PDVLNGLLEL ADCTCSYALR NLLNTTISLS MDPESCVILV LHGDGIFVNV LRRLLVTETD ALIRKRAARV IKLWASNDFV GPILVKDRAL MDVLSQQALQ DQNVDVRHEA ADAFCRCSQR IQSPMPQHQL VLDAIMFLAE QSRLPAEVLA RTLKAQALHP RNRIPMAERN SLLSALARIA QQEGVPNSAR EDACCALAYL SDEAANLPKL STAGIVEAVT YSRESSQASY TYKSSSGSDT ICGYFNRRPT QV