Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_36976 |
Symbol | |
ID | 7204451 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 796410 |
End bp | 798086 |
Gene Length | 1677 bp |
Protein Length | 511 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185954 |
Protein GI | 219121461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTAT CCTCTTCAGC AACCACATCT TCTACCACAA CCAACACACC ATTACCGAAT ATCACCCCCA ACGCACTCGC CTCCAGTAGC GATACTTCCC CAGTAGTCGT CGCCGACACC AACACACTCT GGTCTCAATT GAACGAAAAG ATCGGCACGA TTGACGAATC ACGGATTACC TTTGCGGAAT ACGATAACGG CGACGTCCCG CGCATGTTCA GTGCTCTGCA GTACAATCGC AACGAACGGG ACGGGCAATT GCGGGCTTCG CATATGGCGG GTTCCGTCGG TGGCGCCGCA GCGCTGGTAG CTGGGACCAC CATCGGAGCC GGCGTCTTGG CCGTGCCCGC TGCCACTGCC GCTGCCGGAT TTCTACCCAG CTCCGCCGCC ATGCTCGTAG CTTGGTTCTA CATGGTACGT GAATTCGAGC ATCTCTCGGC GGACTACCGC ATCCGCCTGC ATGAATAATA AGCAGTTTCC TGAATCCATG CATACTCCCT CAACCCCTAT TCTCTCACAC TGTTTTAACT TGCCTTTGCT GCCACTCTTG TGTAGACCAT GTCCGGTTTG TTGATTGCGG AACTCACTTT GAATCGTATG GGTGGAACGG GTCGTCCTGG ACTCGGCTTG TTGGATCTTT ATAACAATAG CCTCGGTCGT ACCTGGGGCG CCGTGGGTAG TGCCGCTTAC ATGTTCCTGC ACTACGCCGT CATGGTTGCC TACGTGGCGC AAGGGGGAGC CAATCTCGCC AAAGTCTTGC CCTGGGATTC GGTGCCCGAC GGTGTCGGTC CCGCGGCCTT TGTCAGCGTC TGTGCCGTGG CTCTATTCAA CGCTAATCGC GACGTGGTAG AAAAAGTCAA CAACGGGCTC GTGGTCGGTG TTGCCGCCAC CTTTCTCGCC ATTGTTGCCG TCGGAGCTCA AACGGCCGAT TTTGGAGCCC TCGTAAATAT ATCGAACCAG CACCCGGAAC ACGTCGTGGA TTGCTTTCCC ATTCTTTTTC TATCACTGGT CTTTCAAAAT GTCGTACCCA CTGTGGTGGA TCAACTAGAA GGCGATCGGA GCAAGATCAC CAAAGCTATC ATTGCCGGGA CTACGGCTCC GCTGCTCCTG TTCCTGGCTT GGAACGCCGT TGTCCTCGGG AACGTGGCCG GTACGGGCGT GGACCTGTCG GTGGTCGACC CCGTCGCCTT GCTGCAATCT GGTGGCGGCG CGGGACTTTT GGGTCCCCTT GTAACTGGCT TTTCGACACT AGCCGTAGTC ACCTCCCTCA TTGGCTTTAC CTACGGCTTA CGGGACGGTT GGGCTGATTT ACTTAAACTG GATACCAAGA GTGCCGACTT TGAAGCAAAA TCCAAACTAC CGTTGTTTGC CTTGATTTTT GGCCCCCCGT TGGCCCTGGC GTGCGCCAAT CCCGATATCT TTTACGACGC CCTCGAATAC GGCGGTGCCT TTGGCGTGAG CACCCTCTTT TTGATTCTTC CTCCGCTCAT GGTATGGAAA GAACGGTACG GTGATGATCA AACGCCGTTG GCGACCAAGC CCATGGTACC ATTCGGCAAA CTACCACTGG GGAGTATGTG GAAAGCGGCT GGAACACTCA TTCTGGAACA AGGAGCCGAA AAACTAGGAG TGTTTGCGTT CTTGCAGGAA CACGTGCTGT CCAAGTTTCA GTCGTAA
|
Protein sequence | MNVSSSATTS STTTNTPLPN ITPNALASSS DTSPVVVADT NTLWSQLNEK IGTIDESRIT FAEYDNGDVP RMFSALQYNR NERDGQLRAS HMAGSVGGAA ALVAGTTIGA GVLAVPAATA AAGFLPSSAA MLVAWFYMTM SGLLIAELTL NRMGGTGRPG LGLLDLYNNS LGRTWGAVGS AAYMFLHYAV MVAYVAQGGA NLAKVLPWDS VPDGVGPAAF VSVCAVALFN ANRDVVEKVN NGLVVGVAAT FLAIVAVGAQ TADFGALVNI SNQHPEHVVD CFPILFLSLV FQNVVPTVVD QLEGDRSKIT KAIIAGTTAP LLLFLAWNAV VLGNVAGTGV DLSVVDPVAL LQSGGGAGLL GPLVTGFSTL AVVTSLIGFT YGLRDGWADL LKLDTKSADF EAKSKLPLFA LIFGPPLALA CANPDIFYDA LEYGGAFGVS TLFLILPPLM VWKERYGDDQ TPLATKPMVP FGKLPLGSMW KAAGTLILEQ GAEKLGVFAF LQEHVLSKFQ S
|
| |