Gene Information |
Locus tag | PHATRDRAFT_50806 |
Symbol | |
ID | 7197801 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 903075 |
End bp | 905137 |
Gene Length | 2063 bp |
Protein Length | 588 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178326 |
Protein GI | 219115061 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAATTCCT GGAACAATAT TCCAATTTTC AGGAGAGAGA AAGAACTCTT GACCGCAACA ATGAGGGCGC AGGAGCGGAG TTTAAACTCG TATAAGTCAA ATAATTGAGA GCAGCGCGAT GAAGAAAATT TCTGTTATTC ACTGCTGGTC TGCTCCGCGA AGTCGATCTA CGGCGCTAAT GTATAGCTTC GAGGCTCGAG GATCTGACTG CGCTGCGCTG GATGAACCGC TGTACCGCGA ATGTCTGATT CAACGCGGCG ATGCGGTTGC CCGGCCATAC CGTAACGAAC TCATTAACGG CACACCTCCA TCTGGGAGCA TAACAGATCA AGTAGATGTG TGGGTGCGGG AACTTTCTAG CTTGGAGGAG CGCATTCGAT TGTGCGCACA GCGTTTGCCT GAAAACGGTG TCGTCTTTTG TAAACACATG GCGAAACATT CGTTTCTTTA CGACTTTCAG GAGGAATTTT TCGCGGATGA CCTGAACATT AAGCTCATTC ACAAGCATCT CTTCTTAATT CGCGATCCCG TGGCAGTCCT ATCGTCTTGG GGAGCGTCGG ACTCAGTGCA TGGAAGCAGC GCCACTCCTG ACGAAGTTGG GATTGTTCCC ATGCTCTCCA TCTTTTCGGC TCTCTGCAGC CGTCCCCACA GAATACGCTC TATTGTTTCG TTCCTTGATT CGGACGAACT AGTCAAAGAT CCGGAGAGAA CTCTGGGATC AGTTTGCGAA GACCTGGGTA TCCCGTACAA AGAATCAATG ATGTCGTGGC CAAGCGGTCC TCATGCTTGC GATGGTACGT TTGAGGTCGC CGGTCAAGAT TTGATAGATA AACAAAAAAC CATCCGGACT GCCATCTCAC TTTGTTAAAA TTTTAGGACC CTGGGCTTCC TGGTGGTATA GCGACGTGCA TCAATCGACA GGATGGAAAC GGAAAACACC CGACTATGGG GCTAGTCGGT ACCGAATTCT GAATCCCGAT TTAATGGATG CGCTAAAGGT CTCCTATCCT GCCTACGAGT TCTTGAGTAA ACTTACTAGG GGATATCAAA AGCGGGGTCC CTCAACCAAA ACACTGTATG AAGACCCAAG AAACGAGCAT TTACTGACTT ACATCGGCGC CCCAGGTCGA GGCAGAATAA TTCCGAGATC CATGGCTGGT GTGAGTCCGT GGGATTCATC AGTACAAGGC GGAGACGCTG CATGGGAAGG ACTCCGAGTA TACCGTGGAA AAGTGCTTTC GTTGGATAAA CACCTTCAGC GTCTTTTTAA ATCGTCGAAA GCGCTAGGTT TTGAGAACGT GCACACCAAG GCAGAAGTAG TGGAAGCCAT TTTCCGCACG CTGGCAGCGA ACGGTATGCG GGACGGGGCA CACATGCGTC TTACATTGAC ACGAGGTGAA AAGTGTACCA GCAGTATGAA TCCAAAGTTT AATGTATACG GAACGACGTT GATCATTCTA GCCGAATGGA AGCCCACGGA GGGTGCCACG ACCTACAACA ATACTTCCGG TATTGCGTTG ATTTCTGCGT CTCAACGGCG AAACTCACCT CAGACGGTGG ACTCCAAAAT CCACCACAAC AATTTGATCA ACAATATTCT GCCAAAGATT CAAGCAAACT TGGCGGGATG CGATGATGCA ATTATGCTCG ATCTCGAGGG TTTTGTATCG GAGACAAACG CTACCAACAT TTTCATGGTT GATAATGGAG TGCTGTTGAC GCCGCATGCT GATCATTGTC TGCCAGGGAT CACTCGAGCT ACCGTTTTGG AACTGGCGAA AGAAATCAAT ATACCTACCG 
AAACTCGTCG GATTTCTCTT GCCGAATTCC ACGCCGCGGA TGAGGTCTTT ACTACGGGAA CCATGGGCGA ACTGACTCCG GTTCGCATGA TCGACGGTCG GGTCATTGGT ATCGAAGGAA AGCGGGGTCC GATTACTGCC AAACTACAAA AAGTCTATCA GAGTTTGCCG GAACGTTCTG GTTGGGCTAC GGAGATTCCG CCTTTTGAAG CCTAAGTTTT TGGCAACGGA ATAACTACGA GTATAAAGGA TGAATTTATT GCG
|
Protein sequence | MKKISVIHCW SAPRSRSTAL MYSFEARGSD CAALDEPLYR ECLIQRGDAV ARPYRNELIN GTPPSGSITD QVDVWRLPEN GVVFCKHMAK HSFLYDFQEE FFADDLNIKL IHKHLFLIRD PVAVLSSWGA SDSVHGSSAT PDEVGIVPML SIFSALCSRP HRIRSIVSFL DSDELVKDPE RTLGSVCEDL GIPYKESMMS WPSGPHACDG PWASWWYSDV HQSTGWKRKT PDYGASRYRI LNPDLMDALK VSYPAYEFLS KLTRGYQKRG PSTKTLYEDP RNEHLLTYIG APGRGRIIPR SMAGVSPWDS SVQGGDAAWE GLRVYRGKVL SLDKHLQRLF KSSKALGFEN VHTKAEVVEA IFRTLAANGM RDGAHMRLTL TRGEKCTSSM NPKFNVYGTT LIILAEWKPT EGATTYNNTS GIALISASQR RNSPQTVDSK IHHNNLINNI LPKIQANLAG CDDAIMLDLE GFVSETNATN IFMVDNGVLL TPHADHCLPG ITRATVLELA KEINIPTETR RISLAEFHAA DEVFTTGTMG ELTPVRMIDG RVIGIEGKRG PITAKLQKVY QSLPERSGWA TEIPPFEA
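The derived fields in this record (Gene Length, GC content) can be rechecked from the coordinates and the sequence text. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported length of 2063 bp, and that the sequence is pasted as space-separated 10-base groups as shown above. The helper names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to group bases, and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(903075, 905137))  # 2063

    # First two 10-base groups of the gene sequence, used as a small demo.
    fragment = clean_sequence("CGCAATTCCT GGAACAATAT")
    print(len(fragment), gc_percent(fragment))  # 20 40.0
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` gives a value to compare with the GC content field. Small differences from rounding are to be expected.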
|
| |