Gene Information |
Locus tag | PHATRDRAFT_49525 |
Symbol | |
ID | 7195742 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 509345 |
End bp | 511601 |
Gene Length | 2257 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184266 |
Protein GI | 219128113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0165471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCATTCAGC GTATTGCGAA CGATACCTTC CTTCCGTTAC CCAAGTAAGC AATTCGTGTT CGTCCTGTGT GAATGTGCAT GTGTGTTTGT ATGTGTTTGT TAGTGTGTGT ATGGGGGGAG GGAGAGGAAA GTAAGCACAA GGGAGGCGCC TTCCATCGCA AAGAACGGGT CAAAGGGAGC GCCAGAGTTC GAATGTGTGA GAAAAAGAGA GAGATGCCGA GCATTCGTAC AAACTAATGT TCGGTTGCTT GGTGGAAGCA CCAAGGGTTG CACGCAGTGC TGCCGTGCTT TGCTGTAGTT CTATTTGGTA CACGGCGCAC GTTTGCCACC CGACACTATA TCTCACGCAA GAGCACCTCT TTGCTTGCTT GTTCGTTACT GTTAGGGATG ACCACGGCGC CACTCACTGC AGCTCAATCT TCTCCCTTGA CTATTGCCAA TACGACGACA ACGACAATCA ATAGCAAGCC TCTCGTGGAT TCTCTTACCA ATACAGTTCC AGAGACAGTC CAATTCTCTG GACGAGACGG TAATTCCTCA AACAGTGAGA CCAACAAAAG CAACAGCAAC AGGACTCCCA CTAAATTTCC TGCGTTGTCC GAAGTGTCGC ATCGACGCTC GGCATCGGTG TCTACGGCTC ACTCCGTCAC GTCGAAATCC AAACAGAATT CTACTCCACC GGTAGACATG GCAACGGCGT CCGGATCAGC CAGTCAAGGC TCGCACGGCG AAAACACGGG ACGCTGGACC GCGGAAGAAC ACCGCTTGTT CTTACAGGGG TTGGAACAGC ATGGCAAGGG ATGGAAGAAA ATCGCGTCGC TCATCAAGTC GCGAACCGTC GTACAGATTC GGACGCACGC CCAGAAGTAC TTTCAGAAAT TGGCCAAGGC TCGCCAAAAT GGGGAAGAAG GCGATGTCGC CATGGAAGGT CGCGGTGGCG TGGCTTCCAT TACCTCCGTC TCGACAACTG CTGTTTTACC CAAGCGACGT CGCCAGACAA CCGGAACAAA ACGCAAGGCC ATTCAATCCG TCGTGGCTTC CGCCCAGCGG CAAGGCAAGA AACTTGCCGC CGCAAAGACG AATCCTACTC GACACCATCC CTTGCCGCCG CCCCTACCAA CGGTCGCCCC CGCACTCGCG CATTACACTC TCCCCAGTAC TGCGATGATG GCCAAAAACG GCACCGCAGT GAAGGAAGAA TACGTCTCGC CCACCAATCT TTCAGGACCG GCCCTAGAAG ATTCATTGTA AGTCGTCCCC ACCGTCGCAT CGTGGTAGCC TTTCGTCTTC GTGGGTTGGA AAGCTGACCC ATTCCATTCT CTTCTTTCGC AGATTCCGCT TCTTAACCCC GCTTCCGGTA TCGGAACCAC CGCTCAACGA AGTAGCTCGT CAAGCCGGTG CCAACCCCAT TTCTCTCCCC ACCGACAACC CAAGCTCTAT TCCAACGGTG GGTGCAGGAG AAATCTCGCC CACGGGAGTT TCGGATTTGA TGCTTTACCC CTCGTGGACA GACTCAAAAG AGCCACCTTC TTGGTACAGC AAGGGCGCCG ACATTGACGC ATTGCTCGAT ATGGGGGATT CGTTGGACTG GTTGGACGAC ACGGGGGATT TGAACGAGTC ATATGTACCA CCCGTCGTGG ACACAGCAAT GGCCGCTCCA GAACCGCACA CGACCTTTCA CAGGTACTCC GATCTGGGAC ATTCAAAGGG ACTTCACAGT ACCAGTGTGA CGTCTCTGCC ACATGTCGAT TCCAACGCAA ATGTGGAATC CGTTGTGCCG CCACTTCCCT CCATATTCGA 
TGGAGCCCCC GACTCGGGAG AGCATCTTGA GACCACGGAA GGGATGGTAC CTTCCAACAG TACTTCTCAC TTGGCGGATG AAATCGACGA CAGTGAAGGC ATACACGAAC ACCTACAAGT ATTTGACAGT CCTTTGGAGG AGAACGACTT CGTATCGGCC ATCCTCGAAG AAGACACGAT TGATGTTACA GCAGCTCTAG CAGCGAGCTA AAGAGCGGGA CGGCTCTGTT TGTACCACCA CGATGCATAC TGCCAAAAAG TTGTATCGAC GTACCCACGC GCATGCGTAA CGTGTTCACT CTTGAAAGTT CTGGGAAACC GCCATCACGC TCCCCCCGTT TCATCCCATA TTATCTCGCA CCAAATACAC AATCGAAACA ATCACCGTCA TGGAATTTCA TTTCATGACG GCGGCTTTTA TCTTCTTCCT TGTAATAAAA TTGCCTTAAT TTTGTTC
|
Protein sequence | MTTAPLTAAQ SSPLTIANTT TTTINSKPLV DSLTNTVPET VQFSGRDGNS SNSETNKSNS NRTPTKFPAL SEVSHRRSAS VSTAHSVTSK SKQNSTPPVD MATASGSASQ GSHGENTGRW TAEEHRLFLQ GLEQHGKGWK KIASLIKSRT VVQIRTHAQK YFQKLAKARQ NGEEGDVAME GRGGVASITS VSTTAVLPKR RRQTTGTKRK AIQSVVASAQ RQGKKLAAAK TNPTRHHPLP PPLPTVAPAL AHYTLPSTAM MAKNGTAVKE EYVSPTNLSG PALEDSLFRF LTPLPVSEPP LNEVARQAGA NPISLPTDNP SSIPTVGAGE ISPTGVSDLM LYPSWTDSKE PPSWYSKGAD IDALLDMGDS LDWLDDTGDL NESYVPPVVD TAMAAPEPHT TFHRYSDLGH SKGLHSTSVT SLPHVDSNAN VESVVPPLPS IFDGAPDSGE HLETTEGMVP SNSTSHLADE IDDSEGIHEH LQVFDSPLEE NDFVSAILEE DTIDVTAALA AS
|
| |