Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48558 |
Symbol | |
ID | 7194727 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 209335 |
End bp | 211237 |
Gene Length | 1903 bp |
Protein Length | 468 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183048 |
Protein GI | 219125567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACGTTTATT CTTCAAATTC GCTTACATTG ACTGTGAATA TACGTTTCGT GGGTTTGAAA GTTATTGACT GTTATCGCGA ATATAGAAAG ATTGCTTCCT TGAATATAAT GGGATCTTTC CGTACGAGTA TCCTCTTGAT TTTTTTTGCT ATTCCGTTTG CCCAGCGCAA TCACACCATT CACTGTCAGC TACGCACAAT TTAAGGCGAT CCACTCCTTT GCTGCAAACA ACGCTGTCCG AACAGAGTGT CAAACTAGCT ATATATCTTC TTTTTGACAT CATTATTTGT GCAAGAATGG ACAACAGGCC TGCACCGCTT ATGATAAGCG ATGATGAGGT CAGCTTCTCG GGCAACGCGA AAGTGTCGAA GACAGCAAAT TGCGCCACCA AATCCTATGG CGGCTACATT GATTACGCCA AAATATCATT CGATCAGGTG AGTCAGAGTA TTCGCTCCCC AGTAGCCATG CGTCGTGGCC CTCGAGGAGG AGTCACGGTA CCTTTTCCAG CAAAACTTGC AAGTTTGCTT GAGGAAGTAT CTCTGGAGGG AATGGAAGAC ATAGTATCTT GGGGTGCACA TGGCCGATGC TTTTCAATTC ACAAACCATC TCAATTTGTC TCGGAGCTCC TACCGCGGTA AGTTGGGTTC CTGTACACTT CGACGAAACC TATCACTGAG CTCACTCGAA TGTAATTGCT TTATCGTATT TATTAAGATT CTTTCGCCAG AGCAAGTTAA CTTCGTTCCA ACGACAGCTC AACTTATATG GGTTTGGGCG TCTCACTAGG GGCCGTGACA GAGGAAGTTA TTACCACGAA CTGTTTCTCA GGGGCCGGGC TGATCTTTGC AAAATTATGT TGCAGACTCG TGTTATTAGG GACGCCGCCC AGCGCTGTCT TGAGCCTATC GCAGCAGTGG CGCCGAATTT CTACGCTATG CCAACTACTG AAGGTGGAGA CAAAGACCAA TTATTCTCGA TCCAAAATAC GTGGAAGGGG TTGTATTCGA GACAAAAGAC AGCAGCAACT TTGATAAATG AGGACCTTCC GGAAGAATTT GATCTTGTTA AACCGCTCCC CTTTGCTGAG TTACCACCAG CTCCTTCCAC ATCAGCTGCA TCATCTCCAG ACGTTTGTGT GGTGAAGGCC AGAAATTCAA CCGCTCAATC CTTGGCCTCA GAATCGGAAT CAAGTGTAAG AGCAGCGAGA AAGTGCTCCC AGCCACGAGG ACACCCTCTC GCATGTCCTC CAAAGTTAAA GCTTTGTGTT GGCAATATAA TATCGCCAAT CATCGACGGG CCGGTCGTAG CCCCACCAAA TGATTTGTGC GTCCTTGTAG ATGAACCTTC CAGCGTTCCC TTCTTTTTCG AAGATATGCC CTTCACTCCA GTTGCTCACA ATGTTGTGAC TTTCTGTCGG ACGGATTCCC GGGGGACCGC ATCATGTTCA AATGGAAGGA CAGCCGAAGA GTTATCGATC GATGTCAACG TGGACGAGTT TGGATCATCT CTCTTCGAAG AATGCTTTGC ATCCTATAGT ATTCACTTGT TGTGAGCATT CTCTGTGGAT ATTCAAGCAC CAGCACATCC TCTATTACTG ATACACCATG CAGTCTCTAC ACATTTTTGA CAGGTTTGCG CAACGCAACT AAATTTAGTC TACATATTGT TTCTTTAACG TTGCCAAGAT ACTTTATGGA CCGCTTGCTT TGGCCCAAAC GATTGGCTAG TGAGCCCGTT GCATTCAATG CCGGTTTAAA CCAGCAAGAA AGAAAGGCCA ATTATAAAAG TGAGAGTGTA TTCAACAATT TAGGAAATTG CAGGATTTGC CCTCAAGGCT CTGGTAAAAT TCAACTCGAC CGTATCTGGT AAGATCTGAT AACTTTGCTT GAGCCCATCG TAT
|
Protein sequence | MDNRPAPLMI SDDEVSFSGN AKVSKTANCA TKSYGGYIDY AKISFDQVSQ SIRSPVAMRR GPRGGVTVPF PAKLASLLEE VSLEGMEDIV SWGAHGRCFS IHKPSQFVSE LLPRFFRQSK LTSFQRQLNL YGFGRLTRGR DRGSYYHELF LRGRADLCKI MLQTRVIRDA AQRCLEPIAA VAPNFYAMPT TEGGDKDQLF SIQNTWKGLY SRQKTAATLI NEDLPEEFDL VKPLPFAELP PAPSTSAASS PDVCVVKARN STAQSLASES ESSVRAARKC SQPRGHPLAC PPKLKLCVGN IISPIIDGPV VAPPNDLCVL VDEPSSVPFF FEDMPFTPVA HNVVTFCRTD SRGTASCSNG RTAEELSIDV NVDEFGSSLF EECFASYSIH LFLYTFLTGL RNATKFSLHI VSLTLPRYFM DRLLWPKRLA SEPVAFNAGL NQQERKANYK RFALKALVKF NSTVSGKI
|
| |