Gene Information |
Locus tag | PHATRDRAFT_29821 |
Symbol | |
ID | 7195028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 123260 |
End bp | 124826 |
Gene Length | 1567 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183425 |
Protein GI | 219126356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.979957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTCCTTA TTCGAACCAC CTCATGCATC CTAGTCCACT TTTTTGCTCT GGTATAAATG CGTAAAGCCA TAATCTCAAA GAATGTATTG TGCGTGCTGT TTCTGTTGTG CCAAAAATTG ACTGCTTCTT TCTCCTCCTG GGACGATCTC GATAAGCTTC CACTACCAGC ATGGTACAGC GAAGCTAAAT TCGGCATTTT TGTGCATTGG GGAGTTTTTT CTGTTCCTGC GTACAAAAAT GAATGGTTTC AAAACAATTG GCAACACTTC AAGTACGAAG ACTACGTTGC ATTCGTCAAC AAGACAGAGA GGCAAAATTT CGCCTACCAG GATTACGCCC ATCGCTTTCT AGCCGAGTTG TATCGACCCG ATAATTGGGC GGACACTTTT GCGGCTGCCG GTGCGCAGTA CGTTGTACTC ACAAGCAAGC ACCACGACGG CTTTTGCATG TGGAATTCGA CCAGCATTCC GACGACTTGG AACTGGAATG CTGTTGATAT AGGTCCTCGA CGAGACCTTT TGGGCGACTT GTCGAGTGCA GTAAAAAGAA AGAAAAGCCC CCAATCTGGA AAAAAACTGC GTTTCGGAAT TTATCATTCC CTACTCGAAT TTTTCAATCC GTTGTACACG TACGACAAAG GAAACAATTG GACGACGCAA AATATGGTGG ACCTGAAGGC TTTGCCGGAG CTGTACGATC TCGTAGATAG ATACGAGCCT CACTTGATGT GGTCGGACGG GGGCTGGGAA GCCAGTAGTA CATATTGGAA GTCAGAAGAG TTTATTTCAT GGTATGCTCT TAACAGCACC GTTGGAAAAG AAGCTGTCTG GAACGATCGT TGGGGCACCG ATACTCAATG CAGGCATGGA AGCTATCTGA CTTGTCACGA TCGTTACCAA CCCGACGGGC TCGTCGACAA GAAGTGGGAA AAATGTATGA CGATCGATTC TACATCTTGG GGTTACAGTC GTGTTTCCGA TGTTCTTCAA TACTTGAACA CAACTCAACT CGTACATACA CTTATCGAGG TCGTGGCGCT GAACGGGAAT CTCTTACTAA ACGTCGGGCC CTCCGCTGAT GGAACAATCA ATCCAGTATT TGTAGACCGG TTAATGGGGA TAGGAAGCTG GCTGTCAGTG AATGGAGAAG CAATCTATTC GTCGAACCCG TGGAAAATTT GTCAAAATGA GACCAAGTTC TCTGTATTCT ACACGCGAAG GACAGATGTT CTGTACGCTC ACATTACGGA ATGGCCAGAG AATAGTTTGT TGCGACTCGA CTGTCCTGTT CCAACAAAAT CAACCCGGGT GAACATGCTT GGCTTGGACG ACAGAAGCGA GATTTCCTTT TCTTTGAACG GCAATAGGTC AACTGGAGAG TCCATGGTTG TACAGCTGCC ACTTTTGACA CCTGATACGA TTCCTTGTCA GCATTCGTGG GTCTTGGCAA TCTCAAACCT AGCAAATTTA TACAAATGAG AAACCTTGCC GTTTAGCGCC CATGGTCCGA ATTAATTTTG TTGCTTACAA AACAATTTGA CATAACAAAC AGCACCATTT GTTTAAC
|
Protein sequence | MRKAIISKNV LCVLFLLCQK LTASFSSWDD LDKLPLPAWY SEAKFGIFVH WGVFSVPAYK NEWFQNNWQH FKYEDYVAFV NKTERQNFAY QDYAHRFLAE LYRPDNWADT FAAAGAQYVV LTSKHHDGFC MWNSTSIPTT WNWNAVDIGP RRDLLGDLSS AVKRKKSPQS GKKLRFGIYH SLLEFFNPLY TYDKGNNWTT QNMVDLKALP ELYDLVDRYE PHLMWSDGGW EASSTYWKSE EFISWYALNS TVGKEAVWND RWGTDTQCRH GSYLTCHDRY QPDGLVDKKW EKCMTIDSTS WGYSRVSDVL QYLNTTQLVH TLIEVVALNG NLLLNVGPSA DGTINPVFVD RLMGIGSWLS VNGEAIYSSN PWKICQNETK FSVFYTRRTD VLYAHITEWP ENSLLRLDCP VPTKSTRVNM LGLDDRSEIS FSLNGNRSTG ESMVVQLPLL TPDTIPCQHS WVLAISNLAN LYK
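The derived fields in this record (Gene Length, Protein Length, GC content) can be re-checked from the coordinates and the sequences. The following is a minimal sketch, not the database's own method: the helper names `feature_length`, `strip_blocks`, and `gc_percent` are hypothetical, the coordinates are assumed to be 1-based and inclusive (which is consistent with 124826 - 123260 + 1 = 1567 bp), and it is an assumption that the reported GC content is computed over the gene sequence shown.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = strip_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(feature_length(123260, 124826))
    # First two blocks of the protein and gene sequences, for illustration only.
    print(len(strip_blocks("MRKAIISKNV LCVLFLLCQK")))
    print(gc_percent("CTTCTCCTTA TTCGAACCAC"))
```

To check the whole record, paste the full Gene sequence and Protein sequence fields into `gc_percent` and `strip_blocks` in place of the short excerpts used here.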