Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50345 |
Symbol | |
ID | 7198996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 373373 |
End bp | 375034 |
Gene Length | 1662 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185185 |
Protein GI | 219130045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0816818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATATTTTC CCATGTCTTC TGGTTCTTCT CTTTCCCCGC TGTACGCGCC ACGCTTTGCG GAAGGTCTCC AAATGCCGGC GGAAGACTGT CGCGACTATC TACGAACAGG CCGATGCAAG TATGGGCCGT CATGCAAGTA TAATCACCCA GCTAATGTAC AAAGCGGAGG GGGTATGCGG GCGCCTATTG ACCCTTCGGA ACCACTCTTT CCCGTGCGTC TCAACGAGCC ACTCTGCCAA TACTACATGA AGCACGGCAG CTGTAAATTT GGACAAGCAT GCAAGTTCAA CCACCCTCCT CAGCTTAGCC ACAGTTCACA AGTTGCAGGG GACACCCCTG TTACCGGCAA TGGGCGTAGT ACCGACGTAC CTGTCGTCTT CAGTCAATGC GATGGTCCAA TGATGCTACA ATTTCTTCCA CAACGCCCCG ATGAACCGGA CTGCATCTAC TTTTTGAAGA ATGGACGATG CAAGTACGGA GCAACTTGCC GCTATCATCA TCCGGTGAAC TATCACAAGC ACCGCGCGGA GGAATCTCGT CGTCAACATC GAGCGCAACT TCAGGAGCAG TACGCGCCTC AAAAGGTACA ATACATTGCC CAAACAGTGC CCAACGGAAA CTTCAAGGGT CAGCACGTGA TGTCTGATAA CCCTTTGACT TTTATGAGCT ACGATGTGCC ATCCGGAACT CCAGGGTTCC AGCCAATGTC TCTTGTCATG GGAGCCGATG GTAGTACTTC GTACGCAACT CACATCGGCC CAAATATAGT TACCGAGCAA GGATCTTCGG CTTCATCTAT TGCTTCTTCT TACGAAACGG CGCCAACAGG CTTTGATCAA TTCCAAGGCG ATCCATCCAT GTGGGCTCGT GCTCGACGCA ACGGCAGTGG AAATAGCCTG ACAGCATACA CGATCGACTC ATCTAATCGA GGAGCGCGTC TTGCCATGAC TCACAGTCCA AGCGAGGGTA GTATGGCTTC GCGCAGACAT CGTGCGAGCT CTCACGGAAG CGCGAGCGAG AGCTCCTATC ACGATGTGAA CCAATCTGGA TTGAGTCGAA GCGGTTCGGT TGGTTCGTGG CGCAATGATC GAGTTCCTTC CTCTACCTAT GATCGCCGGC TTCCGACACA ATACATTTCA AGAATAGACG GAGTTGTGAG CGATCAACAA CCGCGTGGAC GACCCCCTTC TATGTCCATG GCACCAGGAC ATCGACCCAG CCCCAGAAGC CGCAAACCAA GGGCGCACGG AGAAAATGAT GAGGGCTTTA CCATGATGAC CTCCGCTCTG TTGAATATGC TTGACACACC GGAAGAGGCA TCGACTGAAA GCTTCAGCGA CGAAGACAAC AATCGCTATC GCTTACAAGA GCCGTGTGAA GAGCAACGCC CCCTATACGG CGACCCGTTG GACGTCGAAT CATCCATGTT TGAACGTTTG TCTTTGAATG GTGTAAAGCA CAATTACCAA ATTCGATCCG TATCAGATAC AAACACGAGT GATTCATGGT CTCCAACGTG GCAGGGTTCC TTAAGAGGGC CAGCTTCCCC TCCTGCATCC TCACTCGATG GCAATGCTCA AGCTTTGTCG GCTATCCCAC CACGCCATTC GCAAGGTCAT AACACCCCAC CATCCTCTGA TATCGGTCTC TTTATACCCT AG
|
Protein sequence | MSSGSSLSPL YAPRFAEGLQ MPAEDCRDYL RTGRCKYGPS CKYNHPANVQ SGGGMRAPID PSEPLFPVRL NEPLCQYYMK HGSCKFGQAC KFNHPPQLSH SSQVAGDTPV TGNGRSTDVP VVFSQCDGPM MLQFLPQRPD EPDCIYFLKN GRCKYGATCR YHHPVNYHKH RAEESRRQHR AQLQEQYAPQ KVQYIAQTVP NGNFKGQHVM SDNPLTFMSY DVPSGTPGFQ PMSLVMGADG STSYATHIGP NIVTEQGSSA SSIASSYETA PTGFDQFQGD PSMWARARRN GSGNSLTAYT IDSSNRGARL AMTHSPSEGS MASRRHRASS HGSASESSYH DVNQSGLSRS GSVGSWRNDR VPSSTYDRRL PTQYISRIDG VVSDQQPRGR PPSMSMAPGH RPSPRSRKPR AHGENDEGFT MMTSALLNML DTPEEASTES FSDEDNNRYR LQEPCEEQRP LYGDPLDVES SMFERLSLNG VKHNYQIRSV SDTNTSDSWS PTWQGSLRGP ASPPASSLDG NAQALSAIPP RHSQGHNTPP SSDIGLFIP
|
| |