Gene Information |
Locus tag | PHATRDRAFT_44480 |
Symbol | |
ID | 7197711 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 695146 |
End bp | 696742 |
Gene Length | 1597 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178283 |
Protein GI | 219114975 |
COG category | |
COG ID | |
TIGRFAM ID | |
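The coordinates above are consistent with 1-based, inclusive positions: 696742 - 695146 + 1 = 1597 bp, which matches the listed gene length. A minimal Python sketch of that check follows. The function names `gene_length` and `coding_length` are hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of the given length plus one stop codon."""
    return 3 * (protein_aa + 1)


if __name__ == "__main__":
    length = gene_length(695146, 696742)  # Start bp / End bp from the record
    cds = coding_length(497)              # Protein Length from the record
    print(length)        # 1597
    print(cds)           # 1494
    print(length - cds)  # 103
```

The 103 bp not accounted for by the 497 codons plus a stop codon is in line with the record flagging the gene as spliced.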
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000641793 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGGCGACA CCATCGTACA ACGTGAGCTT GGATCTTCTT TCGTCCGCTT CTCACCAGGA GATCCATTCC CACCAGCAAT CCGATGCCCC CTATGTAGCC ATGGAAATGG CAGGACCTCG ACCAAAGAGC AACCAAAATG AAAAAATTTT CAGTGATGTT CCGCCAGTGA TCGACAAAGA TGAGACAGAC GATGGCGACT CCGTTGCCGT CTCGATTGTC AGTTCCGTCA TGACAGCAGA AGGAATACAC GACTGGACAG GATTCTGTTG CGTATGCCTG GTTGTTCTTA TTGGTGACAT GAGCCGAGGT GTTATGTTTC CTTCAATGTG GCCCTTGGTG GAAAGTCTAG GCGGAAGCCA AATCACGTTG GGCTATGCAG TCGCCGCCTT CTCGTTCGGT CGCGTCCTTG TGAATCCAGT TTTCGGATCC TGGTCACATC AGATTGGCTA TACAAAAACG TTGCTGATGA GCTGTTCGAT TCTGTTCGTA GGGACGCTCT GCTACGCTCA AACACAAAAT GTAGGTCGAC CGGAATTTCT GATCGTTTCG CAGACGATTC TAGGAGTCGG AAGTGGTACG CTTGGTGTAA CACGCGCATT TGTGGCCGAT GTGACGGCAA AAAGGCAGAG GACCACATAT ATGGCATGGA TTACCGCTGT GCAGTATGCT GGCTTTACGG TCACTCCCTT TTTTGGGGCC TTGTTCAACT TTGCCTTTCA AAACAACGAT TACCAGTACG GCATTTTCCG TTTGAACATG TTTACAGCCC CTGCGTACTT CATGGCCTGT TTTGTGGTAG GGACATTTTT TGTGCTGATA TTCTTTTTTC AAGATCGGCA CCGCATATGC ATAAGAAAAG AAGCGAAAAA GTCGAAGAAG CGAGCTGCGA TGGAAGATAT TGCCAACGCT GTTACATGGA TCGGTATATC TGTGTACGAC TGTTGTATTC TGGGTTGTAT GTTACTGAAT GTGTCGACAA AAGGATCGAT ATCTTCTTTC GAAACTTTAG GAATTTCGAT TGCGCAGTCA CACTTTGACA TGGTGGCATC GCGAGCTGGA ATGATCGTCG CTACTTGCGG AACCATGGGT GTGTGTGCTT TACTTTCAAT GGGGACGTTG TCGAACCACT TCAACGATGT TCAGTTGATT TCCGGAGGTA TGACAATAAT GGCAATAGGT ATTGCTTCAT TGACTACCAT TGAAGAAGGG ATTCGAAACC CTTCTTGGAG GTACTTCGTT GCTATGTTTT TGATTTATTC AATTGGTTTT CCTATTGGCC ATACAGCAGT TATTGGACTC TTTTCTAAAG GTACGTGCTG CATTTTTCCT CGCGAACGCT GGAATTGCAG CCAAGAACTT ATCACCACTT TTGTTTGGAT CATCTTTGCA GTTGTTGGAC GTAGGCCGCA AGGCGCATTG CTTGGATGGT TCGCTTCGGC CGGATCTTTA GCGCGTATGT TCTTTCCAAT TATGTCGGGG TATGTTGCCG ATTACAAAGA TGTGGAAACT CTGTTTTGCA TACTGACGGG GGTATTATTT GTCTCGATTG CTTTTGTGTA CTGGTCCCGG GATATTCTCT TATTTTTGTC GTCCTAG
|
Protein sequence | MATPSYNVSL DLLSSASHQE IHSHQQSDAP YVAMEMAGPR PKSNQNEKIF SDVPPVIDKD ETDDGDSVAV SIVSSVMTAE GIHDWTGFCC VCLVVLIGDM SRGVMFPSMW PLVESLGGSQ ITLGYAVAAF SFGRVLVNPV FGSWSHQIGY TKTLLMSCSI LFVGTLCYAQ TQNVGRPEFL IVSQTILGVG SGTLGVTRAF VADVTAKRQR TTYMAWITAV QYAGFTVTPF FGALFNFAFQ NNDYQYGIFR LNMFTAPAYF MACFVVGTFF VLIFFFQDRH RICIRKEAKK SKKRAAMEDI ANAVTWIGIS VYDCCILGCM LLNVSTKGSI SSFETLGISI AQSHFDMVAS RAGMIVATCG TMGVCALLSM GTLSNHFNDV QLISGGIASL TTIEEGIRNP SWRYFVAMFL IYSIGFPIGH TAVIGLFSKV VGRRPQGALL GWFASAGSLA RMFFPIMSGY VADYKDVETL FCILTGVLFV SIAFVYWSRD ILLFLSS
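The sequence fields are printed in blocks of ten characters separated by spaces, so the whitespace has to be removed before bases are counted. The sketch below shows one way to do that and to compute GC content in Python. `strip_blocks` and `gc_percent` are hypothetical helper names, and the short sequence in the example is made up for illustration, not taken from the record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the spaces between 10-character blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_blocks(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    example = "GGCCAATTGC ATATGCGCAT"  # illustrative input, not from the record
    print(len(strip_blocks(example)))  # 20
    print(gc_percent(example))         # 50.0
```

Applied to the gene sequence above, the result can be compared with the listed GC content of 47%.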