Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50031 |
Symbol | |
ID | 7198728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 198698 |
End bp | 200432 |
Gene Length | 1735 bp |
Protein Length | 435 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184838 |
Protein GI | 219129317 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000158311 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTCAACAT CAGAGGAACA AGTCCAACCA TTCAACATCA GATTCATTCT CTATCGAACC CAACTCACAA TCCAAAAGTA AAAGTCTTGT GTACACATTG CAATTGCGTC GACACGAATA CTCGCCAAAC TATTCTCTCT CGTCGTCCGA ACAGCATCGA TCGATTTCTG AAATCTCCAT TTCTCTCCAT CGCAAATTCT TTCCATTTGT CCTCTACTCC GCCATCCATT GAAATCATGA TCGAGTTCCG TCGGAACGCA TACCATCGAA CCACCATACT CTGTCTGGTG TTGACCGTGA GTGCTTCCAT ATCTGCGTGG ACTCTTCCGC GACTCCCGAT GCGGACCCAT CGCTCGTCCA TCGGATCCTT ACCAGCGACC GTCGACGACA GTATCACTGC TTCTACGATC AGCGCCAGCG GTACCACCAT CACCAGCAAC AACGTAGTAA CGACGAAGCT TCTCCCCGAA TTCCAAGCCG TAACCGACGC AGCACAAGCC AAGCTTCTGG CTTCCATTCC GGAAGCTTAC CACGCAAAAA TTGTCCCACT GCTGGCGCAC TTTGTCAACG AGTACATGAC GGCCTCGCAA AACGCCTACC TCGCTACGGG CAATCCCTCC AGTGCTCCGG AGCAGGCCGC TTCGCGGATT CTCCAAGGCG TCGGATACGG CGTACGCCTC GGTTTGCTGG AACCCTTTCA GTTCTCCACT TCCCACGTCG CGCTCCGGGG GAAGAATCCG GAATTGGAAC AAGGCAACGA GATTGACTTT TACGAATTCG GCTGCGAATT CTTCCGGACC GTCATGGACT TGGAACGCTC CGTCGTGCTC GGGCAGGATC AAATTCCCAC GATACTGCAG CAGCTTGCCG ATGGTGAGAA CGTTGTCTTG TTGGCCAACC ACCAGTCTGA AGCCGACCCG CAAGTTGTCA GCTGTTGTCT AGAAGCCATC GGATACGGAG ACTTGGCCGC CGACGCCGTC TACGTTGCCG GACACAAGGT TACTACCGAT CCCCTCGCTA TTCCGTTTTC CATGGGACGC AACCTGATCT GCATCCACTC CAAGAAACAC ATCAACGCCG ACCCGGAAAC CAAGTCCGTC AAACAGCGTG AAAACCTCAA AGCCATGGGC GCCTTGCTCA ACAAGTTCAA AGAAGGTGGT GCGCTTTTGT GGGTCGCCCC TTCGGGAGGG CGTGACCGAC GGGATGTCAA CACCGGAAAA GTCCCACTTG CACCCTTTGA CTCTAAAACC ATCGACATGT TCCGACTGAT GGGTAACAAG TCCAAAAAAA CGACACACTT TTATACCCTC GCCATGGTCA GTTACGATCT CTGCCCCCCA CCGGACGTTA TTGAGCCCGG CACGGGTGAA CCCCGCAACG TACGCTTTGG ACCCGTTGGT ATTGCCCTCG GCGCCGAGTG CATTTCTGTG GGGGGACTGG AATCGCGGCA GGACTTTTGT CAACACGCCT TTGCCCAGTG CCAGGACGAT TACCTACGCT TGCAGCAAGC GATTGCCAAT CCGACAACGA CGGATCAGGC CTAGTGACCA CCAGCGTTGC GCATTTCACC TCCTTTACAC AATTGTGGTC GGCACAAGCA CAATACGTCA CTGCAGGAAA TACACCACGT ATCCATCCTT GTGGCATGCG AACAATGCCG TGTTCGTTCC CATTTATTTG GTCCGTATTC GTAGAACATA ATTATGTGTG ATCGAGGGAC ACTATTATAC AACCA
|
Protein sequence | MIEFRRNAYH RTTILCLVLT VSASISAWTL PRLPMRTHRS SIGSLPATVD DSITASTISA SGTTITSNNV VTTKLLPEFQ AVTDAAQAKL LASIPEAYHA KIVPLLAHFV NEYMTASQNA YLATGNPSSA PEQAASRILQ GVGYGVRLGL LEPFQFSTSH VALRGKNPEL EQGNEIDFYE FGCEFFRTVM DLERSVVLGQ DQIPTILQQL ADGENVVLLA NHQSEADPQV VSCCLEAIGY GDLAADAVYV AGHKVTTDPL AIPFSMGRNL ICIHSKKHIN ADPETKSVKQ RENLKAMGAL LNKFKEGGAL LWVAPSGGRD RRDVNTGKVP LAPFDSKTID MFRLMGNKSK KTTHFYTLAM VSYDLCPPPD VIEPGTGEPR NVRFGPVGIA LGAECISVGG LESRQDFCQH AFAQCQDDYL RLQQAIANPT TTDQA
|
| |