Gene Information |
Locus tag | PHATRDRAFT_20708 |
Symbol | |
ID | 7201382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 894693 |
End bp | 896203 |
Gene Length | 1511 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180535 |
Protein GI | 219119555 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.558415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTATTGTTC GCTCGGCACC AAGTCCATTC CCGTCGCTTC CGTCGTACGC AACCCCCCAT TTTGGCACAC ACAGTCCGAC ATCCTCATGA GGTCCGGTTT GGCTGTCGTC TCCATTTCTG GGCTGGTGAT GCTCACCTCC ACTGCGGTTG GGGGGTTAGC GACGAGTCGA CCCAACTTGC CGCCGCTTTC GGGTACCACT TCCATTGCAC CGCACCTGCC TTTGGCACGT AGTGCCATGG AGTACTTGGA CGCCAGTCCC GATCCTTTCC ACGCCGTACA AACCTCCATC GAACGATTGG AAGCGGCGGG CTTTACCTCA CTGTCCGAAA CCTCCACCGT CGACACCGGA AAAATCGTTC CGGGCGGCAA ATACTACTTT ACCCGCAACA AATCCACACT CGTCGCGTTT GCTGTCGGAG ACCGGTACCA ACCCGGCAAC GGCTTTAAAA TTATTGGCGG ACACACGGAC TCGCCCAATC TGCGCGTCAA ACCGCGCTCG CTACGGACCG CCGCCGGCTG CGTGCAGGTC GGGGTGGAAT GCTACGGGGG CGGACTCTGG CATACCTGGT TCGATCGGGA TTTGGGTGTG TCCGGGCGTG TGTTGGTCCG GAGCCGCGAT GATGCGCGCA AGGTCACGCA GAGACTCGTT CGTATGGATC GAGCCCTCCT GCGGGTATCC AACCTGGCGA TTCATCTGCA GTCTGCCAAA GAGAGGGAAG CCTTTAAAGT GAACAAGGAA GACGATCTTT CACCAATTCT CGCGATGGAA GCGGAAAAGT CCCTCAACGG CGGGGAAAAC AAGACCAAGG ATGGGTGGAC CGAGTACCAA GAACCCGCCC TGCTCGAAGT ACTCGCACAC GAACTCAACG TCCGAGTCGA AGATATCGCC GACTTTGAGC TCAGTCTGTT TGATGTCCAA AAAGCAAGTT TGGGCGGAGT CTTTTCGGAG TTTATCCACT CGTCGCGTTT GGACAATCTC GCCAGTTGCT TCCTCGCGGT ACAAGCTTTG GTGGATCACG TGGAGGCCGG CAGCACTGCT AAGGACTCGG ACATTTCCAT GATTGTCTTG TACGATCACG AAGAGGTCGG TAGCAACTCC GCCGTGGGAG CCGCATCGCC AATAATGGCG GAAGCCGTCC AACGCATTGC GGCAGCCTTG GGCAACCAGG AAAGTACGGA AACTTACGCA GCCTGTATTC GCAACAGCTT TTGTTGCAGT GTCGATCAGG CCCACGCTTT GCATCCGAAC TATGCAAGCA AGCACGAAAA GAATCACCAG CCAAAGATGA ACCAGGGCAT GGTGATTAAG CGCAACGCCA ATCAAAGGTA CGCCACCAAC GCCGTGACGG GCTTCTTGAT GCGCGAAATT TCCCGCCGCG CCGGGCTGCC ACCCATTCAG GAGTTTATTG TACGACAAGA TTGTGGCTGT GGTTCGACAA TCGGACCGCT GATCAGTACA GCTACGGGTA TTCGGACAAT TGATATGGGC TGCCCCCAAC TTTCCATGCA T
Protein sequence | MRSGLAVVSI SGLVMLTSTA VGGLATSRPN LPPLSGTTSI APHLPLARSA MEYLDASPDP FHAVQTSIER LEAAGFTSLS ETSTVDTGKI VPGGKYYFTR NKSTLVAFAV GDRYQPGNGF KIIGGHTDSP NLRVKPRSLR TAAGCVQVGV ECYGGGLWHT WFDRDLGVSG RVLVRSRDDA RKVTQRLVRM DRALLRVSNL AIHLQSAKER EAFKVNKEDD LSPILAMEAE KSLNGGENKT KDGWTEYQEP ALLEVLAHEL NVRVEDIADF ELSLFDVQKA SLGGVFSEFI HSSRLDNLAS CFLAVQALVD HVEAGSTAKD SDISMIVLYD HEEVGSNSAV GAASPIMAEA VQRIAAALGN QESTETYAAC IRNSFCCSVD QAHALHPNYA SKHEKNHQPK MNQGMVIKRN ANQRYATNAV TGFLMREISR RAGLPPIQEF IVRQDCGCGS TIGPLISTAT GIRTIDMGCP QLSMH