Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43539 |
Symbol | |
ID | 7197216 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 758812 |
End bp | 760024 |
Gene Length | 1213 bp |
Protein Length | 354 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177679 |
Protein GI | 219111855 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGTAGCAC AAAGCGACTC GGCGCAGCGT CATCTGCTTC TGTTGTATAC ATTCCTACGA AGCACCATGA TGCCCTCTAA TACGATAGAC GCTTCGTCCT CGTCCGCTCC GCCGCCTCCC AACCCGGTGC TAACCGCCTA CGAAGACTTT CGCAGGAATA CACCAGTCGT CACACGTTCG ATCCTCACCG TCCTAGTCTT GTCTTATCTT CTCTCGTGGG TCATTGATCC GCACTTTGCC ATGGCAAACA TTCCACAGTT TTCCGTCTTC GGATTCGAAA TTTACCGGAT TTTGACGAGT CCGTTGGTTA ATACTCGCTT CTTCTCGTTG CTGTTCGCAT TTCTGTCCTT TACCTCGCAA GGAAAGCGAA TGGAAAATTC CATGGGATCG ACAGCGTTTG GAGTGTTGTG TTTGACCATG GGAGTACTGG CTAATGTACT ATTTCTGGTG ACCAATGTCC TGCTCTACTA TGTGTCGGGT GGAGAACAGG CTTTTCTGTT TACGGCAGCC GCTGGGATAT GGTTGATTCT CTTCGGGATC ATCGCAATGG AATGTGTACA GGCACCTCGT GGAACCCAAC GGCCGCTTTT CTTTTGTAAA ATTCCGACCA TCTACTATCC GTTGGCCTTG TTCGCCGTCT TTGGCTTGTT CGGGCAGTCG TTCTCCGTGG CAAATCTCAT TTCCATGGGG ATAGGATACG CGTACGGATT CGGTTACTTG GATGGCCTTA AACCCAGCGC GCCGCGGATT TCCCAATGGG AGGAGACGAT ACTAGCAGAC TGGACCCGCA ACGAAGGATG GGTGGCTGGC CAGGCAATCC TGGGTAGTGA TGCATGGAGC GAGGCCAACG GGGGCTCTGC CTCGGAGGGA ATGGTACGTT AAAAACTGCG CAAAGCGACT TCGATTCATA AAGTTGGACT CACGTCAACA AAAAATCTCT ATTCTCTTTC GGTAGAGCTT ACCTACAATG CAACGAGGTT CAGCACAACG CACCAGTGCG GTTTCCGAAG CTACGGGATC TGCTCGTGCT GGATCGGTTT TTCGCAGCGG AATTCGAGAG GACGCGAACA ACGCTGCTGA CCACGCATCG TTGCTAGCCA ATGCCGGGTC GGGGCATACC CTGGGGACAA CCTCACGGAG GCCCACGGAT CCCCGAACGG CACGATTAGA AGCGCTGGAA CGCCGGGGGA TCGTGGGCGA CAATGCCGTC TGA
|
Protein sequence | MMPSNTIDAS SSSAPPPPNP VLTAYEDFRR NTPVVTRSIL TVLVLSYLLS WVIDPHFAMA NIPQFSVFGF EIYRILTSPL VNTRFFSLLF AFLSFTSQGK RMENSMGSTA FGVLCLTMGV LANVLFLVTN VLLYYVSGGE QAFLFTAAAG IWLILFGIIA MECVQAPRGT QRPLFFCKIP TIYYPLALFA VFGLFGQSFS VANLISMGIG YAYGFGYLDG LKPSAPRISQ WEETILADWT RNEGWVAGQA ILGSDAWSEA NGGSASEGMS LPTMQRGSAQ RTSAVSEATG SARAGSVFRS GIREDANNAA DHASLLANAG SGHTLGTTSR RPTDPRTARL EALERRGIVG DNAV
|
| |