Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48060 |
Symbol | |
ID | 7203418 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 79347 |
End bp | 80996 |
Gene Length | 1650 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182470 |
Protein GI | 219124354 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.387117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTTC CGTTGGCCTT CCATCATCAC GTTGCAGTAG TAGGTGGCGG AATCACCGGA GCATGCGCGG CATCCACTTT TGCCAGTCGG AATATTAAAG TCGACGTGTT CGACCAAGGT CGCAGTGGCC CAGGCGGTCG CGCCAGTCAC CGTGTCACGG AAGAAGCAAA ATTGGAATGG GACCACGGCT GTCAGTTCTT CCGGGCCGAT ACAGAACGAT TCCGACAAAA AGTCGAGGGA TGGATTGAAG GGGGCATGTG CCAAGAATGG TTCGGAAAAT TCGGGCAAGA TTCCAGCTCG GCAGACTTTT TTGGTCTCCC CGGCAAGCCG CCGTTTTTTG TTGGTATGAA GGGTCTAATC GATTCTTTAC TCAATGAAGA AGGAATTCAT GTGTATAGCG ATCAGCGTGT AAGTAGCTTA GAAAGAGAAG GAAAGGTATG GAAACTGCTT GGAGTGCACG GTGAAGCTGC GTTTCATGAC ACTTCCGTGG AAGCGAAACC ACAGCCAATT GGCTCAACAA ACGGCTACGA TGCTGTAGTT TTGACCGATG TCAGCTCATC TTTTGATTCT TGGCACCGTG CGTCAGCCGG TGTACCCGCT GCCTTTGCGG CTCGAGTGAG AGAACGTGCA GGATCTCGCG TTCCCCTTTT TTCCGCCATG GTGGCTTTCG AACAGCCCTC TCAAATCCCT TTCGATGCAA CTGCTTTTGA CCAGAACGAG TCGATCTGGT TTGCAGCTAA GACAAACAGC AAGCCAGGAA TGGGAGCACT TGAGCAGGAA TGCTGGACGA TTATTTCTAC TCCCGAGTAT GCCATGCGCC AAATTTCGGA GATTCAAATG CAAGACAAAG AAACAGGAGC ATTTCAACCG CAAACACGGG AATATCTGAC GTCTGTACCT GGTCCTGATT TAGAAAGATC TTTTCGTAGC TCACTTAAAT CACAATGGAA GGTTGATCTA CCCAAGGTTA GCTTTTTGAG TGCTCAACGA TGGGGTTCGG CACTGCCGGC GCATCGTCTT GTGAACACTT CTTCCGACAC AAGACAAATC ATTGCCGGAG TAGCTTACGA CTCAAAACGA GGCTGCCTCG CGCCAACTGA GGCAGAAGCT GGTACACAAT CTTTCTTGGC TGATGATGGC TTGATGCTCT TTCAAGCTGG CGATATGGTG TCTTCCTACT CACCAGGATT TGAAGGGGCA GCAATCTCTG GTATGGATGC TGCAGAACAT ATATGTAAGC TTTTATCTTA GCTTCTGCGG CTGAGCTAAA ATAGGAAAAG TTTTGCTCCA GGGTCTTGTT CTCCTGAAAT AGTTGGGGTC TATATTTCTT CGATCATCGC TGGCACAACT CCTGCGAATC TCTCCTTCGA GGAAGAATAT TTCGCAAACA AATTTTTTTT GAGAGCACTA TTTCACTCGA ATCGCCCCGC AATGCTACTT AATCATAGTC GATCCATGAA GGACTCTGCA TCGCAAATGA AGATATCTTT TCGATCATGC TGGGGAATAC ATGACAAGTG TATTGACACA CACCTGAACA ATGATAGCTG TTCCGGTGCT CAATCTTTGT CCGTCGCTGG CCGTATCCTC TCCTCTACGG GCCATTGCTT GCATGGAGCT GAAGGCATCA AAAACATGCA GAGAAACTAA
|
Protein sequence | MTVPLAFHHH VAVVGGGITG ACAASTFASR NIKVDVFDQG RSGPGGRASH RVTEEAKLEW DHGCQFFRAD TERFRQKVEG WIEGGMCQEW FGKFGQDSSS ADFFGLPGKP PFFVGMKGLI DSLLNEEGIH VYSDQRVSSL EREGKVWKLL GVHGEAAFHD TSVEAKPQPI GSTNGYDAVV LTDVSSSFDS WHRASAGVPA AFAARVRERA GSRVPLFSAM VAFEQPSQIP FDATAFDQNE SIWFAAKTNS KPGMGALEQE CWTIISTPEY AMRQISEIQM QDKETGAFQP QTREYLTSVP GPDLERSFRS SLKSQWKVDL PKVSFLSAQR WGSALPAHRL VNTSSDTRQI IAGVAYDSKR GCLAPTEAEA GTQSFLADDG LMLFQAGDMV SSYSPGFEGA AISGMDAAEH IFGVYISSII AGTTPANLSF EEEYFANKFF LRALFHSNRP AMLLNHSRSM KDSASQMKIS FRSCWGIHDK CIDTHLNNDS CSGAQSLSVA GRILSSTGHC LHGAEGIKNM QRN
|
| |