Gene Information |
Locus tag | PHATRDRAFT_38642 |
Symbol | |
ID | 7203348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 588073 |
End bp | 589836 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182565 |
Protein GI | 219124553 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
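The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for PHATRDRAFT_38642.
length = gene_length(588073, 589836)
print(length, protein_length(length))  # prints "1764 587"
```

The result matches the reported Gene Length (1764 bp) and Protein Length (587 aa).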
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.429359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGTG GCAAGGTCGT CACCATTCAA GTACCGTACC TGTTCAAACA TTTGGTGGAC GACTTGCCGT CTGGAGCTGC TATTACGAGT GAAGGTAGTG CTACTGCTAC GGAAGCAGCC ACCGCGATGA TGGCGGCGGA CCCTTCCGCC TCCGTAGCTG CCGGAGTACC AGTATTCCTT CTACTCGGGT ACGGGCTTTC ACGCATCGCT TCATCGGGCT TGCAGGAATG GCGGAATGCC GTATTTGCGC ACGTTGCACA AGACGCCATT CGAAACGTTG GGCGGAGTGT CTTTGATCAT GTCCATCGAC TTGATATGCA GTTTCACCTT TCCAGGAACA CGGGACAGCT CAGTCGGGTA CTAGATCGAG GACAACGTTC GATTTCCTTT ACTCTCAACG CCATGGTTTT TCATATTGCT CCCACTATAC TTGAAGTCGG TATAGTCACA TCCTTGATGG GATACCAGTT TGGCTACGCG CATAGCAGTG TCGTAATGGC GACCGTGGTA GCCTACACCG GATTTACCCT TGGAGTCACC TCCTGGCGAA CGAAATTTCG TCGCGAAATG AATCGACTCG AAAACCAGGC CAGTGGACGC GTGGTTGATT CCTTGCTTAA TTACGAAACG GTCCAATACT TCAATAATGC ACAATACGAG GGTGAGCGTT ACGAAAGTAG TCTCAAGGGA TACCAAAAGG CAGCACTAGA GTCGCAAACT TCCTTAAGCT TGTTGAACTT TGGGCAAGCG GCTATTTTTT CAGCGGGTTT GACGTCGGTC ATGTGGTTGA CTTCACAGCA AATTGTGGAA GGCGCAGCCA CCGTGGGGGA TTTGGTGCTC GTGAATGGAT TGTTGTTTCA GCTTTCCGTC CCACTCTTCT TCATTGGGTC CGTCTATCGG GAGGTGCGAC AGTCACTGGT GGACATGGAA GCCATGTTTC AATTGCGAGA CACGATACCA GCCATTGTTG ACAAGCCAAA TGCGCTTTCT TATGATCCGA GTACCATGGG AACTTCGATT GCGCTCCACA ACGTACACTT TGCTTACCCA ACTGCAGCGA ATCAACGACC AATTTTGAAC GGCACCACGC TGGACATTGC TCAAGGCAAA ACAGTCGCCT TCGTTGGTTC TTCCGGTTGC GGCAAGAGCA CAATTCTCCG ATTACTCTAT CGTTTTTACC ATCCTGATCA AGGATTGATT TCCGTCGGTG GTCATGATAT TCAGGACATG ACGAAATATT CTCTGCAACG TGCCATAGCT GTTGTTCCGC AGGATACCGT TCTGTTTCAC GAGTCCATCG CGTACAACAT TCAATACGGA GATTTGAGCG CGTCCTGGGA TGAAGTGATT GAAGCTGCCA AAAAGGCCAA GATACACGAT ACGATTATGA GTTTTCCGGA TGGCTACGAA ACGGTAGTGG GAGAGCGTGG TCTCAAACTT TCGGGTGGTG AAAAGCAGCG CGTGGCCATT GCTCGGGCCA TTTTAAAGAA CGCCCCAATC TTATTGTGCG ACGAGCCAAC GTCGTCCCTC GATAGTGAAA CGGAAACAGA TATTATGAGT AACCTCAAAG ATGTTGGCAA AGGGCGGACC ACGTTGATCA TTGCGCATCG ACTGTCCACC ATTCAAGATT GCGATGAAAT TATTGTCATG AATCGCGGGA TGGTCGTGGA GCGCGGCACA CATGATGAGC TGATCGCCAT GGGTGGCCGA TACACGGAAT TAATCAAGAT GCAAGAGGCG GTAGTTGACG AAGACGAAAA TTAA
|
Protein sequence | MMGGKVVTIQ VPYLFKHLVD DLPSGAAITS EGSATATEAA TAMMAADPSA SVAAGVPVFL LLGYGLSRIA SSGLQEWRNA VFAHVAQDAI RNVGRSVFDH VHRLDMQFHL SRNTGQLSRV LDRGQRSISF TLNAMVFHIA PTILEVGIVT SLMGYQFGYA HSSVVMATVV AYTGFTLGVT SWRTKFRREM NRLENQASGR VVDSLLNYET VQYFNNAQYE GERYESSLKG YQKAALESQT SLSLLNFGQA AIFSAGLTSV MWLTSQQIVE GAATVGDLVL VNGLLFQLSV PLFFIGSVYR EVRQSLVDME AMFQLRDTIP AIVDKPNALS YDPSTMGTSI ALHNVHFAYP TAANQRPILN GTTLDIAQGK TVAFVGSSGC GKSTILRLLY RFYHPDQGLI SVGGHDIQDM TKYSLQRAIA VVPQDTVLFH ESIAYNIQYG DLSASWDEVI EAAKKAKIHD TIMSFPDGYE TVVGERGLKL SGGEKQRVAI ARAILKNAPI LLCDEPTSSL DSETETDIMS NLKDVGKGRT TLIIAHRLST IQDCDEIIVM NRGMVVERGT HDELIAMGGR YTELIKMQEA VVDEDEN
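The gene and protein sequences are printed in space-separated blocks of 10 characters, so the spaces must be stripped before any analysis. The sketch below shows one way to do that, compute the GC fraction, and translate the coding sequence. The Translation table field is empty in this record, so the use of the standard genetic code (NCBI table 1) here is an assumption. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record leaves "Translation table" blank. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGATGGGTG GCAAGGTCGT CACCATTCAA"))  # prints "MMGGKVVTIQ"
```

The output agrees with the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content.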