Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49148 |
Symbol | |
ID | 7195615 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 65744 |
End bp | 70277 |
Gene Length | 4534 bp |
Protein Length | 1057 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183803 |
Protein GI | 219127148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.22092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCC TTGTAACGGG AGCACTTGTA TTTGGTATTG TGTTTCTGGT GGTACTAAAC GACAGTATCA CCAACAAAGC GCGGTCGGAA GCCTTACGAC GACAGTCGCC ATTTCCGCAG TGGAACGAAC AACTGCTGCA CGATCAAGCC AAGCCACAGA GAACGGTCCA CTTTCCTTTC CGCGACGGAT GCCCGCACGA CTGTCACGAA TATCGCCGGG ACGGTGCTCT CATGACCGAT GAGTATTACC GCGTAAACTT GGAATGGTAC TTGGGGGAAC CTGAGCAGTA CAATCCCCGT ACAGACTGGA AACTTCAAAA GAATTCTTCC TCTTACAGCA TGGGACTCAA TTTCCAGTGC CCCGAATCGT TTGTACGTAC CTTTCAAAAG GCATCCCGCG AATTGGGCGA GAAGGCAACA AAGCAACTGT CAACACCAGA CCGAACTATT TCGATCCATC CTCCGAAACG TATGCAGCTC GCCCTGACCT ACCTCTGTTG TTTGGATAGC GAGGAGGCCA GTGAAGCCAT TGCTGCTATG GATGAGTGGC TTACCCAACG GGATTTGTTT GACCTTGAAG TGCGATTTGA CGAAATCCAA AGCTGGCACG AGGCTCCGAA TGCAGTTGCG ACCGTTCTAG TTGCCGACCA AGCTTCGCAG CAAGCTCTCA TGCGACTCAA TCATGAATTG AGCCGTCATT TGCAGCGCTC CGATATTCCC GTTGCTGTAC CACGTGAAGA CCAAATGCCA TTTCATGTCG CGTTGGCTGG GTTTCGCCAC GGAGGCCGCG CTGAATCGTA CGACCCATCT CTTGACATTA CGTCCCAACT TCCCACTGTC TACAAGCTTG TACAGAAGGT TTCGGACAAG TACAAGGACG TTTGGACCAA AGCTGACCGT AAAGGGAAGG GTGAAGACAT GCTTCGTATC CAGCACAATC CTCGACACTC TCCTCGCCCT ATTTTACACA CTTTCCCATT GATCGATAAG GAGACTTGGT AGGGGCAGTC GTGCTTGTTC ACGAATTCTA TTTTCGGAGA ATGTATTTAC AGTATATTTC TTTTCCTTTG TCTTTGCATG TGGCTAGCAA ATATTTCAAC AAGAAATATG TACGAAATTT CACCAGGTTG CTGACGATAG CCATGCGTTC TGAGTGGACT GACTAAAGCT AGCAACAATA TCAAACAAAA AGCCAGTCCT CTCTGATTGT GATATTAAGG TTTTACAATC GAAGATGTTT TGGTTGGAGA GGTACAGTAA TATATGACGA CTCTGGAGGT CTGATAAGCT GATAGGTGAT ACTGTCTATA CTTATTACAG TCTATGTTTC GAAGGTTCGC ATGGCTTATT CCACATTTAT TAACGTAAGT TGTTCACTGT CAGATCTTCC ACACTTTCAC TGTCGCATAG GGACTTGTCT CAGTTTCTGT GACAGGATGA TGGTCAATAG GTCGACCGAG AGGGGAAACG CAACAATGAA CCGAGCAATG AAATGGTGCA CGATCAAGAT ACTGATTTAC CGTTGGCATC CAGCCTTAAG AGTATACTTC AAACCAGCAC AAATTCATTA TGGTGAAGAT GTATTCTAGT AGTGAAAGAA ATCACACTAT TTGTCGTCAT TTGTTCCGAA AAGTGTGTTT TCGAACTCAA TTAACAGTAA AGCGCTTGTT TGGTATCTTG ACGATTGCCA CAGCGTTTAT AATCATTCAG CAGAGACGAG AGAATAGTGC GTTTCCGCAA TGGAACGAGA AGCTCATTCA TGGTCGCGCC GAGCCACAGC GCACGCTGCA TTTTCCGTTT CGGACCGGAT GCCCGCACGA TTGCGGTGCT TATCGTCACG TGGAGTCTAG TAGATTCGGC TCCATTCTCA ACGACGAACA GGACTTTTAC AGAGTTACCG TCGAATGGTA CTTTGGGGAA CCAGAAAAAT ATATGCCCAC TGTTGATTGG AAAGCTCAAA GGAATTCTTC ATCGTACAAG CTGGGTATTA ATTTTCAATG CACGGAATCT TTCGTCCGCA CATTCCAAGC CGCATCTCGA GACCTCCACG ACCAAGCCAC AAAATTTTTG TCGACTCCTA GCCAACCTAC TACGATACAT CCGCAAAGAC GCATGCACGT TTCACTTTCC TACTTGTGCT GCCTGAATTC TGAAGAAGCG AAGCGTGCGA TAGCTGCAAT CGACGATTGG ACGGGAAGAG CCAAATTTGA TATGGAATTA CGGTTCGACG AAATTCAAGC CTGGCACGAA TCTCCGAACT CCGTGACGAC GATCGTGCTG GTTGATGAAG CTTCGCAGCA AACACTTATG CGAATTAATC ATGACTTAAA CCAGCATCTG CATCGCTTTG ACATTCCTAT TGCGATCCAA CGTGAGGACC AAATGCCCTT TCACGCTACC GTTGCTGGGT TCCGGTATGG TTCCAACGGC GAATCCTACG ACCCAACGCT CAACATTGAA CCACAACTAT CGACCATTTA CAATTTTGTC CATCAAGTTT CGCAGCAGTA CCTGATTGCT TGGAATGGCC CAAATGGAAA GGGTATCCGC ATTCAGCACA AGCCCAAGCG ATCCAGCCAA CCAAGTCTCC ACACTTTTCC GCTTGTAGAC AAGGAGAAAA AGTGAGTTGG TGTTTTCCGA ATGGGAGATG AAGCATGTTT TTTTGCGCCA AATTTCTGAT GTCAAAGGTA CCAGCAGTAC CGACAGTGCG AGCCCGCTCC CTACTGGATC TCGAACCCAT GAGGATGTTT GCTTTACCAT CGTAGATTCT TAGGGATCAG CGCGGATTGC CATGGAGGAC AAACCACTTT CTTCTCTTTC CGGCTGCAAT AAAGGAACAA CTTTCTGCTC TGATTCCGAT TCTATACTGT TGGGCAAACA CGATATAATG TTGTCCATTA TCCTAGAAAG CTTGCAAAGA AGAAGTGATC CTCATTTGAC CCCACCCATC CAACCCTTTC TGTAAAGAAG CTTCACGGCT AGCTTTAGCA TTGGAATTCG TAGAAGTGGA CAATCATGAA ATAATACTGA CGTTGAGTAG AACGAGGATA GGGTCTCATT CAGGAAATCG GGGGCAGTCC AATGGCAGAT TGTCCTCGCG GCTTTCGAAA CTTTCCTCAC TTGTTGACTT TACCGCATTT CAGCCCATAA TAATAATAAT CAAACGGATG ATTGTGCCTA ACAATTGTAC AGTTAATGTT GTTTCGTCTG CGGTCGTCCG TCGATTTGGG TTGCGCTTTT CCTTTCTTTC ACTGTCGGCT CCAATTCTTT GCGATGTTTG TTTACACACT CCGCAGAAAC CATGGGAGGA CGCCTTCATG CCGTTCGCTC GTGGCCAAAC GATGGAAGCA AACACTTCTG GTAGCACGAC CGACGACGTG ATTGATTTCC TCGTTCCGTT GCAGCAGAAC GGGCATCCTT TTGCTTCCTA TGCTGTACGA ACCGACACGG ATGTCTCGTT CGGTAGAACC GTGGAATTAT GTATGCGTCT GACCGAAGTT GCATCGTCTC TATCGCCAAA GAACGCGGCT GCTGCAGATA CTTCGAGCAA GAACGAAGCT TCGACGCTTG CTCGCCGCAC ACTTGGACGC GTAGACTTTT TACCAGCCTT ATGGGCGGCT TGTGGGTGCA ACACCAGGCC CAACCGCGAT ACGTCAGTGC CGCAGTCGTA TCACTACAGA ACATCGCAAG ACAAATGTGA TGCTTGTCAG CGAGTTCACC ATTGGGCTAT GCGTTTTTTG CCTCGTCGTT TCCTTTACTT TGCTCGCTAA ACGATTCGAG GGACTACGAG TTTGCCGTCA TGCCTGGCAA GAGTGAGCGT TCCTATCCCG GATTGGAATG TGTCACCATC AACGAAATCC CGTTGAAGAA TTTGCCAAGA TTTCCTAGAC GGGGGTTGGA GCTAGAAAGC AACAACGAAG ATACCGCGTT GGTAGGTTGG TGGAGGATCA GTGCAGGAGA CACGCTGTCG CTACTCCCGC CTCCTTCGCT ATTCCACAAG AATGGAATGA TTGAAGTCGA CGACAACGAG AGTGACAAGG CAGAAAACAA AACGCAGCTT GTATTATTGG AACCCCTTCG GCTTCAACTC ATGAGAGGAG CTGCCACCGC CGTTTATGAG AACAAAGCTG TAGGCGAACT ACGGATGCGT GATACACCGA TATCTAAAAC CAATGGACCT AGGCTCTCAT TCACGACGGA TGGAGCATCG TTAGACCACA CCGTTGTGGA AACGAAAGAG GTTCAGCGGG ACATTGAACA ATATCAACCA GAGAGACTTC AGAATGATGG CGACTCAAAC GAGAACATAC CACATGCGCT GGATCTCTCT TGGAGGAAAC CCTCTACAAA GAACGACGTT ACTCAGGAAA GTTTGCGTTC CGCCTCAAAA AATAGCAGGG CTCACGATGG AAACGACAAA AGGGATCCAA GTCGAATGGA GGAGGGTTTA CAAGTTTTTT CTTCCGTACC GAAAGGTCCA TCGCATCCGC CAGAGTATTT GCAACCGATC TCTCGAGCTA CACAGGGAAT TTGA
|
Protein sequence | MKRLVTGALV FGIVFLVVLN DSITNKARSE ALRRQSPFPQ WNEQLLHDQA KPQRTVHFPF RDGCPHDCHE YRRDGALMTD EYYRVNLEWY LGEPEQYNPR TDWKLQKNSS SYSMGLNFQC PESFVRTFQK ASRELGEKAT KQLSTPDRTI SIHPPKRMQL ALTYLCCLDS EEASEAIAAM DEWLTQRDLF DLEVRFDEIQ SWHEAPNAVA TVLVADQASQ QALMRLNHEL SRHLQRSDIP VAVPREDQMP FHVALAGFRH GGRAESYDPS LDITSQLPTV YKLVQKVSDK YKDVWTKADR KGKVYVSKVR MAYSTFINVD REGKRNNEPS NEMVHDQDTD LPLASSLKIK RLFGILTIAT AFIIIQQRRE NSAFPQWNEK LIHGRAEPQR TLHFPFRTGC PHDCGAYRHV ESSRFGSILN DEQDFYRVTV EWYFGEPEKY MPTVDWKAQR NSSSYKLGIN FQCTESFVRT FQAASRDLHD QATKFLSTPS QPTTIHPQRR MHVSLSYLCC LNSEEAKRAI AAIDDWTGRA KFDMELRFDE IQAWHESPNS VTTIVLVDEA SQQTLMRINH DLNQHLHRFD IPIAIQREDQ MPFHATVAGF RYGSNGESYD PTLNIEPQLS TIYNFVHQVS QHELVFSEWE MKHVFLRQIS DVKVNVVSSA VVRRFGLRFS FLSLSAPILC DVCLHTPQKP WEDAFMPFAR GQTMEANTSG STTDDVIDFL VPLQQNGHPF ASYAVRTDTD VSFGRTVELC MRLTEVASSL SPKNAAAADT SSKNEASTLA RRTLGRVDFL PALWAACGCN TRPNRDTDYE FAVMPGKSER SYPGLECVTI NEIPLKNLPR FPRRGLELES NNEDTALVGW WRISAGDTLS LLPPPSLFHK NGMIEVDDNE SDKAENKTQL VLLEPLRLQL MRGAATAVYE NKAVGELRMR DTPISKTNGP RLSFTTDGAS LDHTVVETKE VQRDIEQYQP ERLQNDGDSN ENIPHALDLS WRKPSTKNDV TQESLRSASK NSRAHDGNDK RDPSRMEEGL QVFSSVPKGP SHPPEYLQPI SRATQGI
|
| |