Gene Information |
Locus tag | PHATRDRAFT_39474 |
Symbol | |
ID | 7194969 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 605425 |
End bp | 610937 |
Gene Length | 5513 bp |
Protein Length | 1250 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183516 |
Protein GI | 219126546 |
COG category | |
COG ID | |
TIGRFAM ID | |
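The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the coding region is three nucleotides per residue plus one stop codon. The variable names are hypothetical. Treating the remaining bases as intronic is an inference consistent with "Is gene spliced | Yes", not a value stated in the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 605425
end_bp = 610937
protein_length_aa = 1250

# Assumption: 1-based inclusive coordinates, so span = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = 3 nt per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span not accounted for by the coding region.
# For a spliced gene these are presumably introns (an inference, not a record field).
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 5513 bp, and the gap between the span and the coding region is what a spliced CDS would be expected to show.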
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.713077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCTT TGATCATTAA TCTCGTCGGT TTGTCGGATA CCGAAGCAGC TTTGTTTGCT GAAGCCGGTA TCACCGATCG GACTACCATC GCGTCGTTGG ACCTTCCCGC GTTCACGCAA GTGTTGGGAG ACCAGGTTGG GACGCTGGCG AAGCGATACA AACTGAAGCG TGTGGCTGAG TACCTCAGCA GCGGTAAGGA AATCGGTCCC GATACGACGA TGGGAGAGAT CAACCTCGCG TTGTCATCTA ACCATCAGAA TGTTGGAGCC AATGCTCCTA ACCCGGCTGC TGGACTTGAC CCTCGCCACG GAACGATGAA AGCGAGCATT AACACGATCT CGAAGTTTTC AGGTGATCTG GAAGATTTCG AAGACTGGTC GACGAAAACC GCAGCGGCTC TCGGAATCAC CGTTTACCGT AAACTTCTCG ACGGTCCTCC GGAAGTCGGA AATGATTTCG ATGCCGCCAG AAACAATGAG CTCTATCACA TGTTTGTGAT TGCTCTTGTG GATGGTGCTG CAATGCATAT CATGGAAGAC GTGAAGGACC AGAATGGATA CGCCGCGTGG ATGGCTATCA AGGAATGGTA TGGTATGTCG GATACCGGTC GGACGATCAT CGACAAGTAT CGTAGCAAGC TGGACACGCT ACTGCTTGAC GATGCGACCC CAGCCGGGAC TTTTGTCAAC CATTTCAAGA GATTCAGCCA GAAGCTCGAA GAAAACGGTG AGGGCTACAC GGCGGATACG AAACGACGTC AGTTTCTCGA CAAGATCATT GACGAAGACT ACGATGTGGT CAAACAGCAG TTGGAAGGCG ATTCGACTGC GGATTTTAAT GAATGTGTTG CGCGCATTCG TATGCGCGAG CAAGTCCTCA TGAAGGACTC CACTTTGTCG GCCAAGAAAG CTAGACGGTT CAAGTCGAGC GAGGGCGGCA AGTCGAACAG TGGAGGTCCG TCAAGTGGGA AAATTCCTTC GATTCCGAAC TCTATACTTA ACCTGGTCAA GCCGGCAAGT GCTCGCAAGA ATCTAATCAA GTGGAGGGGA GTTTGGAACT CCGAAGGGCG CATTCTTCGA TCGGACGAAC TGGCAAGCAC TGAGTATGAC GGGAAAGGTA AAACCCCGTC GAAAAGAGAG CACGACAATG GCAGTTCCGA CGAATCCGTC AAGACCCGCA ACACCCAAGG CAAGGGGGGT TCCTCGTCCA AGAAGGGCAA GAGAGGCAAG GGGAGGCGAA TCACCGGTGT TGTCCGTCGA ACGGAAACGA AGACATCCGG CACGCCTGAC TCCTCCATCC GGATCAGTAT GAAGGATCCG GACGACCACG TCGAAGGTGA TGAGTATGAC GATGTCGAAA TTGAGTCCGG TCAAGAGGAT GATTCTACTG CACCTGTGAA GAAAGAACGG GCGAAAAAGC AGAAGAATCC GTCGAAGCGG AAGCACAAGC GTGCCAAGTC TCGTCGAAGC CTCATCTCCC GCCGCGGACG CGTAGGCAAC GAGAAACCAA GAGCTATCCT AGACCCAGGG ACTGAATGCG ACATTGTTGG CGGGGACGTA TGGACAGTTC TGGAAAAGGT GATCGGTGTA GAAGCCCAGC TAGGCGGAGC TTTAGCAGGG ATGGGCAAAT GCAGCCTGCC ACTGGTTAAC GCGGTGGCTG CGTACGATCA TCCTAAAGAA GGAACGATCC TAATTGGTGC CGGTAACGTG GGATATGATG AAAGAAGCAC CCAGACGGAG TCGTTGTTCA ACACACACGA GTTACGAAAA CACGGTGTCA TTGTTTCGGA CACGACCATC 
CGGGATGGAG GGCTTCAAAG CATTGAAGTC GACGGAATTT CCATTGCATT GGACTTTGTC GACGAGAAAA CGCTTTCGTT CTACCTACGC AAACCGACCG AAAAGGAGCT AGAGAACTTG GAAATTCACT GGTTAAGTCC TCGAAGGCTA GTTAGATCTA GCATCCATCC TATTCGACGC ACGCCGGTTG CTATAGTTCC TGAGCGGGCT CCATGGGCCG AACGGCTTGG AAACTGCCCG GAGTTTACGT TATCGAAGAC TCTCCTGGCG ACGACGCAAC TATGTGCTGC CCCGGTCGAA ATGGATAAAC GAGAAGCTCC GCGTCAGCAT CGCAAGTCTC GCATTCATGC TTTGCATCCT CGTCGAATCG AGGGTCGCAC TGATTCGGAT ACATTCTTCT CGTCCGTTGA GTCTATTGAA GGGTTTCTGT GCGTGCAGAT TTTCTTTTGC CATGAATCTA ACTATACTTA TGTTCGAGGT ATGAGAAAGG AGTCGCAGTC GCACGGAGCG TATCAAGATT TTATACGGAA TGTGGGAGCA CCTAATGTTC TATTAACCGA CAATGCTAGA ACGCAAATCG GTAAAAAGTG GACTAAGACT AGTCGGGAAA ATGTAACTCG ACAGATCAAG TCCGTTCCGA ACAATCAAAA CCAAAACCAA GCCGAACGCA AAATTCAAGA CGTGAAAAAG CGAACTATTC TCACTTTGCG ATATGGAAAA ACACCGCTCA CATTTTGGTG TTTTTGCCAA CAATTTATTG TTGACTGTTT GAATCATTCG GCTCACAAGG ATTTAAATTT TCGCACCCCG ATGGAAAAAA TGTACGGTCA CACGCCGGAT ATTTCCATGT TTCGATTCCG ATTCTGGGAA CCCGTTTGGT ACTATGAACC AACGGCCAAG TACCCAGCTC CTAATTTTCT CCCTGGTCGT TTTGTTGGAA TTGCCTGGGA CCATGGCGAC GCTTTTACTT ACAAGATTTG GACTACTCCC AATAACGATT GGAAGCAAGG TCGCGAGCTT GTTCGAAATG TCGTCCGCTC GCGTCATTTG GAGGAAAAGG AGCCAGTGGT CTCCTACCAA GATGAAGATC TTCTCTTCAG CAAGACGCAG CCGTCGCGTA CGCAGCGTCG TCGCTCGAAA AAGCGAAACT CTCGTAGCGA CGACTCCCGT GGAGCCAAAG AAAGAAATTT GGACGGTAGT GAATTGGAGT CACTGGTCCG TTTTGATGAT ACTCCTCCTA TCACCAGCGT GGATTCAGAG GAGCAGGGGG GCGACAAAGC GGAAAGCCAA AGCGATCACT GTTCTCCCGT CGAAGTTGAA ACGTCCGACG GTGACGAGCT GTACGAAGAC AGTGAGTCAA AGACAACTAA CTCTACTATC AACTCGAAGA GGGCTCGCGA GGAGGTCGAC GGTTCAGCAC TTCAGTCTTT CGACCCGCTA AACAACGACG CGGAGGTCGA AATGACAAGC GAGATCAACG ACTACCTAGA CATAAACGAG TCCACGGCTT CAGGAGTTGG CGGAGCCCAT GTCACGAAAA TCGTCGGACA CAATTGGATG GACGGTCGAC TCAAGTTGAA GGTAACTTGG GCCACCGAGC AGACGACTTG GGAGGAGCTT CGAGACATGA AAGAGGATCA CCCCAAGATG ACTGCAGAAT ACATCGTGAG CACAGGCGGC GTTAGTAGGT CAACAACGAG AGGAGACCGA ACACTAGACT GGGCAAAGAA GACACTTCGC GATCTGCAAC GAGCAGTACG AAGAGTTGCT AGGCTGTACG ATTTTCATCT AGACGAACAC AACGATATCT 
ATTCGGTTAG ACGCGTTCAA AAACGTAAGA AGAAGAAGAT CTACTCTATT AAGCCTATTT TGAAATATGG CGTCAGAGTT CCAAGAAGTG TTCGAGACGC AATCGAGTTG GACAAGTCTA ATGGTAATTC CCTTTGGCAA GACGCGATCA AACTTGAAAT CACAAGTCTC ATTGATTTGG AGTGCTTTGA ATTCAAGCCA AGCGATTTTT CGCCGGGCAA CGAATTTCAG AAAACTACCC TCATGACCGT GTTCGATGTC AAGCAAGATC TTCGTCGCAA GGCCAGATTA GTTGCCGGCG GTCACCTTGT CGATGCGTTG GATCACGACA TTTACTCTTC GACGGTCAAA GGCATCAGTG TGAAGTTACT CCATGTAATT GCGCACAAGG CCAACCTGAA GCAACTTTGC GGAGACGTGG CGAATGCCTA TGTAAATGCC TACACGAACG AGAAGGTATA TGCAAAGGCA GGACCCGAAT TCGGCAGCGA TCTCGTAGGC AGCATTGTTA TTATACGAAA AGCGCTGTAC GGCCTACGAT CGAGCTCGGA ACGTTGGCAT GCACATTTTG CGGATACCCT ACGTGCATTA CAGTTCAAAC AGAGTCGTTA CGACAAGGAT GTTTGGATCA GATTGGGCAA CGAAAGCCTA TTCTACGAAT ATGTTTGTAC ACATGTGGAT GATTTTATGA TTGTTTCTAA AACACCAGAA AAAATCATGG AAAGTATAAA GGCAATTTAT TCAGTTAAAT CCGTCGGGCC TCCAGACTAC TATCTTGGAA ATGATTACAA GAAAGATCGA AAAGGCCGGT GGTGCATTGG ATGCAAAAAG TATCTTGTCG AAGCAATCAA GCGAGTAGAA AATATTTTCG GGAGTTTGAA AAAATACTCG TGTCCGTCTG AGACAGGCGA CCACCCGGAA TTGGATTCTA CACCTTTGTT GAGCGACGAC GAACATCAAA AGTACCAAAT GTTGATCGGA ATCCTAGTTT GGGTCGTGAC TATCGGACGG ATTGACGTCG CCCACGCGAC ATCTTCTCTT TCACGCTTCA CTGCGTGCCC TCGAAAGGGA CATTTGGAGC GCCTTCTGAG GGTTTTTGGA TATCTGAAGA AGCGACCAAA TCGGCGAATT GTTGTAGATT CCCGAGATCC CATCTACGAA GGCGGAGAAG ATTCGTTATC CCGAGATTTC ACGAAAGAAC TTGGCGATCA GTATCCTGGA GCTTTTGAAG AAATTGACGC TAACCTTCCC AAGCCGCTGA TCGACGAAAT CGAAATAACC GTGTTTGTCG ACTCTGACCA TGCCCACGAC AAGGTCTCGA GGCGTTCCAT AACCGGTCTT TTGATCTTTG TCGGGCGTAC CCCTGTTTTT TATACGAGTA AGCGACAGGG AGCCATTGAA ACATCGACAT ACGGTGCTGA ATTCTGTGGA ATGAAAACCG CCGTCGAGGA GTTGATTGCA GTACGTTACA TGCTTCGATG TCTGGGTGTA AAGGTCGAAC ATGCGAGTAT GATTTGTGGG GACAACCTTG GAGTCATCCA AAACGCCACT ATATCAGAAA GTCTATTGAA GAAAAAACAC GTTGCGATCG CGTATCACAA GACGCGCGAA GCCGCGGCCG CCGGAATATG TCACCCTATC AAGACGGGTG GGGTCGACAA CTTTGCAGAC ACGCTTACCA AGGCACAAAC GATTAAAACG TTTTGCACGC TGGTAGGTGG GTTTATGTTT GGTTAATCCG GGCACGTGGC TACGGATGAA GAGGAGTGTT AGAATTGTGT GTGTAGCACA CGAGACACTT CGGCATGTCA 
CCAGATGACG CAGACGGTGC GTGTGTCATC CGTCGCTACT TTCCTTTGTC CGTGCAACTA TGGAAGCCAT TCGCTGGTTC TAGTAACATG TAA
|
Protein sequence | MEPLIINLVG LSDTEAALFA EAGITDRTTI ASLDLPAFTQ VLGDQVGTLA KRYKLKRVAE YLSSGKEIGP DTTMGEINLA LSSNHQNVGA NAPNPAAGLD PRHGTMKASI NTISKFSGDL EDFEDWSTKT AAALGITVYR KLLDGPPEVG NDFDAARNNE LYHMFVIALV DGAAMHIMED VKDQNGYAAW MAIKEWYGMS DTGRTIIDKY RSKLDTLLLD DATPAGTFVN HFKRFSQKLE ENGEGYTADT KRRQFLDKII DEDYDVVKQQ LEGDSTADFN ECVARIRMRE QVLMKDSTLS AKKARRFKSS EGGKSNSGGP SSGKIPSIPN SILNLVKPAS ARKNLIKWRG VWNSEGRILR SDELASTEYD GKGKTPSKRE HDNGSSDESV KTRNTQGKGG SSSKKGKRGK GRRITGVVRR TETKTSGTPD SSIRISMKDP DDHVEGDEYD DVEIESGQED DSTAPVKKER AKKQKNPSKR KHKRAKSRRS LISRRGRVGN EKPRAILDPG TECDIVGGDV WTVLEKVIGV EAQLGGALAG MGKCSLPLVN AVAAYDHPKE GTILIGAGNV GYDERSTQTE SLFNTHELRK HGVIVSDTTI RDGGLQSIEV DGISIALDFV DEKTLSFYLR KPTEKELENL EIHWLSPRRL VRSSIHPIRR TPVAIVPERA PWAERLGNCP EFTLSKTLLA TTQLCAAPVE MDKREAPRQH RKSRIHALHP RRIEGRTDSD TFFSSVESIE GFLCVQIFFC HESNYTYVRG MRKESQSHGA YQDFIRNVGA PNVLLTDNAR TQIGKKWTKT SRENVTRQIK SVPNNQNQNQ AERKIQDVKK RTILTLRYGK TPLTFWCFCQ QFIVDCLNHS AHKDLNFRTP MEKMYGHTPD ISMFRFRFWE PVWYYEPTAK YPAPNFLPGR FVGIAWDHGD AFTYKIWTTP NNDWKQGREL VRNVVRSRHL EEKEPVVSYQ DEDLLFSKTQ PSRTQRRRSK KRNSRSDDSR GAKERNLDGS ELESLVRFDD TPPITSVDSE EQGGDKAESQ SDHCSPVEVE TSDGDELYED SESKTTNSTI NSKRAREEVD GSALQSFDPL NNDAEVEMTS EINDYLDINE STASGVGGAH VTKIVGHNWM DGRLKLKVTW ATEQTTWEEL RDMKEDHPKM TAEYIVSTGG VSRSTTRGDR TLDWAKKTLR DLQRAVRRVA RLRVQKRKKK KIYSIKPILK YGHTRHFGMS PDDADGACVI RRYFPLSVQL WKPFAGSSNM
|