Gene Information |
Locus tag | PHATRDRAFT_56624 |
Symbol | |
ID | 7197121 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 295208 |
End bp | 299525 |
Gene Length | 4318 bp |
Protein Length | 1301 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177904 |
Protein GI | 219112305 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
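The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the record states neither, and the function and variable names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = feature_length(295208, 299525)
protein_length_aa = 1301

# Nucleotides needed to encode the protein plus one stop codon (assumption).
coding_nt = (protein_length_aa + 1) * 3

# Whatever remains would be non-coding: introns and any untranslated flanks.
noncoding_nt = gene_length - coding_nt

print(gene_length, coding_nt, noncoding_nt)
```

Under these assumptions the computed length agrees with the listed gene length of 4318 bp. The remainder being greater than zero is consistent with the gene being marked as spliced.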
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.187837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCCGCTG GCACACTTCA GTCGGGGACA CACCGGATAG ACACCCGTTA GAACAATTGC GCCGCTTCGC ACATTCTGTC CTCTCGATCT CGTACCATGA CGGCGGATGC CAAACCCTCC TTACCGGCAA CCGAGGAATC TCCCGCCGAG ACAGAACTGC GTAGTCGCCT GGCTCGGACG ATACAGTTGC GAGCAACGAC GCAGGTAGGT CGGACGATGC GACGGCATGC GCGAGTTGGC GACAGAGAGA CTGCCGGGCA TTGTCACTTT GTTGTGTGTG GAAGGATGTG TGAAGCAATG CATCCCAGTC CAACCGTTTG ATTCCATTTC ACTCGCTCAC GTTGTGTACT CTTTCACACC AACCTACTTT TTGGGAACAG TTGGAACTCC GAAAGAGTCA AAAAGAACTG GAACGTCTCT TGGTCGCTAT CCCTCCACGA CCCCGGCAAC CCCTCGCGGA CGTGAACGAA CAACTCCGGT ACGTACCATC GCCAACGCCA CAAAAACGCC GACACATCGG TGCCCCCTCG CTCCAGTCGC ACTCGCTCGG TAGCGCGATT TTCTCACTAG AATTCGATCC TTTTCTTCCC ACAGAACACT GGAAACATAC CGTGAAACGC ACAGTCTCTC CTTGGCCGAG GAAAAGGCGT TGCTCCGGCG AATACATTCC TTGGAAGCCG CGCGTTGCGT GTGCAACACT TACACGGCCG CTGACGTCGC AATACGGTGC GAAAAGGCCC TCACGGCGAG TTTGCGCGGG TCGCTGCAAT CGCAAACGCA ACAGATTGCC GCGCTAAGAG ATACCTTGCA AACGACCCGC ACGGCGCGGA AACTCGGTTG CGACGTCGCG GAACTCCAAA AGGTGCTCGT AGACTGTCCT CCGGCTCAAC TAGGTCGGGT CATTGGCAAG GACGGAAGCA ATGTACGGGC GTGGATGCAG AAGTACGGCG TGACCATTGA CGTTGATCGG GAACGACACC AACTCGCCTT GAAGGGGTCG CACCAGGCTT TGGAATGCAC CCTACCGGCC GTCCAGCGTA TAGTGGATAC AACGGAACGC GACTTATCCA CCCTTGTTTC CACCAGTACG GTGGCCTACT GGACCACTAA ACACATTACC GCTCTGGAGG ATTTGCGATC CCGCTATTCC CACGTCTACA TCGACATTCA AAGGGGCAAG CATCAAAAGC AGCACTTGGT ACGGCTTCGT GGGATCCCGG TCGACTTGGA CAGCGTGCAG CACGAAATGG AACTTTGGCG GATTGTGACG AAAACTGTGG CGGTAGACGC CGCCGCAGCA TCGGTCATCC TGGGCAAAAA AGGAAATAAC ATTACCCGCA TGGTACAGAC GCATCAAGTG ACCATGGACG TTAGGAATCC GAGTGGCAGT GCGAGCAACG ATCACAGCAA TGAGACCAAT AGCATGTCTA CCACCGCCAC GACAAAGATG GAAATTACCG GACCGGTGGA CAACGTCACT GCAGCCGTAC AGGAAATCGA ACAAATCATG GCCGACCACG AGCGAGTCGT GAAAACGGTA CCAGTGGATC GAAACGTGCA AGAAGTCTTT TTGCATCAGA GTGGCCTGGG TATCAAAGCC ACACATAAAG TCATTAACGA CGCGATTCAA ACTATTGCTC CTGGTGCGCT ACCCGTCAAC GTGGGCGACA AAGGGGACCA AGGCGGTAGT ATTGTACTCA GAGGGAAGGC CATGTACATG GAAACAGCAG TATCCATGGT AGAGAAGGAG GCTGCACGTG TACAAAACCT GATTGTCCAA ATTCGGGTCG ATCCTTTGGT 
CATTCCACTC TTAATTGGAC CGAGTGGGAC CAACATTCGG AAATTGATGC AAAATCATCC CGCAGCAACA CTGCTTATTG ATCGTGACGG GGGCATCGTA ACCATTGGTG GACTCGAGGC CGCGCCAATA CAAGCCTTGC AGGCGGAGAT TGAGGCAATG GTCCGAGACA ATCAATTGGA ACGTGTGAAG CTGGACGCTC AGACGTACCA TTCCGTGGTT GCCGCCGTAT TGCGCTCTTC AAAGATAAAA GAAATGAACA AGTTGAATAT CAAATTATTT ACTCAAGACG ACACGAACGA AATTGTCCTG CGCGGCAGTG CGGAGAGTCT GCCGCAAGCC GCATCGTTGG TGCGAGATGT CATTTCAGAA AATTACATTG CAGAACTTGC GGTAGATGTC GATGATTTGA AAGTCTTATT GGAGGGGGGT AAAAAGAGCC CGATTGTCGA ATTCTCCAAT TCATTTGGCG TGCAACTCAG TTCGAATCGG GAAACACAAG TCGTTACGGT ACGAGGGCCT CAGGACAGGG TCAATGAAGC TACAGCTGCA GTGAACAGTT TTTTATACGG CGGTGAAGGC CACAGAGTCG CCAAGATTCC ACTGAATCGA GACGGTGCCG GTGTCGTCAT TGGTCGCGGT GGCAAAACCA GGATCGACTT GGAAAAAAAA TACGGCGTCA CAGTCCAAGT ACATCGGACA AATGACCATG TGACGGTTCG TGGGACGGCA GATGCTGTGG AGGATTGCAA CTTAGAAATA GCAAGATTGC TTTTGACTGC TTCCGTAGTC GAAAGTCTGG ACGTCTCGGA AGAGCAATCT AAGGGTTTGA TCAGTGCTAG ACTTGCAGCA CTGATTCAAA AAACTGTCCC CGTGCGCATA AGTGTAGGTG AAACCAAGGT TACGGTACGC GGATGTAGGC ATGATGTCAA TGACGCTGTC GCCTTGTTGA AGGAGCAGTT GTATGGTATC TATGAAGGAC GAATTGTGTT GGATATAGAC TACTTTCACA AGATGCAGGA TGCCTGCAAA GATTTATCCC ATTTCACACG TATTAAAAGG AATTCGGGTG TTGAGCTATC GGTGGACGAA AGCGCAGCAG CCATTGTTGT GACAGGTCAG CGAGAAAAGG TCAAGGGCGC AAAGCTCCAG CTATTTTCCC TTTTTGGTTT TATTTTTGAT TCGTCGTTCA CGCAGCTTGC CGTCCCTCCG GCTTCGCTAT CTACTATAGG ACAAGTTGCG GTACTGGCTG AGGTTTCATC TGCTTCGGGT GGTGCATCTG TCCTGTTAGA TCGGGACACA CACGCGATTC TGATTTTCGC CCAAGAAGCC TCCAAAGTTT CCAAGGCGAA GAATGAAATT GAGAAGCGAA TGCAATTATC GTTAAGCCGA CTTCATGTGA TTGAACTCGA ATCATCCGAA GACTGGCTGG TAGCTACTAC GATAGGAAAA GGCGGTAAAA ATATCAACGC GCTGCGCAAG CGCACTGGAT GCTCAATCGA TGTCGATAGT ACCAAGAGAA AAATTGTTAT TTCGTCGGAA AATGAGCAAA GCTTCGATAA CGGCCGGAAG GAAGTAGAAG ATTTTCTAGA GAAAGAGCGG CAGCGATGTG TATTTTGTGA GATTCCGGAA CAATACTATC CGGCCTTCGT CGGGCGCGGA GGTGCCAACA TCAAGAAATT TTCCGAAACT CACAATGTGA ACATTCAAAC CATGCGAAAT ACACCGGTCA AAGGTCTTCG AATAACGGGC GAGAAAGACT TTGTTGCCGC TGCCAAAACA GCAATCCAGG AATGGATAGC TTTCCGTCAG 
CAAGCCCGGG AAGAAGCCGA CATGAGCGAG TCAATGCCGT TGCGGCACGA TCAAATTTCC GTGATCCTTG GTACGAAAGG GTCCACTGTT CGCTCATTAC AATCAGAATT TGGCTGCCGT GTCAATGTAG AGCGGCAATC TCCATGCTGT GTACTTGTGC AAGGTGGCTC CCCTGGAAAA CGGCAGGCAA CCCTTGCAAA GATCCGTGAG CTGCTGCTTT CCGATGCTGT ACTAAAGTCA GAGAACCTCG AACGAAGCAA CGATCCCGGT AGCCATTTCA TAGAAATGGC AGTACAATCT CAGTCAAAGG TACATCGCGA ACAGGTGCCA CAAACCCAAC GCAATGTAAA GAAAAAAGGT GTATCATCAT GGACTGAATA TTTCCCAGTG CTCGACTCAG GGGCGGACGA ACAGAAAACA GAAACGAGTT CTACATCGAC TTTTATAGAA AATACTAAAA ACGAAAGCCC TTCCTGGTCG ACGATCGTTC AAATTCCTAC AATGGCCGGT GAAATCTCGA ACGACGTCAA ACAACGCCGT TCGTTCGTCA GCATGGTCTC AGACGATGAT TGGGATGCCA CATCCGTCGG CTCCGACCCC GCGGACCATC AGCTTAATGC GGAATTGCGA TACGAGTCTG ACGCATGCAC GAAAGTAGCA CAACTCATGG GCAACTTACA GTAAAGTGCA AATGAGAGGT CTTGCTTG
|
Protein sequence | MTADAKPSLP ATEESPAETE LRSRLARTIQ LRATTQLELR KSQKELERLL VAIPPRPRQP LADVNEQLRT LETYRETHSL SLAEEKALLR RIHSLEAARC VCNTYTAADV AIRCEKALTA SLRGSLQSQT QQIAALRDTL QTTRTARKLG CDVAELQKVL VDCPPAQLGR VIGKDGSNVR AWMQKYGVTI DVDRERHQLA LKGSHQALEC TLPAVQRIVD TTERDLSTLV STSTVAYWTT KHITALEDLR SRYSHVYIDI QRGKHQKQHL VRLRGIPVDL DSVQHEMELW RIVTKTVAVD AAAASVILGK KGNNITRMVQ THQVTMDVRN PSGSASNDHS NETNSMSTTA TTKMEITGPV DNVTAAVQEI EQIMADHERV VKTVPVDRNV QEVFLHQSGL GIKATHKVIN DAIQTIAPGA LPVNVGDKGD QGGSIVLRGK AMYMETAVSM VEKEAARVQN LIVQIRVDPL VIPLLIGPSG TNIRKLMQNH PAATLLIDRD GGIVTIGGLE AAPIQALQAE IEAMVRDNQL ERVKLDAQTY HSVVAAVLRS SKIKEMNKLN IKLFTQDDTN EIVLRGSAES LPQAASLVRD VISENYIAEL AVDVDDLKVL LEGGKKSPIV EFSNSFGVQL SSNRETQVVT VRGPQDRVNE ATAAVNSFLY GGEGHRVAKI PLNRDGAGVV IGRGGKTRID LEKKYGVTVQ VHRTNDHVTV RGTADAVEDC NLEIARLLLT ASVVESLDVS EEQSKGLISA RLAALIQKTV PVRISVGETK VTVRGCRHDV NDAVALLKEQ LYGIYEGRIV LDIDYFHKMQ DACKDLSHFT RIKRNSGVEL SVDESAAAIV VTGQREKVKG AKLQLFSLFG FIFDSSFTQL AVPPASLSTI GQVAVLAEVS SASGGASVLL DRDTHAILIF AQEASKVSKA KNEIEKRMQL SLSRLHVIEL ESSEDWLVAT TIGKGGKNIN ALRKRTGCSI DVDSTKRKIV ISSENEQSFD NGRKEVEDFL EKERQRCVFC EIPEQYYPAF VGRGGANIKK FSETHNVNIQ TMRNTPVKGL RITGEKDFVA AAKTAIQEWI AFRQQAREEA DMSESMPLRH DQISVILGTK GSTVRSLQSE FGCRVNVERQ SPCCVLVQGG SPGKRQATLA KIRELLLSDA VLKSENLERS NDPGSHFIEM AVQSQSKVHR EQVPQTQRNV KKKGVSSWTE YFPVLDSGAD EQKTETSSTS TFIENTKNES PSWSTIVQIP TMAGEISNDV KQRRSFVSMV SDDDWDATSV GSDPADHQLN AELRYESDAC TKVAQLMGNL Q
|
| |
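As a quick consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch that assumes the standard genetic code (the Translation table field is empty in this record) and that translation begins at the first ATG in the excerpt. The helper name `translate_from_first_atg` is hypothetical, and the excerpt is a short stretch copied from the gene sequence above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Using this table is an assumption; the record does not name one.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str, n_codons: int) -> str:
    """Translate n_codons codons starting at the first ATG found in dna."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    start = seq.find("ATG")
    if start < 0:
        raise ValueError("no ATG found in sequence")
    if start + 3 * n_codons > len(seq):
        raise ValueError("sequence too short for the requested number of codons")
    codons = [seq[i:i + 3] for i in range(start, start + 3 * n_codons, 3)]
    return "".join(CODON_TABLE[c] for c in codons)


# Excerpt from the gene sequence above, spanning the first ATG.
excerpt = "CTCTCGATCT CGTACCATGA CGGCGGATGC CAAACCCTCC TTACCGGCAA CCGAGGAATC"
peptide = translate_from_first_atg(excerpt, 10)
print(peptide)
```

The translated residues match the first ten residues of the protein sequence (MTADAKPSLP). This suggests the gene sequence is listed in coding orientation even though the record gives the strand as minus. Because the gene is spliced, translating the full genomic sequence this way would not be expected to reproduce the whole protein.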