Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44995 |
Symbol | |
ID | 7199513 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 926228 |
End bp | 930154 |
Gene Length | 3927 bp |
Protein Length | 1217 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179092 |
Protein GI | 219116594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00833272 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGACAGGTC AGATATCCTT TCGTGTAGTG TTGCCTCACA AGTCAGATTC TAAACGGCAA CCATGATGAA GCTGGCGTTC TTGGCAACCC TCGCAACGTC CCTGGCGCTG GCGTACGGCA ATGATGGTTC AATGGTCGGT ATGTGCAGCA ACAGCTTTGT CGAGTTCGAT TGAGTGAAGA CCATAGGTAC TGGAATACTG ACATGAGAAA TTCCATCTCG CTCCTATTTG TCCAGCAGGC TCTAACGAAG AGCGAAACTT AGCCGCGATG ACTTGCACAA CCAAGAATTT GGACTTTAGC GAGTTTGCCA CCGGCACATA CTTGAGTAAC TTGGAGGCTG ACTATGGGGT GACTATCACT GCCGTTTCCC GTACAAGCAA GGGCTACACA CCCAACGGAG CCGCCCGTGT TTTCGACACG TCAAAGCCCA CCGGTAAGAC GGGACAGTCA ATGTGCGGAA GAAACGACGG GGATTCAGAT CTCGGATCAC CCAACTCCGC TTGTCCCGGA GGTGGACCAG GTCATGGACC TGGAGGTGCA CCAAAACTCG CCAACGGACA GAACAATCCT TATAAGAACT GCTCGCCCCA AGGCAAAGTA CTCATCATTC AAGAACGGAA CAAAAACTGT CCCGACGACA GTGCGGACGG CGGTACCATC CGCTTCGACT TCTCCAAAAC AGTGGACCTC GAGTCCGTGA CGTCCTTGGA TATCGACGAA GGTAACAACA CTCCGGAGAT CACCGTCTCG TACGGCAACG GCCAGGAGGC TTTTTATAAG CTGCAGGCTA CGGGCGACAA CGGTGTTTTC ACGCAAATGA TCAACAAGAG TGACGTCAAG TGGTTCCAGA TCAAGTTCTA CGGCTCAGGA TCCGTATCAG GCTTCAAGTG GGATGAGTGT GTCACAGCCC CAACGAAAGC CCCCACGAAA AGCCCAACAA AGGCTCCGAT ACCGGCCCAA ACCAGAGATG ATACTTGTCC AATAAAGAAC TTGGACTTTA GTGAATTTGC CACCGGAACC TACTTGAGTA ACTTGGAGGC TGACTATGGG GTGACTATCA CTGCCGTTTC CCGTACAAGC AAGGGCTACA CACCCAACGG AGCCGCCCGT GTTTTCGACA CGTCAAAGCC CACCGGTGCA ACGGGGCAAT CAATGTGCTC CTCTGGTGAC GGTGACTCTG ATCTCGGATC ACCCAACTCC GCTTGTCCCG GAGGTGGACC AGGTCACGGA CCTGGAGGTG CGCCAAAACT CTCAAACGGT CAAAACAATC CTTACAAGAA CTGCTCGCCC CAAGGCAAAG TACTCATCAT TCAAGAAGGC AACAAAAATT GTCCCGACGA CAGTGCGGAC GGTGGTACTA TCCGCTTCGA CTTCTCCAAA ACAGTGGACC TCGAGTCCGT GACGTCCTTG GATATCGACG AAGGTAACAA CACTCCGGAG ATCACCGTCT CTTACGGCAA TGGCGAGCAG GCTTTTTATA AGTTACAGGC TACGGGCGAC AACGGTGTAT TCACCCAAAT GATTAACAAG AGTGACGTCC AGTGGTTCCA AATCAAGTTC TACGGCTCTG GATCCGTATC AGGCTTCAAA TGGGCCGAAT GTGTCACAGC CCCAACTAAA GCTCCAGTAA AAGCCCCAAC CAAAGCTCCG ACAAATGCTC CCACAAAAGC ACCTGTTAGA GCTCCAACCA AGGCTCCAAC CAAAGCTCCG ACCAAGGCTC CAACCAAGGC TCCAACCAAA GCTCCAGTCA AGGCTCCAAC GAAAGCTCCA GTCAAAGCTC CGACCAAGCC GCCAGTCACG GCAGCACCAT CCGAGTGCGT GGACGGTATG GACGTGGTAC TTGTCAATAA GTCCACCGGT CCCGAGAGCA CCATTGATGG CAAGAGCCCA ATCAAAATTG TTAGCGGTGA CGGCCAGTCC GTCTCTTTTG AGGTGCACCA ATACTGGAAG AGTGGAGCTA GTAGTATCAG CTGGATAGCG ACTCAGTTCC GTACCAACGA TGGCAACACG GCGTCTGATG CATGGGAATG CGAAAAGATC GAAGAAGTCT CCTGGGGCAG GGTGAAGGAG TACACTGCTG AGTGTGTTGG AGGAGCAGCA ACCGTGACCT TGTGGGTACA CGACGGACAG TTCCAAAATA CTCAAAACCT GAACAGCCTG GTCCCTGCAA GGTGCAATCC TTCAAACGAC CAGTTTCGCA AAAAGATCAT GTATAATTAC ACGTTGCCTT GCTCCTCGAT ATGCGCTCCA TCCCCGACTA AAGCTCCTGT CAAAGCCCCG ACCAAGGCGC CAACCGGTAC ACGAGATGAA ATATGTGTCG ACGAAGTCCT CGACTTTACT GACTTTACTA CAGGCGAGTA CGTCCACGAC CTGGTACGAG CTCGCGGCGT TACAGTGACA GCAATTGCAT CCGGAAGCGA CGGATACACG CCCGGCGGTG CGGCTCGCAT TTTCGACACT CGCTACCCTT CCGGCAGCAC TGGACAAGCG CTCTGCGCCC AGAACGAAGG TGAAACAACT CTCGGGTCAC CCAACCTTTC GTGCCCCGGC GGTGGACCCG GATCGGGTAA CGGAGGCAAA GTCAACACGC CCTTCGCCAA CTGCGACGCT CGCGGTAAAG GTCTCATCAT TCAAGAAGGA AACGTGGCCT GTCCTGAGCA CGCTGGACAA GGCGGGCAAA TCGTGTTTGA GTTTGCGGTA CCGGTTGAGC TCAACTACAT CGATTTGCTG GTTAGCACCG ATTCCAGCCC CGTAATTACG GTGTACTACG GCGTAGACCA ATCCATGTCG TTTGATATGC CGATGATGGG TGCCAATGGC TACCATCGGC AAGTGATCGA TCGATCGCAG GTTTACAAGG TCGAGGTGGG CTTCTGCAGT GGAGGTACCG TTACTGCCAT AGATTACGTT CGTTGCGAGC CGGAGGGTCC GCCAACGAAA GCTCCAGTGA TAGCTCCAAC AAAGGCTCCT GTCAAAGCCC CGACCAAGGC ACCCATTGGT ACCCGAGACG AAATATGTGT TGACGAAGTC CTCGACTTTA CTGACTTTTC TACAGGCGAG TACGTCCATG ACCTGGTACG ATCTCGCGGC GTTACAGTGA CAGCAATTGC ATCCGGTAGC GATGGCTACA CCCCAGGCGG TGCGGCTCGC ATTTTCGACA CTCGCTACCC TTCCGGTAGC ACTGGACAAG CGCTCTGCGC CCAGAACGAA GGTGAAACAA CTCTCGGGTC ACCCAACCTT TCGTGCCCCG GCGGTGGACC CGGATCGGGT AACGGAGGCA AAGTCAACAC GCCCTTCGCC AACTGCGAGG CTCGTGGTAA AGGTCTCATC ATTCAAGAAG GAAACGTGGC CTGTCCTGAG CACGCTGGAC AAGGCGGGCA AATCGTGTTT GAGTTTGCGG TACCAGTTGA GCTCAACTAC ATCGATTTGC TGGTAAGCAC CGACTCCAGT CCGGTAATTA CAGTGTACTA CGGCGTAGAC CAATCCATGT CGTTTGATAT GCCGATGATG GGTGCCAATG GCTACCATCG GCAAGTGATC GATCGATCGC AGGTGTACAA GGTCGAGGTG GGCTTCTGTA GTGGAGGTAC CGTTACTGCC ATAGATTACG TTCGTTGCGA GCCTGAAGAG GAGTGTCCGC CGAGTAGCGG TTCAGTCAAA CCGCTCCCTC CGATCGAAGT GCATCTTCCC CCGCCGAACA GCAAGCACAT GGTTTTTGAC TTTGTCGTTC TAAAGAATCA AGAATCGTGT CCTCCGGAAT GGCTTGGTCG CAGGGAACGT CGCGCTTTGG TAGATACCCG AGGACGACGT TGAGTCGATA ATGAAGCAAC GTGGTCTTCT CTCAAAATCG TCCACGACAA ACTCTTCATA TATTCATTGC GTTGAACCAT AGGGAAACGA TAATTACTTG TGGCTCATTA TTGGATT
|
Protein sequence | MMKLAFLATL ATSLALAYGN DGSMVAGSNE ERNLAAMTCT TKNLDFSEFA TGTYLSNLEA DYGVTITAVS RTSKGYTPNG AARVFDTSKP TGKTGQSMCG RNDGDSDLGS PNSACPGGGP GHGPGGAPKL ANGQNNPYKN CSPQGKVLII QERNKNCPDD SADGGTIRFD FSKTVDLESV TSLDIDEGNN TPEITVSYGN GQEAFYKLQA TGDNGVFTQM INKSDVKWFQ IKFYGSGSVS GFKWDECVTA PTKAPTKSPT KAPIPAQTRD DTCPIKNLDF SEFATGTYLS NLEADYGVTI TAVSRTSKGY TPNGAARVFD TSKPTGATGQ SMCSSGDGDS DLGSPNSACP GGGPGHGPGG APKLSNGQNN PYKNCSPQGK VLIIQEGNKN CPDDSADGGT IRFDFSKTVD LESVTSLDID EGNNTPEITV SYGNGEQAFY KLQATGDNGV FTQMINKSDV QWFQIKFYGS GSVSGFKWAE CVTAPTKAPV KAPTKAPTNA PTKAPVRAPT KAPTKAPTKA PTKAPTKAPV KAPTKAPVKA PTKPPVTAAP SECVDGMDVV LVNKSTGPES TIDGKSPIKI VSGDGQSVSF EVHQYWKSGA SSISWIATQF RTNDGNTASD AWECEKIEEV SWGRVKEYTA ECVGGAATVT LWVHDGQFQN TQNLNSLVPA RCNPSNDQFR KKIMYNYTLP CSSICAPSPT KAPVKAPTKA PTGTRDEICV DEVLDFTDFT TGEYVHDLVR ARGVTVTAIA SGSDGYTPGG AARIFDTRYP SGSTGQALCA QNEGETTLGS PNLSCPGGGP GSGNGGKVNT PFANCDARGK GLIIQEGNVA CPEHAGQGGQ IVFEFAVPVE LNYIDLLVST DSSPVITVYY GVDQSMSFDM PMMGANGYHR QVIDRSQVYK VEVGFCSGGT VTAIDYVRCE PEGPPTKAPV IAPTKAPVKA PTKAPIGTRD EICVDEVLDF TDFSTGEYVH DLVRSRGVTV TAIASGSDGY TPGGAARIFD TRYPSGSTGQ ALCAQNEGET TLGSPNLSCP GGGPGSGNGG KVNTPFANCE ARGKGLIIQE GNVACPEHAG QGGQIVFEFA VPVELNYIDL LVSTDSSPVI TVYYGVDQSM SFDMPMMGAN GYHRQVIDRS QVYKVEVGFC SGGTVTAIDY VRCEPEEECP PSSGSVKPLP PIEVHLPPPN SKHMVFDFVV LKNQESCPPE WLGRRERRAL VDTRGRR
|
| |