Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41167 |
Symbol | |
ID | 7199105 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 117242 |
End bp | 121948 |
Gene Length | 4707 bp |
Protein Length | 1469 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185210 |
Protein GI | 219130098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAAT CCAATAAGAA GGGAAACGAT GTCAAGACGG CGTTGGCGAC AGCACCTAAG TGTACTTGCG ATCATCCTTT TACTTGTACT TGTGGAAATC GACCTCCCCG ACCCTCCAAG GGTCACAAGT GGGATCCCGA GAGCCAGACG TGGGGTGGAA AAGGGCACAA GCAGAAGGGT GCATCTGGAC AGATAGCCTT GAAGAGTCAG GAAGCCAGGA CAACCGATGT CGGAAAAACT CAAGTTGCCC AATGGCAGTG CCTTCCTTCC CAACTATTGG AGGAAGTGTG CAAGCGTCAA AAGCGTATTT GTCCCAAGTA CAAGAACATT GACAAAGCGA AAGGTAAATT TAGGTACAGA GTCATACTTC CAGACGGAAA AGATACTCAA AAGGATCTCT TCTTTGTACC CGCTTCCTCG GTGGTAAACG AGGAACAAGC CAAAGAAGAA GCCTGTTTAC TGGCACTTCT TCAACTAACA CCGACTTTGC CGCACGAAAG AAAGCTTCCC GATCCCTACA AGTTGACTTG GTTAAACGCA ATCAACGCAC TAAAAACAGC TAGTAAAGAC GTGTCAAATG AGGCGAGCGT CACTTCGGGT TCTGCTCTCA CCAGGGTTGC TAAACCTAGC GCCATTTCAA CACAACCAAG TCCGGGGTGC GCGCAAGCGT CAGCCAATCT GATCAGGGGG ACTTCATATG CAAGTTCTGC GGAGCGTCGG AATATGTTAC AGGAGAAGAC CCGCGAGCGA AACGCACGCA TTCGACGTCA TGAAAATATA CGCATGGCAA ACCAGAATCA TCCCGTATTT ATGGGCGCAA GAATTCGGAA AGAGATCGAA CGCTTGCTAC GTGGCGACAC GAACTTCTTG CAGCATGATG ATGGAGAAGA TAATACAGTA GATGCTGTGG ACGATGACGC TCAGGAATAT GTGGAACATC GTCTCTGCCA TGAAGGCTTT ACGAAACGTC AATCACGTTC TGCATTCGAC GAAGTTGTGG GGAAGAACCC CGCTTTCACG GAGGACGAAT GGGAAAAGGC CTACGAAGCT TGTCTTCAGT GGTTATTGGT GCACTTGAAC GAAGACCAAC TGCCTGAAGG CTTCGATCCT CGAGGACGTA CTCTCGATGT AGTTGTTCCT GACTCTTTAA AACAAACCGG GACTAAGCAC AACTCTTCGG GTACAGATGG CTGTCCTCCA GAAACTCTTG TAGTCGCTGC ACACTACGGT CTGACAGTCC CAGAAGCTAA CGAGCTCTGT AAAATGGCTT CCAGAGGCAC AAGAGAACCT GAAGACGTCT TATGGGAAAT CGTCCATGAG GCAGTGGGTG CTACGTGTCA TCAGCTGAAT ACACCTCTTA TCGAAAGAGA CAGTAATGTA GAATCTACAC ACGAAGAGGT CGAAGCATTG CAGGCAATTT TTGGTTCTGA TTTCAGCTCT GTCAGAGAAG GCACCTTTGT TTCGAATTCA GTGATGCTCA AAGAATGGGG GCTTTCACTT TGCGTTGTTG TGGAGGAAGG TCTTTACCCT AGTAGACTGC CCGAAAAGGT ATTGCTTTCG GGTAAATGGA CTGTCGGACA GGTCGGTACA AGCATTCACT ATGAAATTGC CAAGTTCCTT TCCTCGTTTC AACCAGGCGA GCCTGTCTTT TTCGAGATCC ATGGTCTTGT GTTGTCTTTA TTGCAGAATG TCGAGAGGCT GAAAACAGAA TCGCTGGTAT CGTTGCTCGA TATCGACAGT GAAATAACAA GTACAATCAA GCGCTCGTTA GACGACAGAG CGCTAGAGCC CAGTCGGGAA AGGGGCAAGC TCCAGTCTCG AGTGATCCGT CGTGCTCGCG AACGAAGTCC TTTCTGGAGC AAGCTCCCTA CAGATACCCG GCCAGCTGTG GCACACCCAA ACATACCGAG ATCTTTGAAT TCGATACGAA AGTCTCTACC GGCAGCTTCC GCTCGAACGG AGTTTCTCCG CGTTATGAGA GAAGCGGATA AGGTCAGTAT TTGTTAGTCG TCGTAACTTG TCTTGATTCA AGTCTTTCTC ACGTGATCGG TCCGAACCAG CGTGGTCGAG TCGTTCTCGT TACTGGGGAT ACAGGATGTG GAAAGACGAC GTAAGTTCGA CCAAAAAACT TTTTCCCAGT GTCAGTTTCA TTTGCTATCA GTTCTGACTT TGTTTCTGAT CGACTCTATT GTTTTAGTCA GATACCTCAA TTTATTTTGG AAGAATCTCC AAATGATGCA AAGATCGTTG TTTGTCAGCC TCGCAGATTG GCAGCTACAG GTGTGGCTAC TAGGGTGGCT GAAGAACGAG GCGAACAACA GGCAGGAGTT GGTAGCGTGG GATATGTCGT TCGGGGGGAC TCTGCAATGG GCGAAAGTAC CCGGCTTTTG TTTTGCACAA CTGGAGTCCT CCTGCGCCAA CTACAGACTG AAGGAGCCCT AGACTGTATC ACACACGTTG TTGTTGACGA AGTTCACGAG CGCCATTTAG ATACAGATGT ACTCCTCGGA CTATTGAAGC AAAGTATAGG AAGCCGAAAA AACATTCGCG TTATTCTCAT GAGTGCGACA TTGGATGCCG GTCGTTTTGC AGCCTATTTT GGGGAGAATA CTCCTCGTAT CCATATTCCA GGACGCACCT ATCCTGTCAA AGACTATATG TTGGAAGATG TATTGCTGAT GACTGGATAT ATCCCCCGAA AACAGAAGAA GAGAAATGGT GATTCATCAG GTTCCATAGA TAAAGACGAG ACCTCGATGG AAGAAAGCAA CCTGGAAAGT GTTGACTTTC CACCGAAAGA GCTTACAAGC CACGGATTTC CAGTCGAAGA TCTTGTGAGA AGAATTGACG AAACCTTGGT GGACTACGAT ATGTTGGGCC AACTAGTGAA GCATCTTATA GCGAATAGCA GCGCCGGTAG CGATGGTTCT ATTCTCGTAT TTTTGGCAGG TGCTCCCGAA ATTAACAGAG CACAGGAGGC GGTTAAGCGT TGGACGGATG GGTTTCCTTT ACTGCTTCTT CAACTCCATG GCGGATTGCA GCCCCGGGAG CAGAACCTAG TGTTCAAGCC AGCCGCTACG GGACTGACGA AAGTAATTCT TTCGACTAAC GTGGCGGAGA CCTCAATTAC AATTCCCGAT TGCACTATTG TAATCGACAG TGCTCGTGAG AAGCAATCGT CTTACGATGC TGCGAATCGT ATGCCGCTTT TGCTTGAGCA ATTTTGCTCG AAAGCAAGTA AGTGAATGAT AAAAACGTCC TCTATTGTAT GTACTTTCGC ATCTCAACAC CTTTCTTACC TTTCGCAGGT CTTAAGCAAC GGAGAGGAAG AGCAGGGCGC GTTCGGGAGG GTAAATGTTA CAAGTTGATT TCTCGATCAA CATATGACGG GCTTCGAGAT CATGGAGAGC CAGAAATCCA ACGATGCGCC TTGGATCAAA CCCTTCTGAC TTTGCTTTTT CTTGGCGTCG AAAGTAGTGC AAAAGGACTA TTCATGGAGA GTCTTCTTGA TCCTCCCAGC AAAGTTTCAT TTGTTGCAGC TATTGATAGC CTTCGTCAGC TAGGTGCTAT TGCAACGCCT TCCGGGGAAG ATCTCAAATT GACCCCTCTA GGAACTCATT TAGCAGGCAT ACCCGCTCCT CCAATGGTTG GAAAAAGTAT GTCCGAACTC AATACTTGCG TCTGTTTGGT TTTGTCCAGT GCTCACTCGT TTCCCTTTTT AAAACAGTTT TGATTTTAGG ATCAATCTTA GGTTGCAGAG AAGCAGCCCT AGCTATGGCA GCTGCAATGA GCGTTGGTAG AAGCCCCTTT CTCAAAATCG ATGTTTCTCG CAAAAGAGGG AAAGACAAAA TTGACGAGCG AGCAGGGATC GAGGAAATGA AGAACCATCA AATTTTAGAA GGGCGAAGAA ATCTGTTTAC AATTGTTGGC AACAGTGATC ATGCACTTCT TGCAAGCGTC TTTTTGAAAT GGAAAAATCT TGACTCGGGG GGTGGTTCTC GGAAACGTTT CTGTGATTCC CTTGGTCTTA GTATTCCTGG TATGCGCGAT ATGTTGCAGC TATTCCGTCA GCTTGATACA GCACTGGCTT CGATTGGGTA TATTTCTTCT GTTGACTCCG ACCGAAACGG ACACTCATGG CGGATCATTC GTACGTGCGC AGTTGCTGCC ATGTCACCAG CTCAACTCGT GAAGGTGGTA CGACCCGCTA CTGTCTATCA CGAGACTGCC GAAGGCGCGA GGGAGAAAGA TGGCCAAGCG AAAGAGCTGA AATTCTATGT CCGAACTTCT GTCGACGCTT CCGCAAAATC CAACGGAAAC TCATGGAACG GAAAAGAAGA ACGTGTCTTT ATGCATCCAT CCTCGTCTAG CTTTGCAACT TGTTCGTATG GTTGTCCTTG GCTTGTCTAC TTTTCTTTGG TACGGACCTC GAAAGCGTTT CTGAGAGACG TCACCGAATG CAGCGCGTAC GCTTTGTTGC TCTTTGGAGG CAAACTCGAC GTGCAAGCTT CCAAAGGAGT GATTGTGGTG GATGGCTGGG CCAAGCTGTC GGCAAATGCT CGAATTGGCT CGTTGGTAGG CGGCCTGCGC TTAAAAGTCG ACGAGCTCCT CGAAAAGAAA GCTGCCGACC CAAGCTTTGA CGTTGCAGCT ACAAAGGAAA TGCAGCTTAT TGTTAAACTG GTTGTATCTG ACGGTCTTGG GATCTAA
|
Protein sequence | MGKSNKKGND VKTALATAPK CTCDHPFTCT CGNRPPRPSK GHKWDPESQT WGGKGHKQKG ASGQIALKSQ EARTTDVGKT QVAQWQCLPS QLLEEVCKRQ KRICPKYKNI DKAKGKFRYR VILPDGKDTQ KDLFFVPASS VVNEEQAKEE ACLLALLQLT PTLPHERKLP DPYKLTWLNA INALKTASKD VSNEASVTSG SALTRVAKPS AISTQPSPGC AQASANLIRG TSYASSAERR NMLQEKTRER NARIRRHENI RMANQNHPVF MGARIRKEIE RLLRGDTNFL QHDDGEDNTV DAVDDDAQEY VEHRLCHEGF TKRQSRSAFD EVVGKNPAFT EDEWEKAYEA CLQWLLVHLN EDQLPEGFDP RGRTLDVVVP DSLKQTGTKH NSSGTDGCPP ETLVVAAHYG LTVPEANELC KMASRGTREP EDVLWEIVHE AVGATCHQLN TPLIERDSNV ESTHEEVEAL QAIFGSDFSS VREGTFVSNS VMLKEWGLSL CVVVEEGLYP SRLPEKVLLS GKWTVGQVGT SIHYEIAKFL SSFQPGEPVF FEIHGLVLSL LQNVERLKTE SLVSLLDIDS EITSTIKRSL DDRALEPSRE RGKLQSRVIR RARERSPFWS KLPTDTRPAV AHPNIPRSLN SIRKSLPAAS ARTEFLRVMR EADKSFSRDR SEPAWSSRSR YWGYRMWKDD IPQFILEESP NDAKIVVCQP RRLAATGVAT RVAEERGEQQ AGVGSVGYVV RGDSAMGEST RLLFCTTGVL LRQLQTEGAL DCITHVVVDE VHERHLDTDV LLGLLKQSIG SRKNIRVILM SATLDAGRFA AYFGENTPRI HIPGRTYPVK DYMLEDVLLM TGYIPRKQKK RNGDSSGSID KDETSMEESN LESVDFPPKE LTSHGFPVED LVRRIDETLV DYDMLGQLVK HLIANSSAGS DGSILVFLAG APEINRAQEA VKRWTDGFPL LLLQLHGGLQ PREQNLVFKP AATGLTKVIL STNVAETSIT IPDCTIVIDS AREKQSSYDA ANRMPLLLEQ FCSKASLKQR RGRAGRVREG KCYKLISRST YDGLRDHGEP EIQRCALDQT LLTLLFLGVE SSAKGLFMES LLDPPSKVSF VAAIDSLRQL GAIATPSGED LKLTPLGTHL AGIPAPPMVG KSCREAALAM AAAMSVGRSP FLKIDVSRKR GKDKIDERAG IEEMKNHQIL EGRRNLFTIV GNSDHALLAS VFLKWKNLDS GGGSRKRFCD SLGLSIPGMR DMLQLFRQLD TALASIGYIS SVDSDRNGHS WRIIRTCAVA AMSPAQLVKV VRPATVYHET AEGAREKDGQ AKELKFYVRT SVDASAKSNG NSWNGKEERV FMHPSSSSFA TCSYGCPWLV YFSLVRTSKA FLRDVTECSA YALLLFGGKL DVQASKGVIV VDGWAKLSAN ARIGSLVGGL RLKVDELLEK KAADPSFDVA ATKEMQLIVK LVVSDGLGI
|
| |