Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21664 |
Symbol | |
ID | 7202266 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 727789 |
End bp | 732187 |
Gene Length | 4399 bp |
Protein Length | 1186 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181792 |
Protein GI | 219122937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGCC CCATTATCCA AGATGTTTCG TTGTGCTTGC AGCCGGGCAA AAATTATCTC GTATTGGGGC CTCCCGCTTC GGGCAAGTCA ACCTTGCTCA AAGCGATCGC GGGGCAGCTG AAATCATCGT CGACGGAAAA ACTGGAAGGC CAGATACTGT ATAATGGACG GGAGTTGGAG GTATGTAGTG CGGTTCGCCT GTCATGTACA TTCCCTTACA GGGTGTGGTT GCCTGTGTCG GTCTTTGGTT CGCACCTGTC TCTCACTTCA TTCGGCTACG GATCCCGACT GTAGGTGGGT CGACGGCAGC AGTGGTACAT TGAAAATGCC TTTGCTTACA TTGATCAACT AGACAAGCAC GCACCACGGT TGACGGTCGA CGAGACGTTC GAATTTTCTT TTCAATGTAA AACCGGAGGC ACATTTCAGC AAGCTCAGGA TCCACGCGTT TTGCAGGACC CCAAAGTTAT GACAGCGATA CAAGAAGCCG ACCGGAGCAG ACTCGGTGTG AATATGGTCT TGGCGAGTCT AGGGTTGACG GAAGTTCGCG ATACGTTTGT GGGGAACACT GCCGTTCGTG GGGTTAGTGG AGGCCAACGG CGGCGAGTGA CCGTGGGGGA AATGATTACG TCTCGTCAAC CTGTCCTTTG CGGTGACGAA ATTTCGACTG GTTTGGATGC TGCGTCCACC TTTGACATGG TGCAAGTACT CACTCACTTT GGAAAACTAG CGCAAATGAC ACGAGTCTTT GCTCTGCTGC AGCCGAGTCC CGAGACTTTC AGTCTTTTCG ACGAGATCAT ACTCGTGTCG GAAGGCTTGA TTTTGTATGC TGGACCAATC GACGAGGTAG AGGATTACTT CGCTGAGCTT GGCTATCGAT CTCCACAGTT CATGGATGTC GCTGACTTTC TACAAACGGT TTCTACCGAG GACGGTAAGA AACTGTATCA CCCTGTCGAC GATAGCAAAC GGACCGAACC GCCTACTGTC GCAGATCTTG CCAATTGTTT TAAAACCAGT CAGCAAGGGA AAAAAATTCG CGATCGACTG GACGAACCCC CTCAGTATGT TTGGAAACAA GACGATCGAA TCTCACAGCA TGGAAGCATT GTCTCGCAGC TTACCTTGTT GAAGCAAGTG AAGAAAAAGT ACGCCAACTC CTTCTTTCGA AATACGTGGT TGAATCTGAA ACGATTCTTG TTGTTGTGGA CGAGGGATAA AAGAGTGATT TTCGCCAGTG CAGTCAAGAA CATATTGATG GGTGTCAGTG TTGGCGGAGT ATTCCGCGAC GTCGATGACG AAGTCTCTAT TTTAGGGGCT CTTTTTCAAT CAGGTCTTTT TATCATGCTC GGGGCAATGC AAAGTGCATC TGGGCTAGTA AACGACCGCG TTATCTTTTA TAAGCAAATG GACGCCAATT TCTTCTCGTC GTGGCCCTAC ACGCTGGGAA GAACTTTGGC AGGATTCCCA CAGGTACGTG TCTGACCGAT GGCATGCTTC TGTTTGCATC GCACGTTCTT ATACGGCTCT CTCTTGTAGA CCATCATGGA TGTCTTCACG TTTGGGACAA TTCTTTACTT TATGGTTGGA CTTAGCGATC GAGCTGTGAC CGAATATTTC TTGTTTATTG CAATTTTAAT GACTTTTGCA ATGATGATGA ATATGCAGCT AGCAGTGTTT GCATCGTTCG CTCCAGACTC TCAGCTGCAA GTCTACAGTG CTTGTACACT ACTGCTGCTA ATTCTGTTCG GTGGTTACAT TGTGGCGCCT GATGCCATCC CCTCGTTTTA TCTTTGGATA TATTGGTGGA ATCCTTTTGC TTGGGCTTAT CGTGCTTTGG TGATCAATGA GTTTCGTAGT TCACGATGGG ATGATCCAGA CGCGACGCTT GCAGGGATTG GTTTCGTGTA CGGTATAGAT TCTAGGCCAT TTGAACAGGA CTGGCTGGGG TATTGCTTCC TTTATATGAC CATTTACTTT TTCGGTTGCG TAGTTCTGAC GGCTGTGAGT CTTGGCTACG TGAGACAGAT CCCTGAGCCG ACACCGCCAG ACGTGAACAT AACAAGGCTT GTCTCGGATC CTGTATCAGA GCGTCGGAGG GTCAATGTAC CCTTTAAACC TGTGACATTG TCTTTTGCAG ACGTTTGCTA CGAAGTCAAA GCTTCAACAA AAAATGAAAC TCTAAAACTT TTGAATGGTG TCAATGGAAT TTTCCGATCA GGACGCATGG TACGAGTTTT TGATAGCTCT CTGTTCCAGT TGTTGTGGTA GTGCTGTTCT CATGCATCCT CTCCGTTGTC TTGCTGTTAG TGCGCATTGA TGGGATCGAG TGGAGCAGGC AAAACGACAT TGCTGGTAAG TACAGTGCAT CTATTGTTCC AAAGCATTAA CATTCACATG CTCACAATTT TCCGGTCATA GGATGTGATT GCTCTAAGAA AAAGGACTGG ATCAGTGACG GGTGACGTTC GGTTGAATGG ATGGTCACAG GACAAAATCT CGTTTTGTCG TTGCTCTGGA TACGTTGAGC AATTTGACGT CCAGTCACCG GAGCTGACGG TTCGGGAGAC AATTCTGTTC TCTGCTCGGC TCCGCCTCGA TCGCGATGTC GTCACAAGCG AAGAGGACCG GGAGGCTTTC GTCGACCAAG TCATTGACGA TATGGAACTT CTTCCTTTGG CTGATTCGTT AGTTGGTAGT GACGAGGGAA TCGGTCTAAG TTTTGAGCAA AAGAAGAGGT TATCAATTGC GGTTGAACTC GCGGCTTCAC CGTCTGTCGT CTTCCTGGAT GAGGTAAGGC TTTCGCTGTC TGTAATCAAA ACGATGTATT GTGCATTTTT GCTCTAACAT ATTGATTGCT CTAACTATTG ATTACTATGG CTGTTCGCAG CCTACGAGTG GTTTAGACGC CCGAAGCGCT CTACTTGTGG TAAGGGCGCT ACGCAATATT TCAGACAAAG GGCAAACCAT CGTCGCAACT ATTCATCAAC CATCGTCAGC GATTTTTGAG ATGTTTGTAA GTAACTTCAG CGCCAGGAAA ATATGCAGGT TCGATAAGAC CTCACATTCT TCTTTGGTGG CAGGACGAAT TGTTGTTGTT GAAACGAGGT GGGCAGGTTG TTTTTCAAGG AGACTTGGGA AAAGATTGCT CGCGTTTAGT GAACTATTTT GAAAATTTGG GGGCAACAAA GATCGAACTC GGGGAAAATC CTGCGAACTG GATGCTTCGG GTAATTACAT CGGAAGACAT GGGTGATCTT GCGCAAAAGT ACGTCGAGTC AAAGGAGTAC GCACTCCTGC GTAAAGATCT GGATGAAATC AAGGCTGTCC AAGATCCCGA GTTAAAAATT GAGTACAAAG ATGAATTTGC TGCCAGCAAG GCTGTACGAC AGCTACTTGT CAACGGACGC CTACGCTTGA TCTATTGGCG GTCACCAGCA TACAATCTAT CTCGCTTGAT GGTATCTATG GTGATTGCCT TTGTTCTAGG ATCGGTCTTT ATTCTTGTCC GGCATCCAGA AATCTACACC GAAGTGGAGA TGCGCTCCCG CCTGTCCGTA ATCTTTCTAA CGTTCATTAT CACCGGTATC ATGGCCATTC TTTCGGTAAT CCCCGTCATG ACCAAGATTC GGGAGATGTT TTATCGCCAC CAAGATTCAG GAATGTACGA TAGTGCCGCC ATTGGTTGGG CCCTCGGTTC GGCTGAGAAG CTTTTCATTG TTCTGGCCAC CACCATCTTT ACGGTTGTCT TTTTGAGCGT AGCGGGTATG ACCAAGTCGT TGCGTGGATT GTTTGGGTTT TGGGTACGTC TGCCCAAAGT GACTTGCTAC CAGGGACTTG AAAACGAACT CGTGACACCT CATATCGTTT CTTGCTTTTT GCTTGTAGGG ATTCTTCACG TTCAACTTTG CGATATACTC CTACTTTGGA CAGGCTTTCG TTTGTTTGGT TGAGAATCCT GCAACGGCAT TAATTTTGTC GAGTGTCTTT ATCGGCCTCA ATAACTTCTT TGCCGGTTTA ATTGTGCGTC CGCAACTGTT GGTTGGTTCG TTTTTTGCCT TTCCATTTTA CATCACGCCC GGTCAATACG TCTACGAAGG TATGGTGACC AGTTTGTACA AGGGCAGTCC CAAAATTGTA ACGGCCGATG TGGGCGGAGG CTTTTTCGAA TACTTGGTGG ACACGGGCGT GTGTGTTCCG CAACAGCCAG AGCCGTGTCA GGGGACCGTG TCCGACTTTA TCGACGTCTT TTTCGGCGGC GTCTTTACGG ACGATCATAT TTCTCGCAAC GCACTGATTC TGGGCGGTAT ATTGATCTTG ACACGAGTCT TGACCTTTGC CGGTCTCAAG TACATTCGTT ATAATTAGCT TACATAACCG AAAATACGTG AACACAGTG
|
Protein sequence | MQRPIIQDVS LCLQPGKNYL VLGPPASGKS TLLKAIAGQL KSSSTEKLEG QILYNGRELE QWYIENAFAY IDQLDKHAPR LTVDETFEFS FQCKTGGTFQ QAQDPRVLQD PKVMTAIQEA DRSRLGVNMV LASLGLTEVR DTFVGNTAVR GVSGGQRRRV TVGEMITSRQ PVLCGDEIST GLDAASTFDM VQVLTHFGKL AQMTRVFALL QPSPETFSLF DEIILVSEGL ILYAGPIDEV EDYFAELGYR SPQFMDVADF LQTVSTEDGK KLYHPHGSIV SQLTLLKQVK KKYANSFFRN TWLNLKRFLL LWTRDKRVIF ASAVKNILMG VSVGGVFRDV DDEVSILGAL FQSGLFIMLG AMQSASGLVN DRVIFYKQMD ANFFSSWPYT LGRTLAGFPQ TIMDVFTFGT ILYFMVGLSD RAVTEYFLFI AILMTFAMMM NMQLAVFASF APDSQLQVYS ACTLLLLILF GGYIVAPDAI PSFYLWIYWW NPFAWAYRAL VINEFRSSRW DDPDATLAGI GFVYGIDSRP FEQDWLGYCF LYMTIYFFGC VVLTAVSLGY RRRVNVPFKP VTLSFADVCY EVKASTKNET LKLLNGVNGI FRSGRMCALM GSSGAGKTTL LDVIALRKRT GSVTGDVRLN GWSQDKISFC RCSGYVEQFD VQSPELTVRE TILFSARLRL DRDVVTSEED REAFVDQVID DMELLPLADS LVGSDEGIGL SFEQKKRLSI AVELAASPSV VFLDEPTSGL DARSALLVVR ALRNISDKGQ TIVATIHQPS SAIFEMFDEL LLLKRGGQVV FQGDLGKDCS RLVNYFENLG ATKIELGENP ANWMLRVITS EDMGDLAQKY VESKEYALLR KDLDEIKAVQ DPELKIEYKD EFAASKAVRQ LLVNGRLRLI YWRSPAYNLS RLMVSMVIAF VLGSVFILVR HPEIYTEVEM RSRLSVIFLT FIITGIMAIL SVIPVMTKIR EMFYRHQDSG MYDSAAIGWA LGSAEKLFIV LATTIFTVVF LSVAGMTKSL RGLFGFWGFF TFNFAIYSYF GQAFVCLVEN PATALILSSV FIGLNNFFAG LIVRPQLLVG SFFAFPFYIT PGQYVYEGMV TSLYKGSPKI VTADVGGGFF EYLVDTGVCV PQQPEPCQGT VSDFIDVFFG GVFTDDHISR NALILGGILI LTRVLTFAGL KYIRYN
|
| |