Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46953 |
Symbol | |
ID | 7204769 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 912593 |
End bp | 915105 |
Gene Length | 2513 bp |
Protein Length | 635 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185979 |
Protein GI | 219121513 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00635036 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTGTCAAC AATGTTGTCT CGCGTTCGTC ACGGTAGCTG TCAGCATCTG TCATCGACGA TCGTTCACTT CCGAATCCGT TTTCGGGTTT GAAGCTATCA GCATTTCACC AAAGATCCTC CATTTGTCCC CTACGAAACA CTCTTGGTGG TTACGTTTCT ATCCTTCCAA CAGAGAAGCG CTTGCTACGC AATCTGTAAT AGCTCCGTTT CCAGGCAGAA CACTTTCGAC TCTGAGTTGT CGGGATCTAT CGACGGACGG AAGGAACATC CATACACGTA TCGCATTTAC GTATCGCATA CACGCACCCG TCGTCGACAC GAAAAGGATG AACGCGTTTA CTACGGGACC GGAGTCAACT GCTGCCGACC GCCATAACAG ACGCGCCACG GGAATTTCTT CCGAGGCAGA AGAACGCCGG ATGGAAACCT TGGAAATCAT GAACACACCA GTCGAAAACG ATGCGGAAGA GGAGAATGAT GATGACGACG ACGAAGAAGA AGAGGCGCTG GCGCCTGTTG CTGGATGGAG ATTCTGGTGG ATCTTTGCCA GGTTTCCGCT GGGTCTGCTC CTCGTTAACA CGATCCTTCT CGCTTTAATC ACCCTATACA ATCCACAATA CTCGCCTTCG TATATTGTGG GAAACTCGAC GGACGTTGAA ATGAACGACG ACTTCGATTC CGACGAGCGA AGGGATTTCG GGCTTGCGCA GTTCAGTGCG CTCGAACTGC AATCATGGAT TGTCAAATGC GGGATGGTAG CCCTCTTAGC TTGCCTGGAT GCTATTGTCT TTTACTGGTT CACAGTACGT CTCAAGAAGG GCATGGAATT GCTTGCGCAA AAGGCTCAGC AAGAAGAGCC TCAAGGACGG CCAGTCGCAA ATGAGAATCT GAGCAGGACA AGAACGGCAC CGTTGCACAA GTTGGACTTG AACTACAGCG AACTGGACCA AGTACACGCT CGGGTGACCC AAAAGATAGA CTCGTATCTA TTTAGGGACC CAGCGCGGCA ACCGACCGTC CCAATCTTGG TCAGCCTGTT GTACCTACTG ACGGCACTCA CAGTGACTGC TTGTCTGACT ACGCTATCAC TTTTGATATT TTTGTACAGC TCCGAAGGAT CTTCCATGTG CTTGGAGAGC ATCACATCAC AGTCAAGTGA ATCCGTGGAA TTTGATATTG ACTCGATTGA AAATATCCCG CAAGAGTTAC AGGAATGGGC AAGTCCGAGA TATTCATATT CTGATGGATC ATCATACATT CATATGAGTG ACGGAACTAC TTACTTTCGC GGTAGAATGG CCGAAAAGGA ACATCATGGT TGGGATTCCT ATATGGATAC GGAAACACTA GTTGCGACCA ACGTAAATGG AAATCTTACT GTGTACAGTC AACTCCACGA ACCGCATAGC TTCGTGAGTA TTAACGAAGG TTCTGGTGAA ACAATGGAAG GATTCTGTTT CCTCTATACG GAGTTTGCTG GACACGACGA CGAGGAAATA TATGAATACA CAACGCAAGC AGTTGCGTGT GTATCTTCCA ATGAAAACAT AAGTCAGGGA TTTCGAAACA CAACTCTTCT AAACAGTGAC GGAGGGGGAT GGTTAGAATC CGTTGGCAAA GCTCACGATG GACGATACTG GATCAGGCTG CAAGAGGATA GATTTGTTGA TGAGTGGTCG TCGTATCAAG AGGTTTTGCA TATCATACAG CTGGACCCGC AGTCGATGAT GTATACTGTA GTTGTAAACT CAACTTCCTT CCCCGATTTT CAACAACCCT TGAGGAACGA AGGAAGTAGA TGTTTCCGCT GGACTAGCGG CATTGGGTAC GTTGCTGCCG CAATATCTCT GTTTCTTTCG GCGCTGGTGC TCCTACTATT CATCAAGACA AAGTCGGGAG CAGGTTGCTT GGCGTTGTCT ATCTTTGCAG TCCGACTTTG GCTGGAAGAA ACTTGGCTAG ACGAAACTTG GCTAGACATG CTGAGTCCAG TATTGCTTGT TTTTACATTC ATATGCTTGT GCACGGCATC ACTCAGCCTT GCCGTCCGAG AGATGGTGTT ATGGGGAATA TATAGCGTCA TTGTGGTGCA ATTGGTTTTA GCTTTTGTGA ACCGAGAATT TCCGGTAATG GGGACTATTG GACTAGGTAT GGGCCTCGCG CTGGACCATC CTGTACTTCA GCTGGGTGGG TGGATCGGAG CGCCTTTATC GGTTTGTATT CTCTTGTACT ATGCGATAGT TGGCTATTTC GGCAACACTT ATGGTGGCTA CTTCTACTAT GATCGACAGT ATACTACTAC TCTACTGATT GTTGCATGCA TCCCTGTAAG TATAATTATC GGTTGCGGGA TGGTGACGGC AGGCCTTTAC TTTTCAAGGT CTCGTGCAGT CTTGTTGTTC TATCTGAGGC GCTTCTGGCG ATCTCTTCGT GTGAAACTGC GGAGGCGAAG TCGGCCTCAA AGCAGCAATA GCGAAATGGT CTAGTTCACA TAAAATTCTT GTTTGTATTT ATG
|
Protein sequence | MNAFTTGPES TAADRHNRRA TGISSEAEER RMETLEIMNT PVENDAEEEN DDDDDEEEEA LAPVAGWRFW WIFARFPLGL LLVNTILLAL ITLYNPQYSP SYIVGNSTDV EMNDDFDSDE RRDFGLAQFS ALELQSWIVK CGMVALLACL DAIVFYWFTV RLKKGMELLA QKAQQEEPQG RPVANENLSR TRTAPLHKLD LNYSELDQGP SAATDRPNLG QPVVPTDGTH SDCLSDYAIT FDIFVQLRRI FHVLGEHHIT VKMAEKEHHG WDSYMDTETL VATNVNGNLT VYSQLHEPHS FVSINEGSGE TMEGFCFLYT EFAGHDDEEI YEYTTQAVAC VSSNENISQG FRNTTLLNSD GGGWLESVGK AHDGRYWIRL QEDRFVDEWS SYQEVLHIIQ LDPQSMMYTV VVNSTSFPDF QQPLRNEGSR CFRWTSGIGY VAAAISLFLS ALVLLLFIKT KSGAGCLALS IFAVRLWLEE TWLDETWLDM LSPVLLVFTF ICLCTASLSL AVREMVLWGI YSVIVVQLVL AFVNREFPVM GTIGLGMGLA LDHPVLQLGG WIGAPLSVCI LLYYAIVGYF GNTYGGYFYY DRQYTTTLLI VACIPVSCSL VVLSEALLAI SSCETAEAKS ASKQQ
|
| |