Gene Information |
Locus tag | PHATR_33231 |
Symbol | |
ID | 7204308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 372217 |
End bp | 374973 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186050 |
Protein GI | 219112933 |
COG category | |
COG ID | |
TIGRFAM ID | |
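The coordinates and lengths in this table agree with one another under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in the Sequence section ends in TGA, and the gene is listed as unspliced). A minimal Python sketch of that check follows. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
START_BP = 372217
END_BP = 374973
GENE_LENGTH_BP = 2757
PROTEIN_LENGTH_AA = 918


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp, has_stop_codon=True):
    """Residues encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons; the final codon
    is excluded from the count when it is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


print(span_length(START_BP, END_BP))      # 2757
print(protein_length(GENE_LENGTH_BP))     # 918
```

Both results match the Gene Length and Protein Length fields, which suggests the record uses inclusive coordinates on the + strand.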
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGCGCA CAAAACGTCA AGCAACCCCT TCGGTTTCCA AAAATTGGGC CATGATCAAA GCGCAGCTTT TGAATAAAGC CCATTCCCCG GTGGGTTCTA TTTCACGACC CCAGTGGAAC CAACTTGTGG AGGCGCTTCA CAACCTAAAA GCCCCACTGG GCAAAAATCT TTATAGGGCT CTCGTGGAAG CCTTTGCTCT CCTGGATCGA ATGCATGAGG AGGTAACGGA AAAAAAATCG GCAATGCCTC GTCTGCCTTT ACCTCTTCCC ACGGTTCTCC TTCATCTGAC GATCGATTCG TGGCGCAGGC AGGCGGCTGA ATTTGCTACC AAAGGTAAAG TCTTTCCAAT CTCTGCCAAG TCTCTCCTAC AGAAAGTCGA GCACTTTACG GAAACAGGAC TATTTGACCC TAGCGCCAAA ACTTATGGAA TGCTTGTCGA GGGCATGGCA TCGACCGAAG ACAAGCATAA TGCGCCTATC TTGGCCGAAG AGCTTCTGGA GCGCATGACC AAGCAGTACG GCCAGGATCC TGCCGGGCGA AAACTTGAAT GTGCACCCAA TACTGTAATA GTCAACAGCA TCATTAATTT GTGGATAAAA AGTGGTCGGA GTGATTCCAT GGATGCCATT GAGCGGAAAT TTCAAAAGTT GAAAGACTGG TACAGGTTTT CTGGACGGGA TGATCAAAAA CCAAACGCAT ATACTTATTC GTCTCTAATC AGTGCGTTGG GTCGTTGCGG ACACCACCAA GCTGTCGAAC GAGCTGAAAC TTTATTGAGA GAATACGAAG CCAGCACAGG CAAGCAGCCT TGCACAGTCC TATACACTTC CTTCTGCCAG ACTCTGGCAA CAACAAACGC CCCCGGCGCT GCTGATAGAG CACAAGCGGT GCTGAACGAA ATGCTCTTTC GCTCACGCCA CGGGGAAGAT ATGGCGAGGC CAAATTCGTA TACCATCTTG GCCGTTGTTT CCACTTACTT GAAGGAAGGA AGAATTGATG AGGCTGAGGT GCTAGTCCGT AATATGGAAG ATCTATCCCG TCAAAGGGCC GACGATGGTT TGCGGCCCGG TATTTTTTGC TACAACTCCC TCATTCACGC TTGTGCCAAG AACGGAAATG CAGAGCATGC AGAATCGATA CTCAATCACT TGTTAGACTC GGCAGAATCC GGAAATAGTA TTCTGCAGCC CAACATTGTT ACATGGAACA ACGTTCTTCA CGCTTGGGCA AAAAGTAAGC ACCCCGCCCG GGCTCAGCGA GCATCAAATG TGTTTAAACG AATGCGACTG CTCGATCGGG CTGGTATATC AGGGGCGAGC GCCGATACCC GAACCCTTAA TATTCTTATC GACTGCTATG TGAACAGCCC AAACCCCAAA GCATTTGTTC ACCAATCCAT CGAGCTTTTC GAATCCGTTA AGAGCCAAGG TGTAGACTCG TCGAGTGGCA TTTTTGACCC TGTTTCATAT CGCGGAATGA TGGACCTACT ATGTAAGGCC GGCGAGTTTG ACAGAGCACT GCGACTTCAC AAGCGTTACA TAGAAAAGTC TACAGCTCCA GAAGCGCCGC TTCAACCAGA CAGGGCATAT TTTAATGTGC TAATGTCAGG ACTAGCACGG TCCGGATGGG AAAAAGCTGT TGAGTCTGTG GAAGCCATGT TAACCGAGAT GCATTCTCTT GCCAAATTAG GCTACAATAC CCATCCCGAT GTGGTCTCTT ACAACTCACT TCTAAATTGT TTTGCTACAT CGAGCCAAGT AAACGCACCG AGCAAAGCTT TCTCAGCACT TCGGAGAATG 
GAGCAATTGT GGGCAAACGG AGATTTATCG GCTGCACCGG ACGCATCGTC ATATTCAGCA GTTTGCGCTA CCTGGGCGAA TGCTGGTCTG GCTGAGGGTG CCGAGAAGGC TGAGGAAGTC TTGCGTCATA TGCTATTGCA CCGGGACCAG AAGATCCAAA GCACGAGCCA AGTGTTTAAC ACTGTAATGG TTGCATACGC TAGACAGGGT GATGCACCTA AAGTACAGAA GCTCTTCGAC GAGTGGCTCG AGACTGATAA CACTTATGAC TCTCGTATAT ATGTCACTTT GCTGCAAGCG TGGTCAAAAG CAGGCAATCC GGAATCTACA GCTAGTGTAC TCTATGAATT GATAAGACTG TTTGACAGCG CAGCCATACG TAATCCACCA ACGACTCAAA TGTTCAATGC TGTCCTCCAG GCTTGGCTAC GGTCAGGACG CAAGCACGCG GAAGTACAAA TAAAAGCGGG GGTCGATGAA ATGTCGTCCC TAGCAACCTC TGGGAGATTT CCGTGCGCCC CAGATGCTCT GACATACTCA ACTCTGTTTT CCGCTTGTGT GCGATCAGGC AGAGATGATC TTGGTGAACT AGCGCATAAT GGGCTTTTAG AATTGAAATC GCGTTTCGTC ACAACACGAA ATCCAATGTA CCGGCCTGAT TTGCGAATAT TTGCAGAGGC TATCATGTTG GTGGCTATGG ATGATAAATA CTCAACAAAA GATATTTTGT CGCAGCTACT ACTTGAGCTG AACGCAGTCA ACGGGACAAT CTGGAAAAAG CAAGGACAAA TCGCGATGAA TCGAATTCTG GCTGCTATCT CGCGGTCTAC CATCAATGAA AAGGAAGTCT TGGCTCAACT TGGAGTAGAG ATCATGAGAG CACAAAATGT TTCACCAGAC GAATCGACTC GAAAGTTTTT AGCTCGATGT TGCAATAAAG AGGGACAACA AGCTTGA
Protein sequence | MTRTKRQATP SVSKNWAMIK AQLLNKAHSP VGSISRPQWN QLVEALHNLK APLGKNLYRA LVEAFALLDR MHEEVTEKKS AMPRLPLPLP TVLLHLTIDS WRRQAAEFAT KGKVFPISAK SLLQKVEHFT ETGLFDPSAK TYGMLVEGMA STEDKHNAPI LAEELLERMT KQYGQDPAGR KLECAPNTVI VNSIINLWIK SGRSDSMDAI ERKFQKLKDW YRFSGRDDQK PNAYTYSSLI SALGRCGHHQ AVERAETLLR EYEASTGKQP CTVLYTSFCQ TLATTNAPGA ADRAQAVLNE MLFRSRHGED MARPNSYTIL AVVSTYLKEG RIDEAEVLVR NMEDLSRQRA DDGLRPGIFC YNSLIHACAK NGNAEHAESI LNHLLDSAES GNSILQPNIV TWNNVLHAWA KSKHPARAQR ASNVFKRMRL LDRAGISGAS ADTRTLNILI DCYVNSPNPK AFVHQSIELF ESVKSQGVDS SSGIFDPVSY RGMMDLLCKA GEFDRALRLH KRYIEKSTAP EAPLQPDRAY FNVLMSGLAR SGWEKAVESV EAMLTEMHSL AKLGYNTHPD VVSYNSLLNC FATSSQVNAP SKAFSALRRM EQLWANGDLS AAPDASSYSA VCATWANAGL AEGAEKAEEV LRHMLLHRDQ KIQSTSQVFN TVMVAYARQG DAPKVQKLFD EWLETDNTYD SRIYVTLLQA WSKAGNPEST ASVLYELIRL FDSAAIRNPP TTQMFNAVLQ AWLRSGRKHA EVQIKAGVDE MSSLATSGRF PCAPDALTYS TLFSACVRSG RDDLGELAHN GLLELKSRFV TTRNPMYRPD LRIFAEAIML VAMDDKYSTK DILSQLLLEL NAVNGTIWKK QGQIAMNRIL AAISRSTINE KEVLAQLGVE IMRAQNVSPD ESTRKFLARC CNKEGQQA