Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40843 |
Symbol | |
ID | 7198701 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 79994 |
End bp | 82964 |
Gene Length | 2971 bp |
Protein Length | 903 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184824 |
Protein GI | 219129288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA CCGATTGCTA CCGCGACAAC CAAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ACGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAACC CGAGTACCTC GCAATGACCG ACGGCGTTCC ATGCATTCCT CCTGTGCAGC CGCCTTTCGA CCCAGTTCAT GCTGCCAACG CCACCGCTCC TCAAATTGTC GAAGCTAACC GTCAGAACGA CAAACGTCAA AAGCTTTTTG ACCTTTACCA CAACGCCATT AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATTGA ATCTCTCGGT CACCCTACCC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTCTCTCA TCTCTGGGAA AATTTTGGTA AAATTCAGGC TTCGGATCTC ATTGCTAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC CAACACCTAT CCAGCAACTT TTCCAGCAGC TTGAAAAAGG CAATCAGTTT ATCATCGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCGTTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA AAATTGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT GAAAAACAGG CCCTGGGGCA CAAAACCGGC GCTACATTGC ACGACAAACA AGGCGGGTCG ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC ATCAACGTGG TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCATATCAC CGAACTCAAC ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC ACAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCCTACACC CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACTGA ATTGCATGCG CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT TTGTTCATGC ATCCTTATTC TCACCACAAC TTTCAACATG GTGCAAGGCC ATTGACGAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GCTTTGTTAC CACTTCAAGC TCCGGTAATG CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCGAA GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG CTATCCTCCA AAGGTTTGCG CCCCCAACTC CAACACTTAG ACAACGAAGC CTCCACTGCG TTACAACAAT TCATGTCCTC TGTTGACATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG TGCAGCACCG ACAAGAACTT TCCGCTTCAC CTTTGGGATC ACTTACTCCC ACAAGCCATC ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCATGGCT CGTTCGACTA CAATTGTACC CCTTTGGCTC CCCCGGGCAT CCGCGTACTT GTACACGAAA AACCGACAAT TCGCAGAACC TGGGCCCCCC ACGCAGCCGA CGGCTGGTAC GTTGGTCCCG CCATGAACCA TTACCGATGT TATCGCGTCT GGATCAGGGA GACCACCAGC GAACGCATTT CTGACACCCT GACATGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGACA CAATTGTCGC CGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCATCT CCCACGTCCC CCTTGTCGCC TCTTTCGGTC CACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCATCCC TCCCAGTTGC TCCCACGGCA ACCCTAAGTC CGCCAACTGC ATCGACTTCT TCACCTCGTC AAGTCCGCTT CCGAGACCCG GTCACTGAAT CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCCTCCGCA GTCACTTCCG AGGGTGCCTC CCCCAAACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC GTAGGGCCGC TCGAAAACTG A
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTKPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISEPEYL AMTDGVPCIP PVQPPFDPVH AANATAPQIV EANRQNDKRQ KLFDLYHNAI KAFRNQLLEA IPIEYIESLG HPTRGFNKVS PLEILSHLWE NFGKIQASDL IANDERMKAA WHPPTPIQQL FQQLEKGNQF IIASGQVMDE RIIARIGYQI IEKTGLFDLA SRDWRYKDEA DKTLANFKKH FQKANKDLAL TATSSSAGYH TANQSTVTKG KLYCWTHGIV HNTKHTSATC EKQALGHKTG ATLHDKQGGS TKTYQYTPPS SVAPNTPPLA SSPPFFPPDA IADTGCTGHF LSTNIAHIHC QPTVPGINVV LPDGRTITSS HITELNIPSL PPAARTAHIF PGLSNGSLIS IGQLCDHGCT ATFTSDTVRI ELNNTVVLRG GRSPYTRLWT LDSPVTPNPP ATELHAPVHD KNFANHLGDH SGTLADRIAF VHASLFSPQL STWCKAIDEG RLTTFPDITS AQVKRHPPQS VPMVKGHLDQ QRSNLRSTKP KVTLSASVDP DDINFDTNPV VQDPPAARTQ FLYADFAEVT GKIFTDPTGR FVTTSSSGNA YMLVVYDYDS NFIHVEAMKN RTGPEILSAY KRAHAMLSSK GLRPQLQHLD NEASTALQQF MSSVDIDFQL APPHVHRRNA AERAIRTFKN HFIAGLCSTD KNFPLHLWDH LLPQAIMTLN LLRGSRINPN LSSWAQLHGS FDYNCTPLAP PGIRVLVHEK PTIRRTWAPH AADGWYVGPA MNHYRCYRVW IRETTSERIS DTLTWFPSQV KMPSTSSRDT IVAAAHDLAH ALAHPSPTSP LSPLSVHERE ALSQLSDIFS KAANPVDSSL PVAPTATLRP LEN
|
| |