Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35518 |
Symbol | |
ID | 7200771 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 94902 |
End bp | 98267 |
Gene Length | 3366 bp |
Protein Length | 982 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179975 |
Protein GI | 219118402 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACGG GAGTCACGCG CGGTGACCAA GCCACCGGGA TTTCTCTGCC AGCCGTACTG CGGTACCGTA CCGTACAATA CGTTATGCGT TGTTCGTTGT TCCCTAGTGG CCATCCCACA ATTCCATCCG GACTCTCCCT TGGGTTGGTC GTTTGACAAA CCTACCAAAA CACCACTCTA GCCCGACCTT GTTACCCAAC ATCTACTCAC GGTCCGAGTC TATCCGAGTG ATTCGTCGTC GTGGATATCG GTGAGAAAAC GAGACGCCCA GAGACTTTTT GAAGAGGCAA CGATTTCGCT TCCTACCCGA CACCATGATA GTACTGCGTG ATTCACACCC CGACGGGTCG AAGGATCCGT ATTGCAACGA CTACACTCGC AACAGGCACG GGTATGGATG CTGCTAGTGA GCTTGTTGTG TGCGAGTCTC GTCGTCCCCG CACAAAGTCA GAGTGCCCGC ATTACGGGCT TTTCCTTGCT GAACGCGGAC ACCCATCAAG TGATTCAACC ATTACGCAAC GGAGACGACA TTGATTTGTT CCGCGCCGGA ACGACGTTGC TCTCGATTCG TGCGGAAGTG TCCGGATCAC TCGAGGGAGG TTCCGTCCAG ATGATCCTGA ATGGACGAGT GCGCAACGTG GACGTCAGTC CTCCGTACAG TCTCGGAGGC GACAACAACA ACAACAACAA CACAACGCAC AATACGAGAG TATCCCGGAA TTGGCCCAGT GGGGTGGACA TTCCGTACAG GCTCGTATTA TGGAATTCCC GGACGGCACC GGAAACGTGC AAGACTCCCG TTCGATTGGC TTTGCCATCC GCAATTCGGA TCCCAACGCT CCCACGGCGG CACCCGTCAC ACCCGAACCT ACGACCCCTT GGACCGGCGA AAGTATCTCC CCCAGCACTG GTGACGGAGA TGACTTTGCC ACGGCGGCGC CTACGGCTTC CAACGTGCAG ACTCCTCCCA CCACACCCAC GTCGACGGTG GCCTCGCCTT TTCCTACGGC TCCACCCGTC CCCGTACGGC CACTCGAACC AACCGACGTA CACGCCTATC CCGCCAGTGT TCGGGGAACA CTCTCCGGTA CCTTGGAACC GTGGAGTAAG CTTACCTTGT GTTTCCTAGC CACTACTACT GATGACACCA GCGCGACCGC AACGACAACC TCCGCATTCA CCCACGAACG CAACGAAACC GTCAATCCCT TTACGGACAT TCGCCTCGAC GTCACCTTTA CCGCGCTCGA AGAACCCGTG GAACTCGTCG TTCCGGGATA CTACGCGGCC GATGGCCACG CCGCCCATAC ACACGCTACC GCCGGGGCCG TCTGGTGCGT ACACGCCACC CTGCCCTCGG AAGGCTCCTG GATGTGGCGC GCCAACTTTT GGCACGGTGC CAACGTCGCA CTCTTTGACG TCAACCACGG AGGCGTCGTC AAAACACCGC TCTTTCCCGT ACACGGGTCC ACCGGACAAT TCATCCTCAC GCCAACCACC GCCAACGGCG ACGACGAAGA CGACCTGGCC ACGGGCCGGG CTGTTACCAA CGCCACGACA CCGCGGCGGA CGCGGGGACG ACTCCAGTAC GTGGGAGAAC ACGCGTACAA GTACCCCAGT GGCAACGATT GGTGGTTGAG TTTCGGTGCC GCGAGTCCGT CCAACGGTCT CGCGTACGAT CGCTTCGACG GAACCACCAA TGCCGGGGAA CGCCGCAAAT CCTGGACACC CCACGCTGAC GATTACGTAT CCGGCAATCC GACCTGGGCC GGTGGACAAG GCCGAGAACT CGTGGGTGGT ACGTGCGTGT GTGTGTGTAC ATGTCGAACC CCCCAAAGTG ATGGTTGTTG TAGTGGTAGT GTTTGGTGTG AAGGTTGTGT TTGTGATTGT ATAAATGTGT ATACTGACGC TGTCGATTGT TCTCGTTTTC GTTCGTCTTC CATCTTTGTT ACAGCACTCA ACTACTTGGC GAGTCAGAGT TTGAATCTCG TGACATTTTC GACCTTGACC TTGGGTGGAC CGGACGGTAA CGTTTTCCCG TTTGTGTCAC CGCAACCATC GGATCGATTC CGTATGGACG TTTCCAAACT GGCGCAATGG GAAGTAGTGT TCCAGCATGC CGATGAACTC GGGTTGCTGC TGAATTTGCG ACTCGAGTCC GAGTCCGCAG CGGACGTCCT GGATGGGAAA GCTGGCGTCT TGGGACTTCG ACGACGCTTG TACTATCGTG AAATGATTGC TCGATTTGGT CATCATCTTT CCCTGATTTG GAATTTGGGC ACCGCAACAG CTACGGCTAG CTTCAGTACC GCAAACCAGC AATCGCTGAC CAACTACATT CGGAGTGTGG ACCCGTACGA ACATCCTGTG GTTTTACAGA CGCCATCGAA CCAGCAAGCC GAAGTTTACG AAGCCTTGTT GTCGAGTTCG AATGTAGCCG TGGAAGGAAC CTCGCTAGCT TCCGATCTAT ACGATACGTT CAACGATACG CTGATCTGGA GATCGTTGTC CGCCGAACAA GGTCACAAAT GGGTCGTCAC CAGTGAATAT CAAGGTTCGC AAGGCGCAAC CGCGGATAGG GATGATCCCA CGCACGATGA ATTCCGCGTT GAAGTCCTGT GGGGTAATCT TTTAGCGGGT GGTACCGGAG TTGCGTACCA TTTTGGTGAC GAAAGGGGCG ACAGCAGCGG GTGTTCCGAC TTGGCCTGTC AAGACTGGCG CAGTCGGGAG GCTTTATGGG GTCAATCACG CTATGCTCTG GAATTCTTTC GTGAAAACAG TATCCCGTTT TGGAACATGG GCAATTCGAA CGAGCGCTGC ACGGACGGCA ATCGATGCTT TTCTAACGAC GAATTTGTTG TGGTGCAGGT CCTACGAACC GACACACCCA GTCTCGTCGA CTTGACGACG CCGTCTCCCG TCGTCGCAAC GTACAGCTTA AAGTGGTTTG ACCCACTCCT CGGCGGACCC CTCCAGGATG GCAGTGTTGC CTCCGTGTTT TCCGGTCCTG CACAGGATCT TGGCACTCCA CCAACTTCAA CTGGCCAGGA GTGGATTGCT TTGCTCACAC GCAACCGGTT GCCACCCACG ACAGCCCCAA CAATTTCGTT GGCCCCAACA CAAAGTCCTC TTCTGGTTGT GGTGCCTCCG ACCCACGCAC CTCACGTACC AGGGACACCC ACTGGTACAC CCATAGAGAT GCCCTCGTTC AGAGAGTCTG ATTTCCTTTC CAGAACCATT GAACCGACCT CTGGACCCCC TAGTGAAGGC GTGTCGAGTG CCGTGGCTCC TACGGCGAAT ATCAGTGCCG TTATTCAATG GATTCTTTTA TTCTTGATCT TGGGGCTGGT ACGAGTGAAC CCATAA
|
Protein sequence | MSTGVTRGDQ ATGISLPAVL RYRTVQYVMR CSLFPSGHPT IPSGLSLGLA RVWMLLVSLL CASLVVPAQS QSARITGFSL LNADTHQVIQ PLRNGDDIDL FRAGTTLLSI RAEVSGSLEG GSVQMILNGR VRNSRRRQQQ QQQHNAQYES IPELAQWGGH SVQARIMEFP DGTGNVQDSR SIGFAIRNSD PNAPTAAPVT PEPTTPWTGE SISPSTGDGD DFATAAPTAS NVQTPPTTPT STVASPFPTA PPVPVRPLEP TDVHAYPASV RGTLSGTLEP WSKLTLCFLA TTTDDTSATA TTTSAFTHER NETVNPFTDI RLDVTFTALE EPVELVVPGY YAADGHAAHT HATAGAVWCV HATLPSEGSW MWRANFWHGA NVALFDVNHG GVVKTPLFPV HGSTGQFILT PTTANGDDED DLATGRAVTN ATTPRRTRGR LQYVGEHAYK YPSGNDWWLS FGAASPSNGL AYDRFDGTTN AGERRKSWTP HADDYVSGNP TWAGGQGREL VGALNYLASQ SLNLVTFSTL TLGGPDGNVF PFVSPQPSDR FRMDVSKLAQ WEVVFQHADE LGLLLNLRLE SESAADVLDG KAGVLGLRRR LYYREMIARF GHHLSLIWNL GTATATASFS TANQQSLTNY IRSVDPYEHP VVLQTPSNQQ AEVYEALLSS SNVAVEGTSL ASDLYDTFND TLIWRSLSAE QGHKWVVTSE YQGSQGATAD RDDPTHDEFR VEVLWGNLLA GGTGVAYHFG DERGDSSGCS DLACQDWRSR EALWGQSRYA LEFFRENSIP FWNMGNSNER CTDGNRCFSN DEFVVVQVLR TDTPSLVDLT TPSPVVATYS LKWFDPLLGG PLQDGSVASV FSGPAQDLGT PPTSTGQEWI ALLTRNRLPP TTAPTISLAP TQSPLLVVVP PTHAPHVPGT PTGTPIEMPS FRESDFLSRT IEPTSGPPSE GVSSAVAPTA NISAVIQWIL LFLILGLVRV NP
|
| |