Gene Information |
Locus tag | PHATRDRAFT_49112 |
Symbol | |
ID | 7195335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 620918 |
End bp | 622614 |
Gene Length | 1697 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183642 |
Protein GI | 219126810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
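The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state this, but the reported Gene Length agrees with it. The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def orf_length_nt(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


gene_len = feature_length(620918, 622614)   # values from the record
orf_len = orf_length_nt(494)                # Protein Length from the record

print(gene_len)            # 1697, matches the Gene Length field
print(orf_len)             # 1485
print(gene_len - orf_len)  # 212 nt of the gene region lie outside the ORF
```

Because the record marks the gene as unspliced, the 212 nt difference is presumably flanking sequence on either side of the open reading frame rather than introns; that reading is an inference, not something the record states.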
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00796178 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGCGTTTG CTGAACCGAG AAAAAAGAAC ATCATAAGTC AACATGTCGA CTGCCACGCC GCATTGCCAC GCCTCGACGG TCGAACAGGA TGGCCTCGTC TACCAGACAG GTTTGGGGAA TCAGTTCGAA AGCGAGTGCA TTCCTGGCGC GCTGCCCCGC GGTCGCAACA ACCCGCGCAT GGTTCCGTTT GGATTGTACA CGGAACAGCT TTCGGGAACG GCCTTTACGG CACCGAGAGC GGAAAACCGC CGGGTTTGGT TGTACCGGAT TCAGCCCAGT GTCACTATTG GTGCCGCCGA ATCGCTCCCC CAGGAACCGC AATTTGCGGG TGGTTGTGAT CCGCGGGCCT GTGAAGCCGC GATAGATCCC TTGCGGTGGC ACCCCTATCC CGTCGATGGT GCGGCTGGTG TAGAGTACGA CTTTGTGTCG GGTTTGAAAC TGCAGTGTCA CGCGGGAGAT CCAGCCATGC GAGAAGGACT CGCGATATAC CTTTACAGCT TCGGCACCAA TATGAAAGAC CTACAGACGC ATTTCGTCGA TCACGATGGC GAGCTGTTGA TAATGCCGCA GCAGGGGTCG TTGGACGTTT TGACGGAGTT AGGACGCTTG ATCGTACATC CCACCGAACT CGTGGTGATT CCTCGCGGCG TTGTGTTTCA AGTCAATCAT TTCGAGGGCG AGAGCAAGGG CCCCATCCCA GGCACATCTC CGACGGCAAC AGCACGAGGC TACATGCTGG AAGTATACAA AGGCGGATTT GCCTTGCCGG AGCTGGGACC GATTGGATCG AACGGGCTCG CCAATGCTCG AGACTTTTTA CACCCAGTAG CCTGGTGCGT TGCGCAAAGC GACTACGAGA AACCGTGCTG TATTGTGGCC AAAATGCAGT CCCAACTCTA CGCCAAATCG TCCACACATT CGCCCTATAA CGTGGTGGCC TGGCACGGCA ACTATAGTCC GTACAAATAC AATTTGGAAC GCTTCTGTGC CGTCAATTCC GTCACGTACG ATCACCTGGA TCCCAGTATC TACACCGTTT TATCCTGCCA ATCCGAACAC GTGGGAACGG CCTTGGCCGA CGTTGTACTG TTCCCACCTC GCGTTTTGGC GACCGATGCC AACACACTGC GACCACCCTG GTTTCACCGC AACGTCATGT CGGAGTACAT GGGACTGCTG TACGGTTCGT ACGACGCCAA AGTGTCGTCG GGTACGGACG GTGCTGGTGG TTTTGTCCCG GGCGGTTCCA GTTTGCACAA CGCCATGGTC CCGCACGGCC CGGATGCGGC GACCTACGTC CGCGCCGTGG CGGATCCCTG TGACGCTCCC GTTTTGCTTA ATCGCGGTTT GGCGTTCATG TTTGAAACGT ATCTGCCACT CCGGGTCAAT CCACAAGCCT TGCGGGACGA AGCGTGGCGA GATGTTGACT ACACCGCGTG TTGGCAAGAT TTGACGGCGG CAGATTTTAC CGGATGGGAC TGGGTCAACG GTACGGTGGG GACGACACGG GAGGAAGACG ATCGGTAGCG AAGGGTCGAG ACGGCGTATC ACGCTGTCAA TGGAGGAGTA TGCGTGGGTT CTCATTTCCA CCGGCACGGA ACGAAAGACT ACCGTTTTGA ATGCGGCTTA CGAGTGCTCA CAGTCTGTGA ACGAACATTA ATTTCGCATA CATATAAAAA TCGACGTTAC CTTGTGC
|
Protein sequence | MSTATPHCHA STVEQDGLVY QTGLGNQFES ECIPGALPRG RNNPRMVPFG LYTEQLSGTA FTAPRAENRR VWLYRIQPSV TIGAAESLPQ EPQFAGGCDP RACEAAIDPL RWHPYPVDGA AGVEYDFVSG LKLQCHAGDP AMREGLAIYL YSFGTNMKDL QTHFVDHDGE LLIMPQQGSL DVLTELGRLI VHPTELVVIP RGVVFQVNHF EGESKGPIPG TSPTATARGY MLEVYKGGFA LPELGPIGSN GLANARDFLH PVAWCVAQSD YEKPCCIVAK MQSQLYAKSS THSPYNVVAW HGNYSPYKYN LERFCAVNSV TYDHLDPSIY TVLSCQSEHV GTALADVVLF PPRVLATDAN TLRPPWFHRN VMSEYMGLLY GSYDAKVSSG TDGAGGFVPG GSSLHNAMVP HGPDAATYVR AVADPCDAPV LLNRGLAFMF ETYLPLRVNP QALRDEAWRD VDYTACWQDL TAADFTGWDW VNGTVGTTRE EDDR
|
| |
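The gene and protein sequences above can be related by translating the open reading frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the Translation table field in the record is empty, so this is an assumption. The helper name `translate` is hypothetical. The example input is the first 70 nt of the gene sequence exactly as listed, including the spaces between 10-nt blocks.

```python
# Standard genetic code (assumed), built from the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = "".join(dna.split()).upper()   # drop the block-separating spaces
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for pos in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven 10-nt blocks of the gene sequence from the record.
excerpt = ("CTTGCGTTTG CTGAACCGAG AAAAAAGAAC ATCATAAGTC "
           "AACATGTCGA CTGCCACGCC GCATTGCCAC")
print(translate(excerpt))  # MSTATPHCH
```

The output matches the first nine residues of the listed protein sequence (MSTATPHCHA...), which indicates that the coding region begins at the first ATG, 43 nt into the gene sequence.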