Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49176 |
Symbol | |
ID | 7195667 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 129996 |
End bp | 133982 |
Gene Length | 3987 bp |
Protein Length | 869 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183932 |
Protein GI | 219127417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.450709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCGG TCGTTGTTAC CACGGCGTGT GGTTGTTTCG CACCGCTCCT GGACTCTCCC CCCAAACAAT CCACCCAGTT CTTGTACGAG TACCCCAACA CGTCCTTCGA CAACAACTAC AACAACAGCA TCAACGTTAG CACTAGCAAT ACTAGCGACC GTGACGGGAC TACCGACATT GACCGCAACA GGGAATGCGC AGCCGACGAC CAACTTGGGG ACGACGCATC GCACGAATAC CCGTACGGGA ACGTTCCACA CTTCCGCAAC GACAATAGCA GTAGCAACGA CGAACACGAC GACGACAGTA GCAACGGTAC GACGACGAGT CCGCGAATGG GTCGTCGGCG TCGCAAGGGC ATGAACTTTT TCGCTTCCAA AAAGTCTCGC GCCGCTGCCG CGGCCGCCGC GGCGGTGGAA ACGGCTCCGG CCGACACGTC CAGCTTTCGA CCGGCCGTCA AGGCCGTCTT GGAACCATTC ACGTCCAAAA GTAGTGTCGG ATCTCCGCGT GCCGATTCTG TCGTCGCCAA GGAAAACCCG CTGCCTCGCA ACCACGACAC TAGTCAACCT CAACACACTG AGGATAGTAA GAAGGAGAAG GATGGCGGCA GTACCAGTCC AAGCAACGGC ACCAACCACA ACAATGGTAA CTCCGGTACG CACCCGAACG ACAGTAACAA CAACAACAAC AACCAGAACA AGAACAGCGA ACCGTCGTCT CCCGTGTCAT CCGACGAACA ACGCGATCAC CATGACGACA CTCATAGCAA CCACAACGAC CACAGCAACA GCAACGACGA CGACAGCAGT ATCATTTCTA TTCACAATCA CGACAGGATC GACAATTCAC CCAAACGGCG AGAAGAAACT ACCGAACGCG TCACCACCAC CACCACCGAA GACGACGACG ACAATGACAA CAACGAGCCG GGAGAGGACG ACAATAGTAT CCTTGGTATC CGCGACGACA GTTCCTCCAC GACGAACCCA GCCCACTCTC CCATTCAGCG TCACGTTGCC ACACCAGAAG AAAACACAAG CAGCACGAAC AGCTCCGTCA TCCTCGAACC CTCCTCAGCT CTGCTCGACA ATCCACACAC CGCACTCCAC ACAACTCCCC ATCGCATCGA CAGCAGCGAC TCGGAACCCC GTCTGTTCCG GACGTTCACG CCCGGTGGAT CCACCACGGA CATTCCCGCC GTCGAAACCG CCGTATCCGA TCTCGGCGAC GTCTCGGAAG TTCCCCACGG GGGTCTTCCA GCCCGACAAC CAAGCGGGCC CTTGCGGACG AAGCTCCAGT TGGACCTGCA TACCGATCTG CTGAAGCGAT TGCGAAACAC CAACCGCATG CCGGAATACA TTCTCGCTAC GGAATCGTCG CGACCCAGCG AATCGCCCCC GCGACCCAGC ACCCCCACCG AGGCCATGTC GACGACAACC TCCCACGTGT CGGAACCCTC GTTGACCCCG GAAACGCCCG TGCAAGCCTC CCCCCTCGCC GTGGAAACAG ATTCGAGTCC GGCAGAATCT TCCCGAACGG TACCGGAGAC GCCCGATTCG TCCCCACGGA CACCCGTATC ACCGACAGCA ACGACATCCT CGGACGACAC ACCGTGCAGC CCAACAACGA CGCACTCTTG CGTATCGGAA TCCCATGATC TCCCCGAATC GTCACCGGCG GTACCGAACA CCGCTGCCGG AGTCGCCACC GAAGCCAGCG GACCTTCCTC GAGTCCAGAA GCGTCTTTAA CAGTATCCCC GGAATCGTCA AGCGTGCCTA GACCGCCGTT CCCGACACAA ACAACCGCCA CCTTGTTGGC TGAAGAAGCA CCCCAGAGCC CGGCAGAATC TTTCCAAACG GTACCGGATA CGCCCGATTC GTCCCCACGG CCACCCGTCT CACCGGCAGC AACGACATCC TCGGACGACA CACCGTGCAG CCCAACAACG ACCACCACGA CCGGCTCGGA CACGGCTGAA GAACTATCTC CTGCTCCGGA AAGACTTCCC GTACCCAAAC AGCTACTGAC GGTGCCGGAC GCAGCCACCA TCTCCGCAGC AGCCGAAGAA ACATTCTCGA CTCCGGAATC GTCTCCAAAA GCGGTACCGG AAACGACCGA CTCATCTTCA GTGCCATTGT CGACAGAATC AGAGACAAAT ACATCAGTGG GAGAGCTGCT GTCGAGTCCG GCAGCAAAGT CCTCGACAGC CTTGTCGGAC GGAGTTAGCC AATCGGCTGC ACAAATAGTC CTGCAAGCAT CCACATTTAC AGAAGAAAGA CCTTCGAGAC CTGTGGCGGA GTCGGAGCCG ACCAGTGTAT TGGCAGCGCC TTTGACGATT CTGGAAACTA GCCCATCATC AGGTAATTCA TGCTCGAGTC CGTTAGAGTC GCCCGAAAAA GACTCAAAAA CGCCCAGCAT AGATGTACCG CCTTCGAAAC CCGCACCAGA AATTACCGTA TCCGTCGGGG TGACACCCTT GGTATCCCTA ATGGGGTGTG AGGCAACGCC CAACGTACTT GTTCGGCCGT TGGAAACATT ACGAGAAAGC ACTCCGTTAG TAGTGAAATT GCCTTTGAGT CCAGCCGAGT CGTCAACGTC GGACGCGGAA ACACCGATGG CAAAATCGTC ATCGCCTCTT CCGAAAACAG CCAATTCAGC CGGAGCAGCG GCTTCTTGCC CAGTGGAATC CCTAGTGGCG GCCTCCGAAA CTCTAAAAGG TGCTGCAATA CCCTTGAAAG TACCGGAAAC AGCCATAGTA GCAGGAGAAA TCCTCACTTT TCCGGCAGAG TCGTCGATAG GAGTACCGGA AACGCATGAT GCATCTGCGC CTGCGCCAGT AATGCCGGAA TCTCCAAAAT CATCCGAAGA CACGCCCTCT AGATCCCCAC CAATAATACC TGTATCGGAA ACGCCCCATG GAGGGGCAAT CAATGAGAGG CCGGTAGTGT GTCCAGTAGA ATCGGAAGCC ACCACCTTAA CTGGAGAAAC GTCCCTAAAA CCGACGGTTA TCCTCACGGC ATCAACAGAC GAGTTGTCTT CTCCGAAAGA ACCGGAACGA TCTTCATCAG ACAGAGAAAC ACCATCGAGT CCATTGCAGA GTGTGACATG CCAGGAAAGG AGCGGGTCCC AAGGAACTTC TCCCGCTTCA GACTCCGTAA AACCATCTCT CATGCAAACC CCGCCATCCG ATACTCATTC GTATCCACAA AATGGAGGAT TGACACCCTC TCAAACAATT GCAAGCGCTA TCAACACCCC AATCGTATCA GAAACCATAT CGCCCCCCAA GTCCGATACA GTGGCTGCTA CATTAGCATC GGGTTTGTCC ACGCTTTTAT CCACCAAGCA GTTCAAGCCT CCGTCAATAT CTCAGGTTAT TCGCAAGGAT TTATGGAGTT CGGAAACCGG TGTTGTGTTT GAAGCACTTC AATGGATTAC AATCGAAGCC TTCCACGACG AAGGAGCCCG GGATACAATT GCTCGGACCG GTGGCCTGCT GGCCATTGTG CGGGCTATGG AGACGCATTC TTCGCACGCA CCGATTCAAA AAGCTGCTTG CCAGGCTTTA GAAAAGCTTG CATTGGATAT CGAGAACGAA CGCGCCATCA GCGATGTTGG CGGAGTTGAA GCAATTCTAG CGGCCATGAT GGGTCATTTG AACAACGTAT CAGTTCAGGA GGCGGCCTGG TCGGCCCTGC AAAATTTAAC ATGTGGGAAC GCCCAGGGGG CCATGACAAT TGACACTACG GGTGGCATGG TCTCGTTGGT CTCTGCAATG CGAACGCATT CTACCGAGCC ACGAGTACAA GCGAGCGCGT GCGGGACCTT CGCCAATTTA TGTCTGGATC ACGAAGATCG TTTAACGGCG TTGGCGCAAG CGGGCGGCTT TAGCGCCATG GCGGATGCCC TACAACTCCA TTGGGAAAAT ATGGAAGTAC GAAAGGAAGC AAGTCGAGCG CTGGCGGATT TGTTGGAAGA TGTTTAA
|
Protein sequence | MLPVVVTTAC GCFAPLLDSP PKQSTQFLYE YPNTSFDNNY NNSINVSTSN TSDRDGTTDI DRNRECAADD QLGDDASHEY PYGNVPHFRN DNSSSNDEHD DDSSNGTTTS PRMGRRRRKG MNFFASKKSR AAAAAAAAVE TAPADTSSFR PAVKAVLEPF TSKSSVGSPR ADSVVAKENP LPRNHDTSQP QHTEDSKKEK DGGSTSPSNG TNHNNGNSGT HPNDSNNNNN NQNKNSEPSS PVSSDEQRDH HDDTHSNHND HSNSNDDDSS IISIHNHDRI DNSPKRREET TERVTTTTTE DDDDNDNNEP GEDDNSILGI RDDSSSTTNP AHSPIQRHVA TPEENTSSTN SSVILEPSSA LLDNPHTALH TTPHRIDSSD SEPRLFRTFT PGGSTTDIPA VETAVSDLGD VSEVPHGGLP ARQPSGPLRT KLQLDLHTDL LKRLRNTNRM PEYILATESS RPSESPPRPS TPTEAMSTTT SHVSEPSLTP ETPVQASPLA VETDSSPAES SRTVPETPDS SPRTPVSPTA TTSSDDTPCS PTTTHSCVSE SHDLPESSPA VPNTAAGVAT EASGPSSSPE ASLTVSPESS SVPRPPFPTQ TTATLLAEEA PQSPAESFQT VPDTPDSSPR PPVSPAATTS SDDTPCSPTT TTTTGSDTAE ELSPAPERLP DLWSSETGVV FEALQWITIE AFHDEGARDT IARTGGLLAI VRAMETHSSH APIQKAACQA LEKLALDIEN ERAISDVGGV EAILAAMMGH LNNVSVQEAA WSALQNLTCG NAQGAMTIDT TGGMVSLVSA MRTHSTEPRV QASACGTFAN LCLDHEDRLT ALAQAGGFSA MADALQLHWE NMEVRKEASR ALADLLEDV
|
| |