Gene Information |
Locus tag | PHATRDRAFT_37226 |
Symbol | |
ID | 7202179 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 446365 |
End bp | 449946 |
Gene Length | 3582 bp |
Protein Length | 1165 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181378 |
Protein GI | 219122073 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.154451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG TACCGTCGGT TCTTTGAAGA GTACAGTATT ACTCAGGCCA GCGAGTTGGC CTCAATCACC GAAAATCGTC TTGCAACGGT GTCATATGGT GTTTTGACTC CTTCTGTGGG AGATACCCCT GCCATTATTG TTCGTATGTT TCTTCCGTCT GCTCAGCAGG ATCGGATCTT GAAGATTGTC AAATGGTTCC TCTCGAAAGG TACCGATGTG ACAAACGAAA CCTGGTTTGA ACTTACCCCT GAAGTCCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCTGCTAC CCCTGTTGGA TTGGACGCTC GGAGTTCCTT TGTTGAAAGT GCTGCCGCAA AGTTTCGGAA GACAATCAAG AATCACTCCG TTCCGTACCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG AATATTCGTA TCAAGCTTCG TATCCATGGC GTCCAGTTGG TTCTTGACCC GGATTATTTG TCCGAGACTG TCAACGAGAC GGATACATTT GTCGAAATGC AGAACTTTGT TTTTGGCGTG TTCAACAATA TATTATTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAA TTGGATGCTC AGGCTGTTTA CCGCGACCTT GTTGCCTCGT ATGGTAAAGG TATCAATGCG CAGATCACTG CTACATCCAT TGAAACGAAG CTCACTTTGT ATTCATTTGC AACTTCAAAG AGCAAGACCT ATGTTGCTTT TTTGACGACC TGGCGCAATT TGATTTACGA TCTTGAATGG ATTAACAAGT TCCCCTTGCC GGATCACCAG AAGAGCGTAC AACGCAAGTC AGCTGTCCGT TCCCATCCGC AATTGAAACT TTTCCTTGGA AATGTTCAGC TTTACTCTCG TACCCATGTG GGTAAGAGTG CCGACAATTC CGATTTTGAG TATGTTTATG ATTTGATGCT CGAACATGCG ACTGATATTG ATCTGACCGA TTTGGAAGAC CGCGGTAACA ACCGCGGTGG CCGCTCAGCA AACAATGCGA AGTCTCAGTC TTCTTCCAAG AAGAAAACTA ACAAAACAAT TGGTAAGAAG CACAAGAATT ATGTGCCTCC TGAAAAGTGG AATGCTCTCT CTCCCGAAGA GAAGTGGACC ATTATGGACC AACGAGGACC TTGCCCTGCT CCAGCTCCTG CCCCTGCCTT ATCAGTGAAC GCCGCTGCCA CTCAGCCTCC TCCTACGGTG TATGTCAGCG ACTCGACGGT TGTGGACAAT CAAAGCCTCG CTTCGACTCA CGTCCTGCCT GCTGCCGGAC CTGGTCAACT GCTTTGTTCG CTCATTTCGA ATTCAGGTTC CTGCCAGCAC CCTGCTCCAT CGAATGGAGC CACGTCTGAC TCTTTTTCGG TCAATGGGAC CACCTATTGC GGCAAAGTGA ACCGTGCTTC TGTGCAGTAC CGTCTTTCCA CTCACGATGT TTCGTTGAAT AAGGACTCTT TGATCAATGG TGGTGCCAAC GGTGGCCTTA GCGGCTCCGA CGTAACCGTT ATTTCGCAAT CCCTGTTGGA GGCAACTGTC TCTGGAATTG GAAATTCGGA ATTGACCAAC CTCCGTTTGT CAACGGTGGC TGGACTCATT CACACGACGG ATGGTCCCAT TATTGGTGTG TTGCACCAGT ATGCTCATCT TGGTGTTGGT AATACCATTC ATTTGTGCAA CCAAATGCGC TCCTGGGGAG TCACAGTTGA CGACGTCCCT 
CGTACTTTTG GTGGCAAACA GCGTATTGTC ACGTCCAATG GTCGTTTTGT CATCCCGCTT TCGGTTTCTG GCGGACTCAC TTACTTGTCT ATGCAGGCTC CTGCCGAGGA GGACCTGGAC ACTTTCGAAT GGGTGCCTTT TACCGCTGAC AACGAGTGGG ACCCAAATGG TGTCTCTTCT CCTGCCGCTG CCGACAATGA CCTCAGTTTG CAGCTTCCTG CCGGCCATGT CCCGTTCCGT GATGAACGCA TCAATAACTT TGGTCTCCTT GCGCATTCCG CGGCTGTCAG TTGATCCCCT TTGAATGCCG ATGCTTTGCA ACCCAATTTT GGATGGGTTC CCAGTGCTTG TATGTCTCGC ACGTTTGAGA ATACCACGCA ATTCGCTCGT GCCCATGCCC GTTTGCCCCT GCGCAAACAC TTCAAGTCGC GTTTCCCTGC TGCCAATGTT TCTTGTTTGA ACGAAATTGT GGCAACCAAT ACCTTCTTCT CGGATACCCT TGCGGCCGAT GACGGCATTT TTAACCATGG TGGGGCTACG ATGGCCCAAC TTTTCGTTGG AAAAAGTTTG CAAATCACCT CTGTCTTCCC GATGAAGCGT GAATCCCAGT TTGCCCATAC TTTCGAGGAC TTTATTTGTA CCCATGGTGC TCCCAATGCC CTCCTCAGCG ACACTGCTTG TGCTCAGATC GGTAAACAGG CACTTCAGAT TTTGCCTATG TATGCAATCG ACGATATGCA GTGCGAGCCG CATCATCAGC ACCAAAATTA CGCGGAGCGC CGCATTCAAG AGGTGAGAAA GATGGTGAAC ACAATCATGG ATCATACAAA CACTCCTCCG GAATATTGGT TGCTCTGCGT ATCTTATGTG ACCTACTTGC TCAATCGCCT TGCTGTTGAA AGCTTGAATT GGCGTACCCC GCTTCAGGTT GCCCATGGAC AGCGTCCTGA TATTTCTGCT TTGCTCCTTT TCCGTTGGTT TGAACCCGTT TATTATTACA ATCCTGACCA TGCGTCTTTC CCATCGGCTT CTTGCGAGAA AACTGGTCGT TGGATTGGTG TTGCTGAACA CAAAGGTGAT TCGCTGACTT ATTGGATTTT AACCGACAAT ACTCACCAAG CCATTGCTTG TTCTGTTGTT CGTTCAGCCA ATGTCAATAA TGTTTTGAAA AACCATCGTG CTGCGAATTC CTCTCCCAAT GGTGGGGAGC TTTCGAATCC TAAGCCCATT GTCTTGGCTA CGAGTGACCT ACGCCATGAC GCTACGGTCG ATCCATCTTT TGAGAAATCC CCTGCATTCT CTCCTGACGA ATTGATTGGC AGGTATTTGA TCCGTGAAGC CCCTGACGGC CAGAGCCATC GAGCCCTTGT TGCCCGTAAA ATTATTGATA CCGACTCTGA TAACCATCAG GCGATCCGCT TCTTGTTGCA AATTGATGAA AAGGATGCTG ACAAGATCAT TTCGTACAAT GAACTTTCCG ATTTGATGGA AGCCCAACAA TCAGAGCCCG CTACGAACGG AAATATCGAA GATCATTTCA CGTTTACTAG TATTATTGGA CACCAAGGCC CTTTGCAACC GACCGATGCT GGTTACAAGG GATCCTCTTG GAATGTTTTG GTTCAATGGG AAGATGGTTC CCAGTCGTAC GAACCTCAAA TTGAAATGGC TAAGGACAAT CCAGTCACAC TCGCGATGTA CGCGTCTGAC AACAATCTCC TTAACGTGCC CGGGTGGCGC CGCTTCAATC GTTTGCTTCG CAACCGTGAT GACTTTAATT GA
|
Protein sequence | MDPLEHVLVN LLGATTPDSS YRRFFEEYSI TQASELASIT ENRLATVSYG VLTPSVGDTP AIIVRMFLPS AQQDRILKIV KWFLSKGTDV TNETWFELTP EVLEYWQPAS AIVAPATPVG LDARSSFVES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL SETVNETDTF VEMQNFVFGV FNNILLTPRA RGILHKHVDE LDAQAVYRDL VASYGKGINA QITATSIETK LTLYSFATSK SKTYVAFLTT WRNLIYDLEW INKFPLPDHQ KSVQRKSAVR SHPQLKLFLG NVQLYSRTHV GKSADNSDFE YVYDLMLEHA TDIDLTDLED RGNNRGGRSA NNAKSQSSSK KKTNKTIGKK HKNYVPPEKW NALSPEEKWT IMDQRGPCPA PAPAPALSVN AAATQPPPTV YVSDSTVVDN QSLASTHVLP AAGPGQLLCS LISNSGSCQH PAPSNGATSD SFSVNGTTYC GKVNRASVQY RLSTHDVSLN KDSLINGGAN GGLSGSDVTV ISQSLLEATV SGIGNSELTN LRLSTVAGLI HTTDGPIIGV LHQYAHLGVG NTIHLCNQMR SWGVTVDDVP RTFGGKQRIV TSNGRFVIPL SVSGGLTYLS MQAPAEEDLD TFEWVPFTAD NEWDPNGVSS PAAADNDLSL QLPAGHVPFR DERINNFGLL AHSAANTTQF ARAHARLPLR KHFKSRFPAA NVSCLNEIVA TNTFFSDTLA ADDGIFNHGG ATMAQLFVGK SLQITSVFPM KRESQFAHTF EDFICTHGAP NALLSDTACA QIGKQALQIL PMYAIDDMQC EPHHQHQNYA ERRIQEVRKM VNTIMDHTNT PPEYWLLCVS YVTYLLNRLA VESLNWRTPL QVAHGQRPDI SALLLFRWFE PVYYYNPDHA SFPSASCEKT GRWIGVAEHK GDSLTYWILT DNTHQAIACS VVRSANVNNV LKNHRAANSS PNGGELSNPK PIVLATSDLR HDATVDPSFE KSPAFSPDEL IGRYLIREAP DGQSHRALVA RKIIDTDSDN HQAIRFLLQI DEKDADKIIS YNELSDLMEA QQSEPATNGN IEDHFTFTSI IGHQGPLQPT DAGYKGSSWN VLVQWEDGSQ SYEPQIEMAK DNPVTLAMYA SDNNLLNVPG WRRFNRLLRN RDDFN
|