Gene Information |
Locus tag | PHATRDRAFT_31400 |
Symbol | |
ID | 7196987 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 388 |
End bp | 3359 |
Gene Length | 2972 bp |
Protein Length | 787 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176501 |
Protein GI | 219109493 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
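The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed gene length) and assuming the coding sequence ends with a single stop codon. The names `span_length`, `coding_bases`, and `non_coding_bases` are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 388
END_BP = 3359
PROTEIN_LENGTH_AA = 787


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# Assumption: one codon per amino acid plus a single terminal stop codon.
coding_bases = (PROTEIN_LENGTH_AA + 1) * 3

# Bases in the gene span that the coding sequence does not account for.
non_coding_bases = gene_length - coding_bases

print(gene_length, coding_bases, non_coding_bases)
```

Under these assumptions the span length matches the 2972 bp reported in the table, and the span is longer than the coding sequence alone would require. That is what one would expect for a record marked "Is gene spliced | Yes", although the record itself does not list the intron positions.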
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAAG AGGAGAAGAT TCTTGAGGTG GCCAAGAAGC TTATGCCTTC TAAGCCAACA ACCCTCAAGG ATATGAGCAA GTGGCGTTCC TTCTTTGAAA ACTGGAACTT GTACATGAGT CAGTGTCGCG GTGCTGCGGC TATCCCTCTT TCGTACGTTT ACCGTACCAA CGAGCAGCCC GAGACCGCTT TGGTCGGAAC CTATGTGAAT ATGGATGCCT ATTTGGTTGC CCAGACAGTA CTGTCTGGTT CCAACTTTGA GATCGATAAT CAATGGGTTT TTGACGAATT CAAGGAAGCA ATCACTACAA CCGGACCTGG TTGGTCTTTC ATCAAGACGT ACAACCGAAG CAAGGACGGT CGTGCTGCCA TTTTGAAATT AAAGGAACAG GCGGAAGGAA CATTAAACGA GTCCGTTCGC CGTGATGATG CCATCAAGAT CCTGTCAACT ACGACATACA ATGGTCCGAG TTGTAACTGG AATATTGATA TGCTGTTGCA GAAATTTCAG TATGCCATCT CGGAATTGGT CGAAATTGAC GGAGTCGCGT TGCCGGATGG GCAGCTTGTG ACTTATTTGG TCCAGGCATT GAAGGACCCA AGTCTGAGTT ATGTTCGTGA CACAATTCGC ACCAATGCCA CTTATCGGAA CAGTTTTCCG GAAGCGCAGC TTTTTGTGAA GACTTTTGTG TCTTCGTCCA CGAGCAAATC CGAAAACACG CCTCGACAGG TCAATGATGT GCAAACATCA GGTAGTGGGG CCTCCGGTGG GAGTAAGAAA GGAGGTACCG GGAAAGGAGC CAGCAAGCAG ACTCCCTTCA AGGGTGCAGT CACGGCTCGC AGTTATACTC CGGGAGAATG GAAAAGATTG TCCAAGGACC AACAGGAAAA AGTGCGATCG CTGCGTAATA AAAAGAAGCA AGGAGGGAAA CCCGAGGAAT CAGAGAGGAG TGTTGACAGT GTAGCACGGG ATGAGCCTGT GGACACTAAG GAAGTCCATA CCAGCAGTGA TATGGAACCG ACTTCAGATG CGGCTGGCCT GCAATTTGGC CGTGGTGCGT ATAAGAAATC GGTCGGATTC ACTGCGGACA CCGCTTCTCC TTCAGAAAAC GGAACGAAGA AGCAGAAAAC GCATCACGAT GCGTGAAACG CGGCACCCAA TGCCAGTGTT TCGGGGACTA AGCAATGCAT TTTACCAGAT CGAGTGATAT TGAGCCTCAC CTCTACACGC AGCATTTGTG ATCTCAACGC ATGCACTCAT CTTGGTGAGG GCCGCTGCGA GTTGGATTCA CATGCAGACA CATGCGTGGC TGGGGCAAAC ACTGTCTTGA TTGGTGAATC GCAGAAGTCC GTAACTGTGC GACCTTTCTC CGGTGAATAT TCTGCACTGA AGAATATCCC CATTGGAACG GTTGCCACAG CTTACACAGT ACCAGAAGAC GGGAGAGTGG TGCTTCTTAT TATTAATCAG GCCCTATTCT TTGGGGACAG ATTGAAAAAC ACCCTATTGA CCCCCAACCA GATGCGAGAC TTTGGCATTG AAGTTGACGA TGCCCCTCGG CAGTACGTCG CCAACTCCAA GCACTCTTTG TATGTTCCTG ACTCCCAACT TCGGATTCCG CTGCAGCTGC GCGGTATATT CTCGTTTTTG GAGTCGCGGA AGCCCACGCA ACAGGAACTT GACGAGTGTG AGCATATCAT ACTCACCTCT GATGTGCCGT GGGAGCCTTG CTCAACGGAC TTTGCCCGTC GAGAAGAAGA GGCCGCTAAG AGAGACCGGA GCGTATCATT GGTAGACACA 
ACGGGACTTT CCACTGGCCA CGCAATCCTA TCAGCACACC CATATGGTAT ACGAACTGTT GCGGCTTCGC AGCAAATACT TGAGACTTTT CGTTCCTTGA CAGAGGTTGA ATTGTGCGAG ACCAATCTGG CGGACCGCCT TATTGCCTGT GTTAATGTTG CGTCGGATGA TTACTGTGGA GACGGGTTGG ACGGTAGAGC TGACTTGGAT GTGTACCCGG ACTCAGAAGA CTTCACTCGT GTCGTCTCAG GTATGACATC AAGCGAAAGA CGGTCAGCGT TGACAGCTGA GGTTTTGTCG AAGCGTTGGA ATATTGGCCT GGATTCGGCC AAGCGGACTC TGCAAGTAAC AACGCAGAAA GGTGTGAGAA CGGTGATGCA TCCCTTGACC CGACGGTATC GTACTCGCCA ATCGCATTTA CGATTTCCTA CCATTCGGAC CAAGGTTTAC ACCGACACCA TGTTTTCGTC CGTGATTTCC ATCCGTCAGT ACAAGTGTGC CCAGGTTTTC ATAACCAACA CGGCCTATTC GCGTATTTAC CCTCTGCAGA CCAAGCAGCA AGCTCCTGAT GCACTAATGA AGTGGATACA TGATGTTGGG GTAATGAGTG ACCTAGTTTA TGATGGGTCT AAGGAGCAGG GAGGTGGCAA ACATTGGAGA GAGATTGAGC AGCGTCACCA TATACATCGC CATGTAACGG AGCCACACAG CCAGTGGCAG AATCGAGCTG AAGGAGAAAT TCGTGAAATT AAGAAGGCTG TTCGGCACCG ACTGCAGGTT TCTCGTGCAC CACGGCGCCT ATGGTGTTTT TGTTGTGAAT GGGTGTCGGC TATCCGTCGA TTAACTGCTC ATGACATTCC TGCGCTAAAC GGTCGAGTTG CCACGGAGCT TTTGGAAGGG GACACCCCCG ATATTTCTGA GTACGCGCAA TTTGACTGGT ATGAGCCTGT CTGGTTCATC GACCCAACTT CTGCTTTCCC TGAAATGAAG AAGAAATTGG GCCGATGGGT CGGAGTTGCA TCAGATGTGG GACAGGCGAT GACTTTTTGG ATTCTTCCAA AGTCATGCAT CCCAATTGCA CGTTCCTCTG TTGCTTGCGT CTTTCCAGAC GTAGCCGCTA CCGATGAATT TAAGGCTGAC CTTGCTGAAC TTGATCTAGC CATCGAAAAT AG
|
Protein sequence | MDEEEKILEV AKKLMPSKPT TLKDMSKWRS FFENWNLYMS QCRGAAAIPL SYVYRTNEQP ETALVGTYVN MDAYLVAQTV LSGSNFEIDN QWVFDEFKEA ITTTGPGWSF IKTYNRSKDG RAAILKLKEQ AEGTLNESVR RDDAIKILST TTYNGPSCNW NIDMLLQKFQ YAISELVEID GVALPDGQLV TYLVQALKDP SLSYVRDTIR TNATYRNSFP EAQLFVKTFV SSSTSKSENT PRQVNDVQTS GSGASGGSKK GGTGKGASKQ TPFKGAVTAR SYTPGEWKRL SKDQQEKVRS LRNKKKQGGK PEESERSVDS VARDEPVDTK EVHTSSDMEP TSDAAGLQFG RDRVILSLTS TRSICDLNAC THLGEGRCEL DSHADTCVAG ANTVLIGESQ KSVTVRPFSG EYSALKNIPI GTVATAYTVP EDGRVVLLII NQALFFGDRL KNTLLTPNQM RDFGIEVDDA PRQYVANSKH SLYVPDSQLR IPLQLRGIFS FLESRKPTQQ ELDECEHIIL TSDVPWEPCS TDFARREEEA AKRDRSVSLV DTTGLSTGHA ILSAHPYGIR TVAASQQILE TFRSLTEVEL CETNLADRLI ACVNVASDDY CGDGLDGRAD LDVYPDSEDF TRVVSVYDGS KEQGGGKHWR EIEQRHHIHR HVTEPHSQWQ NRAEGEIREI KKAVRHRLQV SRAPRRLWCF CCEWVSAIRR LTAHDIPALN GRVATELLEG DTPDISEYAQ FDWYEPVWFI DPTSAFPEMK KKLGRWVGVA SDVGQAMTFW ILPNHRK
|
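The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized, and translated. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty, and it uses only the first three blocks of the gene sequence as input. The function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Amino acids are listed in codon order with bases cycling T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three blocks of the gene sequence shown above.
raw_blocks = "ATGGATGAAG AGGAGAAGAT TCTTGAGGTG"
seq = clean_sequence(raw_blocks)

print(len(seq))          # 30
print(translate(seq))    # MDEEEKILEV
print(round(gc_fraction(seq), 3))
```

The translated fragment matches the first block of the protein sequence in the record. Because the gene is marked as spliced, translating the full gene sequence in a single frame would not be expected to reproduce the whole protein; the introns would need to be removed first, and their coordinates are not given here.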
| |