Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50153 |
Symbol | |
ID | 7198940 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 205794 |
End bp | 208721 |
Gene Length | 2928 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184989 |
Protein GI | 219129635 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACGAC CCCTGTGGGT GCTAGTTCTT GCGTTGGTCG GAGTCGTCTT GGAATCGGTT GCAGCCCACA GTCAGGAGAA TTTTGGTGGG TTCTGCGGCG GCGGAGATGG AACGCCGACT TGTGCGATTG GAATTTGCAG CTCCCACAGC TTATGTGGCC CCTGTGAGGT TGGAACTTGC GAAACTTCCC TGGCGTGCTC CCTTGATGGA ACAAACACAA TTTGTGCAGG CTCGGACTTG GAACGACGAT TGAAACAGAA GTTGCCGCTG CGAGAGTCCG AGGCACCAAC ATCAACATCA CCGACTCTCA CGGAAAACGA CTATTATTGT ACCGGAAAAT GTACCTGTGC TGCAAACTTT TGCAACTCGG ATAACGACTG TGGCGGGGGA AGAGAGTACT GCCAAGGAGG TAATGTCTCC TCGATGTTTG GCTATAACAT GACAACAGCT TGTTCCGGTA CCTGTGTTCC GGCCCTTCAA CTCGACGCGT GTGGCGCTGG TCTGTGCACA ACAGACGTTG ATTGCGGCAG CGAGAACAAA ACGTGTCTAA ATCTAGTTGG TGAAGAATGT GGGGGCTTTT GCAACGGGCT AGCATCCACT GTCAGCCACG ACAGACACGA AATCTCTGTT CCTTCCGCGT GTGCCGACCG TTCGTGCAGT TCCGATTCTG AATGCAAAAC AGGAAACGAA TTTTGTCTCG ACTTTAATAG CAAGGTTCCT TGCAGTGGAA CGTGTGCAAC CACCACCACT TTAGATGCGC GGAGCTCCTG TATGATGAAC ACATGCTCAG TCAACCGTGA CTGCATCGAG CCTTTGGAAT TCTGCTCACA GGCCCAGGAA GGAACACGTT GCAGCGGCTT GTGCATTCCT ATGGAAATTG AAGAAGAGGA GGAGAGTAGT ACCAGAACCA GTACAATTAC AAATCCTTTC ATGGGCTCCA ACGACTTCTT GATGACCACC AACGAAGGTG TCTTTCAGGT TACTGGCTGC GCAATTGGGG CGTGTAGCTC AGACAGTGAT TGCAGCGCTG ATTCAGAGTT TTGCTTTGTT TCTGCAGATT CACCGTCCTG CTCAGGAGTC TGCTTACCAT TAGAAGAGAT TGAAGTTAAC GGTTCCTGCA CGCTTTCGTC ATGTACTTTT GATGAAGACT GCAACATTGG ATTGGAAACC TGCGTGGGCG GGGGTAGCGT CTTTACTCCT TGTTCAGGTT CCTGCCTCTT GCTTGATATT GAAGAAGAGG GAGACGATAG CCTTGTCGTC GTTAGCGAAT GCTCTAACTC GACCTGCAGG TCTGACTTGG ACTGCATTAC GGAACTCGAA GTTTGCGATG GAATGGATAC GCAGAACAAT GACACTTGTT CTGGGGTTTG CATTTCAATC GATGGGCAGC TTTGTCCGTT GGAGAGCTGC AATTCCGACG ACGATTGCAT CGATCTGACT GCTGTTTGCC AAAACGCCAC CGACAATTTC ACTTGTTCCG GAACCTGCGA ATCTATTCAG TGTCCCGTAA CTGACGAGGA GGTACAATCG TGCAGTATGG ATAATGATTG TCTTTCCGGA TTGGAATCAT GCCAAGGATT CGATCTGCAA TTCGCTTGCA GTGGGACATG CTCAATCTTG GCTGGCGAAT GTCCGGAAGA GGTTTGTTCG AGCGACGCAG ATTGTTCAGA TGGCGTAGTC TGTCGAAATA TCGACGAAGA GGACCCTACT TGTTCGGGGA CATGTCGCTC TTTTTCTAGC TGCGGGGGCC TCACGTGCAG CACTGACGGG GACTGCCGGC CTGGCCTGGA ATCGTGCGAA GGAGAGAGCG ACTTTTCTTG CTCGGGAAAT TGTACCCCAG TCGAGTGTCC AGTTTTTCTG TCGTGCGGCA GAGACGCTGA CTGCCAAGAA GGCCTGGAAC AATGTCAAGA CTTTGATGGC GTGACGTCTT GTTCTGGAGT TTGTTTAATC CCCAGCTGTC GAGAGGTTGG CTCCGAAAAC AGCTGCTCAC AGAATTCAGA CTGCAACGGA TTAGAGTTCG AATGCATCGG TTTCGACGGG GAAAGAGCTT GTACTGGCCA ATGCGTCCGT ATGCCTTGCC CCGGGGAGAC CTGCTCGACC ACGAGTGAGT GCATTGACGG ACAAAAATGT TCTGGTGCAG ATGATAATGA ACCGTGCAGC GGTAGCTGCG AAACCTTTTC ACTACGATTC CGCAATTATC GTGGCGAGCA GTTGCCAAAC TTTGGCAAAG GATTTGATTT CGGGACATTA TTCTTTACGA AACGAACGGT ATTTGTGAAT AAAAGGCGTT CAAACAATGA TTGAGATCCT AAACGCATGT CAATACCTCC CAAAGAATCC TGATTGTGCC GCGGTCGTAG CTACATTAGA GAATGACCAA GACCGTTCCG CGTGGAATTC GCTGGAGAGT GTTGACGCCG CAATTGAAGC TTGACGACAA ACAGCAAACA ATTTAAAAAC AAAGAGCATC GCAAAAGTAT TTTGATTAGC GAAGAGGTTG TGATCCTGAC TGTGAGAATG TTCCGTAGAC TCTTGGCGAA GAAAACCAGG TACTCTTATT TCCTCACAGA GTTTCGATTT ACATATAAAG TACGTTGCAT GACCTGCAAA ATTTTGTACA CTTTTAATAA CAATTCCCTT ACTGTTAGTT TGTTGTCCCA GAAGGGAAGA CCATGGCGGG AGAACAGAAA CATGAAGCTG AATTCGCCGT AAAACACCAT ATCTTGCATC CCAATCCCTT TTGTGCATGC TCGAAAAAGA TATCCACTAC TGGTATGGGT GAAAATTGAT GCACAGTGTT GTTGCTTGCC TATCTCCAGA CGTTTACAGT AATACAGTGC AGAATTCAAA TCAAAAAGCT GGAATACAAG GTAATTATGG TCGAACCGCT TTTTTTGGGA AAGGCAACTC TCTTTGACGT AACGAGAG
|
Protein sequence | MVRPLWVLVL ALVGVVLESV AAHSQENFGG FCGGGDGTPT CAIGICSSHS LCGPCEVGTC ETSLACSLDG TNTICAGSDL ERRLKQKLPL RESEAPTSTS PTLTENDYYC TGKCTCAANF CNSDNDCGGG REYCQGGNVS SMFGYNMTTA CSGTCVPALQ LDACGAGLCT TDVDCGSENK TCLNLVGEEC GGFCNGLAST VSHDRHEISV PSACADRSCS SDSECKTGNE FCLDFNSKVP CSGTCATTTT LDARSSCMMN TCSVNRDCIE PLEFCSQAQE GTRCSGLCIP MEIEEEEESS TRTSTITNPF MGSNDFLMTT NEGVFQVTGC AIGACSSDSD CSADSEFCFV SADSPSCSGV CLPLEEIEVN GSCTLSSCTF DEDCNIGLET CVGGGSVFTP CSGSCLLLDI EEEGDDSLVV VSECSNSTCR SDLDCITELE VCDGMDTQNN DTCSGVCISI DGQLCPLESC NSDDDCIDLT AVCQNATDNF TCSGTCESIQ CPVTDEEVQS CSMDNDCLSG LESCQGFDLQ FACSGTCSIL AGECPEESSN ASVSTGKELV LANASVCLAP GRPARPRVSA LTDKNVLVQM IMNRAAVAAK PFHYDSAIIV ASSCQTLAKD LISGHYSLRN ERYL
|
| |