Gene Information |
Locus tag | PHATRDRAFT_42826 |
Symbol | |
ID | 7196486 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1249962 |
End bp | 1253665 |
Gene Length | 3704 bp |
Protein Length | 1192 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176748 |
Protein GI | 219109991 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGTAT TCTTCTTTCT CCTCCTGGAG CTTTCTTCTG TGGCGGGCTT CACTGGATTG TGGTCTCGAT CTTCTGGCTG GAGAAACCAA GCTCACACTT CATATTCTTC TTGGCTGAGA GTCGTTGCTC TGTCCCCTGA TGCCGACGGC ACTACAGACC AAGATGATTT CAGTCTCCTT AGGAATGCTA CAAATAATAA GGCACCGGTG AGAAGCAATG AAATAGGCAA TGGACGGATA GACTTCAATC AATTGAGTGA GATGAAAGAG TCAATCGACA TTATATCGGT CATTGAGTCA TATAATCTAG ATCGCTTCGA ACGGAAATCC GACGATCGAG CTACAGCTCT TTGTCCCTTT CACGATGATG AGAATCCTTC CCTTTCGATT GATCGAACGA GAAAAATGTA CAAGTGTTTT TCATGCGGAG CCGGGGGAGA TGCTTTCCGT TTTGTTCGTG AATATAGTAA ACTAAAAGGC CAGCAAATGT CGTTTTATGA GGCCGTTCTC GAAGTCAGCA CGAAGTTTGG CGACGGCCAT GTCGAAGGGA TCGGAAAGTC TGTCCAAGCG GAAAGCCCCG AGCTTCGACA GCGCAAAGAA CGCACACTGT ACGCGAACGC CGTTGCGGCA GCATTCTACG CAACTTGCCT GACAAAGCCC TCCGCAGGAG GTGCCCGGCA GTACCTCCGG CAGCGCGGGT TGACTCCAGA AGCTGTTCGA ACGTTCGCCT TGGGATTTGC TCCGGATTTC TATTACGGTG CCAACCGATC GCGAAAGAAA TGGGGAGAGG ATAGTTTGGT ACAGCATATG CAAAGTCTTA ACTTCACTGT AGATGAGCTT TTGGAAGCTG GATTAGCGAC TGAGACAAAG ACGACGGAAG AACCTGTCCA AAAGTTTTCC TTTGGAGAAG GTGAGCTTTT ATTTTTCAAC TGTGAATGTC TACTTTTATC TTTTGCTAAC CTTACAAAGA TCCATTACTA GAAGGCGCTT TTACTTCAAA ACCAACAAGT ACAAGTTGCC CTTTCGATTC AATCATCGAT CGGTTTCGTT TTCGAATTGT TGTTCCGATA ATGGACAAGG CGGGAGCGAA CGTTTTGGGA TTTGGAGGGC GCATCTTGCC TTCGATCGTA GAAATGCCCA ACGCCTACAA TCCTCCGAAG TACTTGAACA GTCCGGAGTC ACTTGTTTTT AAGAAAAAGT GCATTCTATT TGGTCACAGT TTGGCTAAAG AGACTGTGAA GCAACCAAAA AAAAGCGAGC ACGAGCAACT GGCGAACACT TTGATTCTTG TCGAAGGATA TATGGATGTT ATGTCTCTTT GGACAATCGG CGTTAGAAAC GTGGTAGCAG CCATGGGAAC CGCTGTCACT ATGGACCAAC TGGCTATCGC AGCAAAAAGC ATTCGAAATG GAAACTTAGT GCTGTGTCTG GATAATGACA GCGCTGGATT ATTGGCGCTC GAAAGGCTTT GTTCGAATGG ATTACTTTCG CGGATCGTAT CGAAGTACGG GACTGAGATT AGCATTGCTT TACTTCCAAG CGGAATCAAA GACCCAGGGG AGTTCATAGA ATCCAAAGCT GAAGCGGCCT CAACAACCAT TGCCGATGCC TTTCAAACCG AGGTACTTTC TAAATCGCAA GATTGGATTG ACTGGTATTT GCAGCAATTG CTGGAGAGCT ATGATTCCAA GGCCGGTAGA GGTAGGGCCG GTAGTTTCGG CGATGTTTTT GAACGTGTTG CAAATTTTTT GGCCAACAAT ATGGGTCCAG CGGATCGAAC GAAACATGCG TGCGAAATTG 
CCGTGTCTCT TGCTAAGACA ACTGCAAACG AAATGGGCTC TGATCACGCA TCAAGCGCGG TCCAAAACCA GTTAGAGTCA GATCTTATTG ATCTCTCTTC TCGTTTAGCG GAGAAAAAGA GAGCCATTCA GAGGCGGACT GAGTTGGTGA ATTCAGATGG TAATGTAGGA TCACAGAAAG ATGCTCTTTT TGCGTTAACT AGAGGGAGCG GGCCAAGTGC GGATGAAAGT GATAAGCTTT CAAGTAGTGC GTCAAAAGGT AGCGATCTGC TTTCAGTTGC TCTAAACGAC GTAGATTTGC TAGACAAGTT TCCCGCACCC TCTTTTGAAA GAGCATCTCT AACCAGGAGA AAACGGACAC ATGATGCGGC AATTTCCAGA ACTCTGAACA AGGCTTTGAC GCCACATTTC TCAGGTTTTC GGTTTATGTA CAAGAGTGAC TCACAGTGGC TGGGTGTCGA TGAAGATAAG GTTTGTTCTG CATCTTGATG AGGATTCTTG TCTACAAATT TCTTTTTCTC ATATCTTTCA TTGCAGCTAA AAGGCGGCGA ATTGACCTTG GGCTATATCA AGAATACCTG GCGAAAGAAA GAAAAGAGTA TTTATTTCAA TTCTAACGAT TATCATGGCC ATCAATTTCT CACTGAAGAT GCTATGGACG CAGGTTATGT CAACCGCAAT GTTAGACGTG ACCCTTCGTT TGTCGAAAAG GGCGTCGCCT GTTTAGTCTA CCGAGACACG GAGCTGATGC TGAAGACTGC AGAAGATAAC ATGCTGACTA CGCTTGTGGA CTGCCCATCA GCGCGCACAG TATTGAAGAA TATGCTTGAT GCACGCTCGG CTACTGGGTC CAGCAATCTA GTTGAGTGGA CCAATACCGA AAAAGAATGG CTCTTTTCTA CTTTGGTATA TTCCTCTTCT TCCATCCCAG AAGGCTGCAC CAATCGAACT GAATTGATGC GCTTTCTTAA ACGTATTCCT GACTGTCCAC CTCAGGCTTT TATCCATACC ACGGTGTCCG CAAATCGCAA CAGCAATTCA AAAAAGGACT ATACCAAGAG TCTTCCTGAA TCTTCAAATG AGGTGATTGG CGGGAAAGAA ATTTATGGAA GATTTGATAC CTACTCTAAA GTAGGGAACG GGACTTTATC CCTCTTCTTC AACGGAGTTC GAAACGATCT GGATAGCGCA GCCTCTTTGA ACGTAACCCG AGTAAACGTT CTTACACAAG AGCGCTGGGC CGCTGTACTG TGGGCATCCA CGGCACATCA AGCAAGACAA ATTCGAGAAA AGTTGCTATC GCTCTCGAAT GCTATGGAAG GTCGTACGGC GACTGGCATC AAAAACGTCC GTTTTGCTGT ATCACCAGAA GTTACGCTCA GCTCAGACCA TCCAGTAGAA GGCGACGGAA ATCCTTGTGA CTCAACAAGG CTCCGTGACC TCACCATTGC TCTGCAAAAC AAGAATCGTG TTCTTCAAAC ACTTTCTGAT TCTTCAAAGC GCCTTTGGAC GAAGCTGGTT GATGAGACTC TGTCTGACGG TATAGAGGGA CATGTTTCGG CATCCCTTCA AATGGACCTA TCAGTCAGGT TAGACGAATA TTTAAACGCA TTCATCGACG TCCCTTCGCA GAAAGTGGAA ACGACACAGA AACTGGAAAC GATATTGTCG GGCTTAGAAG ATGAAGAGCC TTACGAAGAT ACACTTGAGC GAATAGCGAA AGATTGGGGT GAATGGGCAG ACGACGATTA TTTATGGACA ATGGATGACG CCATTAGCAA GACGAATAAA AAGCAGGTTG CTTCCCCAGA 
CTTAATTTCA GCAATTACCG AAGACGAAGA TGACGAAAAT GTGGAAGATG CACTACAAAG GATTTCTCGA GATTGGGCTG AGTGGGATGA GTGA
|
Protein sequence | MQVFFFLLLE LSSVAGFTGL WSRSSGWRNQ AHTSYSSWLR VVALSPDADG TTDQDDFSLL RNATNNKAPV RSNEIGNGRI DFNQLSEMKE SIDIISVIES YNLDRFERKS DDRATALCPF HDDENPSLSI DRTRKMYKCF SCGAGGDAFR FVREYSKLKG QQMSFYEAVL EVSTKFGDGH VEGIGKSVQA ESPELRQRKE RTLYANAVAA AFYATCLTKP SAGGARQYLR QRGLTPEAVR TFALGFAPDF YYGANRSRKK WGEDSLVQHM QSLNFTVDEL LEAGLATETK TTEEPVQKFS FGEDPLLEGA FTSKPTSTSC PFDSIIDRFR FRIVVPIMDK AGANVLGFGG RILPSIVEMP NAYNPPKYLN SPESLVFKKK CILFGHSLAK ETVKQPKKSE HEQLANTLIL VEGYMDVMSL WTIGVRNVVA AMGTAVTMDQ LAIAAKSIRN GNLVLCLDND SAGLLALERL CSNGLLSRIV SKYGTEISIA LLPSGIKDPG EFIESKAEAA STTIADAFQT EVLSKSQDWI DWYLQQLLES YDSKAGRGRA GSFGDVFERV ANFLANNMGP ADRTKHACEI AVSLAKTTAN EMGSDHASSA VQNQLESDLI DLSSRLAEKK RAIQRRTELV NSDGNVGSQK DALFALTRGS GPSADESDKL SSSASKGSDL LSVALNDVDL LDKFPAPSFE RASLTRRKRT HDAAISRTLN KALTPHFSGF RFMYKSDSQW LGVDEDKLKG GELTLGYIKN TWRKKEKSIY FNSNDYHGHQ FLTEDAMDAG YVNRNVRRDP SFVEKGVACL VYRDTELMLK TAEDNMLTTL VDCPSARTVL KNMLDARSAT GSSNLVEWTN TEKEWLFSTL VYSSSSIPEG CTNRTELMRF LKRIPDCPPQ AFIHTTVSAN RNSNSKKDYT KSLPESSNEV IGGKEIYGRF DTYSKVGNGT LSLFFNGVRN DLDSAASLNV TRVNVLTQER WAAVLWASTA HQARQIREKL LSLSNAMEGR TATGIKNVRF AVSPEVTLSS DHPVEGDGNP CDSTRLRDLT IALQNKNRVL QTLSDSSKRL WTKLVDETLS DGIEGHVSAS LQMDLSVRLD EYLNAFIDVP SQKVETTQKL ETILSGLEDE EPYEDTLERI AKDWGEWADD DYLWTMDDAI SKTNKKQVAS PDLISAITED EDDENVEDAL QRISRDWAEW DE
|
| |
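
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the non-coding estimate assumes the gene span contains only coding exons, introns, and one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def implied_noncoding_bp(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases in the gene span not accounted for by the coding sequence.

    Assumes the span holds only coding exons, introns, and one stop codon,
    so the coding part is (protein length + 1) codons.
    """
    coding_bp = (protein_len_aa + 1) * 3
    return gene_len_bp - coding_bp


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence, ignoring spaces and non-ACGT symbols."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1249962, 1253665)      # Start bp, End bp from the record
    print(length)                               # matches the Gene Length field
    print(implied_noncoding_bp(length, 1192))   # bases left over for introns
    print(gc_percent("ATGCAAGTAT TCTTCTTTCT"))  # first two blocks of the gene sequence
```

Under these assumptions, the Start bp and End bp values reproduce the listed gene length of 3704 bp, and the difference from the coding length is consistent with the record's "Is gene spliced | Yes" entry. Passing the full gene sequence to `gc_percent` would allow the same kind of check against the GC content field.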