Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43922 |
Symbol | |
ID | 7204319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 436146 |
End bp | 439609 |
Gene Length | 3464 bp |
Protein Length | 973 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186344 |
Protein GI | 219113521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.150894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAATGGGA TGTTTTTGTC AAAGTCACAA AGTCTTCGAA TTCTTTCCGG GTTTCAAAGC TCTTGTCGAG ACACGTTGGT ATTTGTTTTT GGCTGGCGTC CTTCCACATG TTGGCTAGCT CCGTATTTGT TTACTAATAG TAACAGCCAA GGCGGGCGGG CCCTTACGCG GCTTGGGCGG CTCTCCGCGG TTGCAAGTAG TTACTGTTAG CTGTCGGCAG TGTAACTGCT ATCATCTTTT TGCTGGGGAT ATTCCTTGCG TATTGCTCTT GTTTAGTGGT CGGAACCCTC ACCGATTGTC GACAGATTGT TAATCGACCA TGTCTACCGA ACCGTTCGGC ACCACACCGC TCCCGACGTG CGTGTCGGGA AGCCCAGTGA AAGTCTGGTT CGATGTTAAT CCTCTGTTGG CGCGGGCGCA GCAGCCCCAA ACCCAACAAG TGGCGAATTT CGAAGCCTTA TCGAACGAAG CTTGTATCAA CGGCTGCTTG CTGGATGTCA ATCAGCATTT CATTGTGTAC GGGATCAAGA ACGGCCTTAT CCGCGTCTTC CAACGCCATA CGGTGCTCCG ATCGTTGTTG CGCGGTCACG AGGGACAGAA TCTGACCGAC ATGCATTTTT TCCAAAACGG CGACGTCCTC GCCTCGGCGG CGTCCAACGC CCAATCTTCC ACGGTGCTCG TTTGGAGAGT CTTTGGGCGC TCTCCGGAAA TCATGTCGGA AAAATTGTTG GAAATTTCGA CGCCGCATTT CACTATACAA CGAGTCGTCT GGCACCCCTT CAATCCGAAT CAGTTTTGGA TGCTGCATAC GAATGCTGCC AACCACATGG TGGCGACGCT GGTGGAAACC ACGCGGATCG CAACACAGCC TCATCCCGTC GAAGGACACG CCGTGTGCAA CTTTCACGAT GCGCACATTA TTATGGACGG TGCGGTACAG ATTAGTGCTG ATTGTGCTTC GGGATCCGGT GCCTCTTTGA CCGATTTGAC CTGGTCCAAT CGGGACACGC GACATATTTT GACGTGTCAC GATTCTGGGG AAATTGTCTT GTGGGATTTG AAAACGCTGT CGTCCTCGTC GGCTACCCCT GGTACCGTGA CTCCGGCTCG ATTAGCAACG TTGCGTATGG ACGAACCAGT CTCAAGGGGT CTGTTTTTGC CACACGAGGA CGTCCTTGTA TCGGACAATC GTAGCCAGGA GGCCAAACTG ACCACTTGTT TTGTCACAGC CAGTGATAAG AACGGAACGA TCACTGTATG GAGCCCATTC GAGAGTTCGG GAGCGCTGCC GCAAAAAATA CAAATCTTGG CGGTGGAGAA TCCCAGTCCC AGCTACGTTT TGGATGTTTG CTCGGGACCC GCCCCGGTCA ACGCATCCCC GCCCTCGGCT TTTGTGGTGA TGGCTGATCG CCACAGTGGG GCGATTCTAG CGTGGCATTT GCGGGCAGAT TGGAACGATA CCGTTCCGAA AAAGGCTTTG CTGAAGGGTT GTGACTACGT GGTGCCATTT CTAACAAAAT TTCCAACCTA TTCTTGGAGT GTAGTGTGTG CGCCTGCGAC CAACATTTCC GACGAGGAAC TGTCGGACCA GGGTGGATTG GTCTTTGACG TAGAACTCTT TGCCTACCAG ACTACCGCGG TGCAGCGTTT GAAATTGACT TCTTACATGT GTCTGCCGCC GGAAACCTCG TGGACGGATC CAACGCCGAG TGTGCGGTTG GAGCGGCTAG TGTCCGCTCA GTCGGCGCAC GTTTCCGAAA TCGGCTCGGA CGATGCCAAT CCGGATGTTG AATTCGACGA AGCTTACGAT TTGGAGGAGG ATGACGAGGA AGAAGAAATT GAGGCGCCGG ACCCCTCGTC GCTACCCTCG CCGTTGGGTA TAGGCAATTC TACGCCGTCG TTGTCGAACA ATCCTTTCGC TAACTGGTTG GGTACGATTG CGTCGAAAAC GACTACTTCT GTACCACCAG CTGTAGTCGC AGCCCCTGCT CACGTACCTC CACCGGCAAG CTCGTTGCCC ACACCGCCTC CCGATCAACC GAAAAAAATC GTCTTGTCGA AACACGATTT GGAGGATCCA AAAAAGGTGG AACCTCAAAA CGCGGCTCCA GTGATGACCA CTAAAGGCCC CAACAATATC AGCAACGCTG ACTCTAAGAA AAAGAAAAAA GTAAAGGCAA CACCTGTCCC CCCGTCAGCT CCGGAAGTGG GGAAGGTTTC CATTCTCAGA AGGGATGACG AGGTGAAGCC ATCACTATTG CTTGATAGCG GTGCGAATAT ACCGCCATCT CCAACAAATC CAATCGAGGC CAGTATGGAC ACCAAATCCA TTGCAGAAGA TATTCGAAAG GTTGTACACC ACGAAATGCG CTCAACTCTC GTTCCCGCCC TCAAGCAAGC TGTTCAAGAA TCCTTGAACA CTTCCGTAAT CAATCCTATC CAAGCATCAA TCACTCAACT GTCCAAGCAA GTGGTGATGA ACGACAACAT GGAATCCGCC TTATCGGGAT CAGTTGAAGA GCCTCTTCAG GCCGCTGTTG CGAACACTAT GCGAACGGTG TTGATTCCAA CAATGGAGTC AATCACGAAT CAGGTCTTCG TACGGGTATC TGAAAGTCTG GAACGAACGG CAGCGACTAC ATCAACTGAT TCAAAAAAGG AACTTGAGGC TATCTCTTTA CAGCTCACGA CAATGACAGC TCTGGTTGCC GAGCTTACAA ACGAAGTGCA AAGTCTACGC AAACTGGTTC GATCTAACCA AGCGCCTGTA CCACCAGCAC CTACGGCCCC GAGTCTACCT CCGATTAACC CCGTGGAGGC ACTGCGCAAG GAAATTGCCG CACTCATACA ACAGCGGCAG TACGAAGCCG CGTTCACGAA GGCTGTGTCA TCAAGTACAG CCGAATTGGC CGTTTTCGCT TGTACAAATT CGAATCTGAC TTCTGTGTTG GGCAGTGCAC GAGTAGAACT GAGTCAGCGC ATTTTAATTT GTCTGATGCA GCAGCTCAGT ACTGTCTTAA ATTGGCGTGA TGCAAGCTTA AATGTACCAC TCATCCTTGA ATGGCTCCAA GAAATTGCCT TGTCATTAGA TCCCAACGAT GATACCATTA AACGGCACAT TCCAACCGTT TTGCAACAAA TGGTGTCCAG CGTCAACAAT CGAATGTCCT TGGACGAGCC TGTTCTTAGG CGACCTTTAC AGAAACTACT TCAGATTCTT CGGGGGATGT CTATATCATA AGAGCGCCAA AGGATTGTAA GACCGCAATG AAAATCGAGA ATATGCATCT CTTTGTGAAT TTTCTGAGCT CCTTCCGTGG CTTTAGCATT TTTTACGAGA ATACGTACAC TGAAGTCACG TCATAAGTAT TGGCCTCTTC CTCAACAAGC TATCAAAAAA GCAATGATGG AGTAGCTTTA GGGGAATTCT CTAGAAAACA GTAGGTCATG CCGTGATTGA TGGT
|
Protein sequence | MSTEPFGTTP LPTCVSGSPV KVWFDVNPLL ARAQQPQTQQ VANFEALSNE ACINGCLLDV NQHFIVYGIK NGLIRVFQRH TVLRSLLRGH EGQNLTDMHF FQNGDVLASA ASNAQSSTVL VWRVFGRSPE IMSEKLLEIS TPHFTIQRVV WHPFNPNQFW MLHTNAANHM VATLVETTRI ATQPHPVEGH AVCNFHDAHI IMDGAVQISA DCASGSGASL TDLTWSNRDT RHILTCHDSG EIVLWDLKTL SSSSATPGTV TPARLATLRM DEPVSRGLFL PHEDVLVSDN RSQEAKLTTC FVTASDKNGT ITVWSPFESS GALPQKIQIL AVENPSPSYV LDVCSGPAPV NASPPSAFVV MADRHSGAIL AWHLRADWND TVPKKALLKG CDYVVPFLTK FPTYSWSVVC APATNISDEE LSDQGGLVFD VELFAYQTTA VQRLKLTSYM CLPPETSWTD PTPSVRLERL VSAQSAHVSE IGSDDANPDV EFDEAYDLEE DDEEEEIEAP DPSSLPSPLG IGNSTPSLSN NPFANWLGTI ASKTTTSVPP AVVAAPAHVP PPASSLPTPP PDQPKKIVLS KHDLEDPKKV EPQNAAPVMT TKGPNNISNA DSKKKKKVKA TPVPPSAPEV GKVSILRRDD EVKPSLLLDS GANIPPSPTN PIEASMDTKS IAEDIRKVVH HEMRSTLVPA LKQAVQESLN TSVINPIQAS ITQLSKQVVM NDNMESALSG SVEEPLQAAV ANTMRTVLIP TMESITNQVF VRVSESLERT AATTSTDSKK ELEAISLQLT TMTALVAELT NEVQSLRKLV RSNQAPVPPA PTAPSLPPIN PVEALRKEIA ALIQQRQYEA AFTKAVSSST AELAVFACTN SNLTSVLGSA RVELSQRILI CLMQQLSTVL NWRDASLNVP LILEWLQEIA LSLDPNDDTI KRHIPTVLQQ MVSSVNNRMS LDEPVLRRPL QKLLQILRGM SIS
|
| |