Gene Information |
Locus tag | PHATRDRAFT_48916 |
Symbol | |
ID | 7195344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 5989 |
End bp | 9036 |
Gene Length | 3048 bp |
Protein Length | 944 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183654 |
Protein GI | 219126835 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0642733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAGAC CACGAAATTT TTGTTTGTAC GGCATTCCTT TGGTTGCTGC TTCCGTGTTC TCCATTTTCT ATTCATTCTA CGCTTTTGAC TATCACAACG ACTTTTGCTT GGAAGCACAG AAGGCCGTGA CGGTGCCAGC CATATCTACC AGTGAGGTAC TATCGTCACC ACGCCAAATG GCCAAGTCAG CATCTTTCCA AGGCGAATGC CGCTCGCTAG CCAACGGTGG TCCAGTTAGT CTCGTTTGGG CATGGAATGA TCCACCCTTG TTTCACGCCA TGAAGACCAC AATAGATGGG CTTACATCAG GGTTGTCGGG TATTCGAGTA GTTATATACT GTGGCTCAAC CGCATGTGTT TCGGCAGCAC ACAAGGCTGT CCTTGAACTC CCACCACAGG CGACGTTGAT GTCATCCTGT ATCAGCATCC AGTACATAAT TGCCCCCCAG TTGGCCGAAG ACTCACCATT TGAAGAATGG ATTGGAGATC ATGTCTTGGC AAAGCTCTTG TCGGCAATTT ACTTTGAACA GACCCTTCAA GTTGTGATGC AGCTGACTGT GATCTGGAAA TATGGTGGTA TGGTGCTCTT GCCGGGGTTC AACGTGGCGT TTCCCTCTAT GCTCGAACTG ACGGCTAAAG ATGGCGCCAC ACTCATGACG GCTGCTGACA TGGGCCTAGT CAAGCCAGGA GTTGGCGGGG GTCTCTATGC AGTAGCGGCG CCACCGAAAG ATGCAATGAT CAAGTCTCTT ATGGAGGAGA TGCTAGTAAT GTACAAATGG CCAAATTACG TTGCATCGGA CTGGCCGGTA CGAAGCCAAT GGGACATCCT GTGTGCACGA CAGAGTAGCT GTCTTGCTGC TCAGCGACCC CTACAAGATA TAGGAATAAG AACAGGAAAT GAGTCTACAG TGGAGCATCA TCCTAAGAGA CACTTTGGAA CACTGAGCTA CCAGGCAAGA AGACATGGGA CAAAGGCATA CCCTAGCATG AATCAAGGCG ACGAGATGCA AGGGTTGGCG GGTTTGCAAT TTCTTCCGCG CCTCGACGCT TTTGTGGACA GAGACCGGCT TGACGTGGTG AAATTGATTG ACTCCGCCGA CTTTTCCGCC ACAGGTGAAG TCACCCCTCA ATCCCCGTCT CAACCTGCTT TGACACAAAC AACACTGTTT CTAAATGCTT GGTGGGGGAT TCCAAATTGG GTCTGGCCGC CACCTGAAAA GCTTGAGCCT ATCTTTGTCT CAATGCATCT CAACAATAAC AAAAACAAAG ATGACGTAAA GAGATCAAAA GGATATCTCA ACCAAAATTG GCCTATTGGA GCACGAGACT CAAAGACGCA CGATTTTTTT CAGTCCATCG GTATCCACTC AGTTTTTACG GCGTGCATGA CAATGACTCT TTTGCCAACT TGGAGTGAGC AACGTGCTTT GATGGAACAA AGCGATGAAG TGTTGCTTGT AGACGTGAAT AGAGAAGGTC TTCAGCTGCT ACCAGACCAC ATCAAGTCAA GAGCCGTGAC CCTGTCAGCA AAGCTTAAAG ACCCAAATGT TATAGATGAC ATGGTGGCAC GATACGTTGA AGCTCATGCT ATGAAGGTTC GCTTGCAAAA AGCCAAGTTG GTCATCACCC AGCGGCTGCA CATTGCGTTG CCAGCTGCTT CCACAGGAAC GCCCGTGATT TTGATCATTG ACAATGATAT GCCTGGGGGT GGAGGTGATC GCTTCAGTGG TCTGCAGCAG GCTGTACACA CTGTACACTC TACAAATGGG TCCACAGCCT TGGCTTTATT CAATTGGGAT 
GATCCACCAC CCAATCCCAA CCCAATATTT TTCCGAAAGA AACGTAACGT CCTTCGCGTC TTGACTATGT GCCATGGGGA AGTGACCGAC TCGGCACGAA AATTTGGTGC AATTCCAGCT TCATGGGAGT ACCCGTCTGA AACCAAAGTG TGCAGGAATA CAATTGGCAA CTTGCACACA GAAGATGCTA TTCATATTGC AACTACAATT AATCCTTTAT GGTTGGACTC CAAGCATGTT CTCCCCAGCT GGGTTCATGC ACTGTACAAG TCAAACCCTA CGGAAACATT TGTGTTCTAT TTCCTTACAG ACAGAATGAA TGAAAAGCAG CGATGTATAG TCCGATGGAT GGTGCTTCAA TGGTTTCCCA ATGCAAAGGT TTACACAATA CCAATACAGC TGCCATCTGT GGACATTTCT TCCATCCCTA TAAAACATGT TCCTACATTC TCTCAAGTCC GGCTTCTCTT GCCCCAAATG CTTCCCTGTG TGCAGCGAAC TTTGTGGATT GATGTTGATG CCATGGTGAT AAAGCAACTT AGACCAATTT GGGACACCTG GAAGGTCATG CCAGAATGTG GTATAGTTGC CAGGAGCTTA TTGGCAAAGA CAGACGTGGG ATCTATGATG GCAGCCCTGA ATGTAACATC TCCCCAGCAG CTGTGGAAGA AAGCCAGCAA AGACATGCCA GGATTCGATG CTGGAGTGAT GCTTCTAGAC CTTGATGCAT TGCGCGCCAG CCACTTCACA GAAAAGGTGG CATCGTACTG GTCCTTTTCA ATTGGCGGAA ATGATCAAAT TTCCTTGAAT ATGCAATGCA ATGGGACCCA TGGCAACCTT GACTCAGTTT GGAATGTATT CATGGACTCT CCAGACGACT ATGTGCACAA CCGGACAAGA GAGTGGAGCA TTGTTCACTT TCAAGGCTTG AACAAGCCTT GGCTTGTAAA GAGTGATTTG TTTCATGGTA GAGTATGGGC CAAATACGCC CTTTCGCTTG TTGATGCTCT CTATGGACCA ATCCAATTGC AGTAATTAGC AGTGCTTGTA ACTGTTTGTT GCACAAGTTC TTTCCATGTT GCTCTTGTGG GAAAGATCTC TTATCCAAGC CGTCTGCAGA AGTTGCGTTG TTTTCCGGAA GTACTCACAA GCACTTGGCA CATGTTGTAG AGGCAAACTC TGAAAAAATG CACTACATGA ACAACACACA GTCTTGTCTA ATCTTGGTCT AGTTTTGCCT GCATTTTG
|
Protein sequence | MPRPRNFCLY GIPLVAASVF SIFYSFYAFD YHNDFCLEAQ KAVTVPAIST SEVLSSPRQM AKSASFQGEC RSLANGGPVS LVWAWNDPPL FHAMKTTIDG LTSGLSGIRV VIYCGSTACV SAAHKAVLEL PPQATLMSSC ISIQYIIAPQ LAEDSPFEEW IGDHVLAKLL SAIYFEQTLQ VVMQLTVIWK YGGMVLLPGF NVAFPSMLEL TAKDGATLMT AADMGLVKPG VGGGLYAVAA PPKDAMIKSL MEEMLVMYKW PNYVASDWPV RSQWDILCAR QSSCLAAQRP LQDIGIRTGN ESTVEHHPKR HFGTLSYQAR RHGTKAYPSM NQGDEMQGLA GLQFLPRLDA FVDRDRLDVV KLIDSADFSA TGEVTPQSPS QPALTQTTLF LNAWWGIPNW VWPPPEKLEP IFVSMHLNNN KNKDDVKRSK GYLNQNWPIG ARDSKTHDFF QSIGIHSVFT ACMTMTLLPT WSEQRALMEQ SDEVLLVDVN REGLQLLPDH IKSRAVTLSA KLKDPNVIDD MVARYVEAHA MKVRLQKAKL VITQRLHIAL PAASTGTPVI LIIDNDMPGG GGDRFSGLQQ AVHTVHSTNG STALALFNWD DPPPNPNPIF FRKKRNVLRV LTMCHGEVTD SARKFGAIPA SWEYPSETKV CRNTIGNLHT EDAIHIATTI NPLWLDSKHV LPSWVHALYK SNPTETFVFY FLTDRMNEKQ RCIVRWMVLQ WFPNAKVYTI PIQLPSVDIS SIPIKHVPTF SQVRLLLPQM LPCVQRTLWI DVDAMVIKQL RPIWDTWKVM PECGIVARSL LAKTDVGSMM AALNVTSPQQ LWKKASKDMP GFDAGVMLLD LDALRASHFT EKVASYWSFS IGGNDQISLN MQCNGTHGNL DSVWNVFMDS PDDYVHNRTR EWSIVHFQGL NKPWLVKSDL FHGRVWAKYA LSLVDALYGP IQLQ
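The fields above can be cross-checked against each other with a short script. The following is a minimal sketch, not part of the original record: the helper names `span_length` and `translate` are hypothetical, and because the "Translation table" field is blank, the standard genetic code (NCBI table 1) is assumed. It confirms that the Start bp and End bp coordinates give the listed Gene Length, and that the first 30 bases of the gene sequence translate to the first 10 residues of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the record leaves "Translation table" blank, so table 1 is a guess.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace.

    Stop codons are emitted as '*'; unrecognised codons as 'X'.
    A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Start bp and End bp from the record.
print(span_length(5989, 9036))  # 3048

# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGCCAAGAC CACGAAATTT TTGTTTGTAC"))  # MPRPRNFCLY
```

The 944-residue protein is shorter than the 1016 codons that fit in 3048 bp. The listed gene sequence continues past the codons for the final residues (IQLQ), which are followed by a TAA stop codon, so translating the full sequence with this sketch will not reproduce the protein field exactly.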