Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43587 |
Symbol | |
ID | 7197316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 890909 |
End bp | 893860 |
Gene Length | 2952 bp |
Protein Length | 977 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177710 |
Protein GI | 219111917 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTAA TGGTACAACA ACAGGGTCAA AATCGGAGAC GGATCATCTC GTCCACACAT CATCTAACTA TAGTATCGAA CACTTCAGAT ACGCCTCCCA CTCTCCTTCG ACAGACGGCT TGCTTCATGA TTCCAACAAT TTCGGCAGGG TTACGATGGT CGTCGTGTGC CGTTGTCGTC GTCGGAGTTG TCGCCCTTCT GAGCTTTTCC CCCTCGACGC TAGCGCTGGT AGTTCCACAG GCTCCGGGAA GACCACGGTC TGGGACGACA AGTACAGGGT TCGGCAAAAG TAGTAGCCAA TCGATTGTGC CGCGACCAAG CCACTGCCGT ACTCGCCGTA CCGCGACGTG GATGGTTTTG ACTCCTCCAT CCCAACGGCA ACCAACGCCA TCCGTATCCC AATCATCACG ACCGTCGCGA TCCACCACGT TATCGACGAC GTCCCCGCCG CAAGGTACAC CCGTTCAATC CGCATCGGAG AATACTGATT CCACCAGTGG GTCTACGGGG ACTGGACAAG CTCGCCCAAA GAGCGCCACA CGCAGTAGCT GCAGCAGTAG CAGTAGTACG CGATTACACG CCCGTAACAA TAATCTGCTC AAGCGTAAAA ATTACCAGCG CAATTTCATT CCGGAATCAG AACGTGAAGT GAAAACACTG CGTCGATCGC GCCAGGCCAA GTATGAGCAG CTGAAGTCGG GCTCTGATGT GGCACCCAAT ATCTGGAGTT TCGAAAGTCT TTTTCCGGAT CCGGTCTGGG ACGAGACCTC CGTGCAACGA GACTTGTACC AAATTAAAGA GAGGGACGAG AAACAAAAGC AACGGCTCGC CTTTCAAAAA AATGCCACCG ATGGCATCAA CAGCAACAAC ATCCGACAGA AAAAGGATCC TGTGGTAGAT TCTACAAAGC TGCGCGAGAA GCGCGATAGC CTCGTCAATC CAAAAATGCG ATCACCCTCA TACGGTGGAA GCTCCGTCAT GCGATTGTGG CGCGAGCCCA AGCTCAGCTC TCTCGAATCA CCTTCAATGG AAAGCAACCG TGCTCTTCCT ACCCGCAAGA ACACGATTCC GAACAGCATA AGCAAAACTA GTTCGGAGAG TATTGCAAAC GTTCGTCCGA ACGAGAAACT GCACAACATG TCCAACGTCC TCACCATGAC AACAAGCTCG AACAGCAACA ACAAAACCAA GCCAGCGAAA GTCGACGCCG ATTTAACGCG TCTAGTTCGG GATCGTATTT TCGGCTATCG CCGAACCAAA ACCGGTCAAC TCCAGTACGA CACATCGCTT ATGGGAGACG GAGCCGTACA GTTTCGCGAC GGGGTCCGCC TCAGCAATCC TTTGCGAGTC AACGCGGATC GACTCAATTA CCTCGCCAAA AAGGAATTAC AACACGGTCG GGTGGAAGAA GCCCAAGAAC TGTATACGAT TGCCCTGCAA ATCGATCCCC GCGATGGGCG CGCTTACCTC GGGATGAGTC GCTGTGCAAG CCGGCGGCGG GACTTTAAAC TCGCCAAAGT CTGGTTGCAA ACGGGCATTT CCAATGCCGT GTCCGTTAAC GAAAACACCA TGCAAGCTGA TCGTGGCGCC AACCCGTTCT TACTGCAGGC GCTCGGCTGC TTGGAAGAAA ATTCGGGACG ACTTTCCGAG GCGGAGGCCT TGTATATTGC GGCGGCCAAA TCAAGACCTA CTCATGCAGC TGCATGGGTC AGTCTGGGGC AGTTAAGAAT CCGCAAATTG GGACAATCCG CTAACGCTGG GCGAGTTTGT TTTCAATCCG CGGAACGAGA ATGGCAACGA GCATCGCTAC CCCCGTCAGC ACACGTTTAC ACGGCCTGGG CGGCCTTGGA ATGCGAAGCA AACGACATAC GGCGGGCCCG CCAACTATAC AAGGCTGCCT TAGATGTTGA CCCAAGAAGT TCCGTGGCCT GGTTGCAGCT CGGTGTCATG GAAGCAGATG AGGAGAACTG GAACGAAGCT GAAACTTGCT TTGAAACAGC GTTAAAATTT GATCGTCGGA ATTCGCGACT GCTGCAAGCA TACGCACTCA TGGAAACGAA ACGGCCTAAC GGAAACAGTC GGAAGGCGAT TGGATTGCTA GAGCGTGCCC TCAAGGCGAA TCCCAGAGAC GCCGGTGTAC TGCAAGCTTA CGCTTTGTAC GTTGCCGAAC TGGGCGACGT GGACGCCGCT CGCGATTTGC TACGACGAGG GGCCGAAGCC AACAAGCGCC ACGCCCCGGT CTGGCAGGCC TGGGCGGTAC TAGAAACGCG CCATGGAAAC GTTCAGGAAG CCCGCTCAAT TTTTCAAGAG GGCATTTGGG CTTGCGCGCA ATTGACGGGT GGCCAGTCGG GTGGCTACCG GTGCGCCCGA CTGTGGCAGG CCTGGGGCGT GTTAGAGGCC AGAGAAGGCG ACGCTGCCGC GGCTAGAAGA TGTTTTTCGC GGGCCCTGGA TGCCGATAGT CGTAACGTAG CGGCAGTCAC AGCCTGGGCC TTGATGGAGG AAGAGTTTGG CAACGTTCGG GACGCCCGAG CTATTTATGA ACGATCGCTG CGGCTGTTCG CTGCTGGCAG TGGTGAGAAA ACATCAATAT GGAGAAACTA CGAACTCATG GAACAGCGGC TTGGTCACGT GGCGGCCGCC CAAAACGTCT ATCAGCGGTC CATGCGGGAA GCAATTACCG TCTCGGATGA AATCGCCGAC AATATTGTGG GCCTGTCGGC TAAGAGTACA ACTCCCCTCC CGGACTTGAC AAACGTACTG AGTAGATCGT CGGACGAAGT GGAAGTTTTA CGATGGGAAG GCCAATCAAA ATCGAGCTTG GGTGGCGAAG TTTGGCTCAA CGACCGGGCT ATTGAAGGCA AGGTACCATT TGACATGAAG ACGAACCAAC GACGGAACAA GAAAACCGAT AAAAAATACA ATCAAACCCC GTAGAAAAGG TTGATAGATG TG
|
Protein sequence | MALMVQQQGQ NRRRIISSTH HLTIVSNTSD TPPTLLRQTA CFMIPTISAG LRWSSCAVVV VGVVALLSFS PSTLALVVPQ APGRPRSGTT STGFGKSSSQ SIVPRPSHCR TRRTATWMVL TPPSQRQPTP SVSQSSRPSR STTLSTTSPP QGTPVQSASE NTDSTSGSTG TGQARPKSAT RSSCSSSSST RLHARNNNLL KRKNYQRNFI PESEREVKTL RRSRQAKYEQ LKSGSDVAPN IWSFESLFPD PVWDETSVQR DLYQIKERDE KQKQRLAFQK NATDGINSNN IRQKKDPVVD STKLREKRDS LVNPKMRSPS YGGSSVMRLW REPKLSSLES PSMESNRALP TRKNTIPNSI SKTSSESIAN VRPNEKLHNM SNVLTMTTSS NSNNKTKPAK VDADLTRLVR DRIFGYRRTK TGQLQYDTSL MGDGAVQFRD GVRLSNPLRV NADRLNYLAK KELQHGRVEE AQELYTIALQ IDPRDGRAYL GMSRCASRRR DFKLAKVWLQ TGISNAVSVN ENTMQADRGA NPFLLQALGC LEENSGRLSE AEALYIAAAK SRPTHAAAWV SLGQLRIRKL GQSANAGRVC FQSAEREWQR ASLPPSAHVY TAWAALECEA NDIRRARQLY KAALDVDPRS SVAWLQLGVM EADEENWNEA ETCFETALKF DRRNSRLLQA YALMETKRPN GNSRKAIGLL ERALKANPRD AGVLQAYALY VAELGDVDAA RDLLRRGAEA NKRHAPVWQA WAVLETRHGN VQEARSIFQE GIWACAQLTG GQSGGYRCAR LWQAWGVLEA REGDAAAARR CFSRALDADS RNVAAVTAWA LMEEEFGNVR DARAIYERSL RLFAAGSGEK TSIWRNYELM EQRLGHVAAA QNVYQRSMRE AITVSDEIAD NIVGLSAKST TPLPDLTNVL SRSSDEVEVL RWEGQSKSSL GGEVWLNDRA IEGKVPFDMK TNQRRNKKTD KKYNQTP
|
| |