Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48109 |
Symbol | |
ID | 7203273 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 228276 |
End bp | 231225 |
Gene Length | 2950 bp |
Protein Length | 857 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182492 |
Protein GI | 219124400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCGCG ACACGGATTT CCATCCGAGC GAACCTCCGT GAGTCTCGAA CACGCGGTCC GAATCGGCCT GTGCACGGTA CTTACCAACC GTACGGTTAA TTCTCCCTTA CTGCTAGCTG ACACAACGAA CCCGACGCAG CCAAAGCAGA GAGAGAAGAC AGCCCCAAAC CGCGGCCTAT CGTAAAGTAG AGTACTCGTT CTATTGGACA GTGTCGGTGC GCAGCTCTCC GTGGGACGTC GGATACAACC GCAATCGCGC AATCATCCGT CCGGCCTACC CGTAAGTCGT CGAAGCACTC GACTGTTTGC CGACCTCACG AATCGTCTCG GAGAAATCAT CACCACAGAA CGAGCATTGA TGGAATCTGC AGTGTCCACG AGTTGTACAG CCTCAGTGTC GGACGCGGCT ATTTCCGGCG CTCGTGTTGT CCCGTCGAAT GGAGCACCGA TTCCGGCCGA AGCAACCCAC GGCTGGCACA ACCAGACGAG CCCTCTACTG ACCGTTTCGT CGCAAACGAG TAGTCCGACG ACGACGTCCA CACAAAGTAC GGGGACGTGT TCGTCTCCGT TGGATCTCTT GGCCGGCGTT TCGTCCACGG AACACGCCAA GCACCAAACA CCATCAACGA CCAAGGTCCG TCCACAGTCC ATCGGTGCGG CGATCAAGAA CACCAATCCG TCCGCAGACG CCGTTTCCAA GACAGAAGGC GTGAGCAATA GACACGTTGC TGCCGAGAAT CAAGCCACTG GAACGAGCGT GGATGTCCAC GAGTCGCAGC TACAAGGAGC CGAAACATTG GAACAATTCG AAACGAAAAG CGACGACGTT CCATCGCCTC ACCTTTCGTC CTTCCGAAAA GCAACAACAA CAACAACAAC AATAATTCGT CACAATACGC GCCACAAAGA AAAACGTCGT CCGGTGGGAT ACGTGGCTCC CAAAACGGAA CGCACCGTCA AACCAAAGGC TACGCCCAAA CCCCGCCGGA ATCGTACCCT TCAGCGAGCA AGCGGAAGTT TCCCACGCGC CTGTTTCACG CATCGACCCA GCTTGGCTTG CCTTCGCTCA CAGTCCTTGA CCGACTACGT TAGAGATGTG GTGCTTCCGA CTGCCAGCGC GTACGAACCG AGTACGGATC CCGACGACGA CGATGATTAC GACGATTTCA ATCGAATTCA CGAGTACCGT CCGCTGGAAT GGACTGAAGG TATGGCCAAA ATCACCCTAC CAGAAGGTTT TTGTACGCTT GATGGAATCG CACGCGACCG GACGGGAAGA GGACTGGATT GGCAAGCGGG TACACCCTTG GGCGACTACG TCATCCAAAC CCCGATTGAA CAAAACATAC GAGGCCTCGC TGGCGTATAC GAGTACACCT TTGCCGACAA GCCGCAGGTC ACAATTGCAA GTTTTCGTGA ACAGGCGGAC GCCTACCGTA AAGTACAAGT TGGCAGCGCT GTCGATGATG GCGAAAATGC GGACTCGGAC GAAGCCATGG ATAAGTTGGC CCGAAAATTC TGGCAGCGTC TTGGTCCGAC CATGCCCCCT GCCTGGTATG GAGCCGATCA AGAAGGAACA CTCTTTGGCG ACGATCCTGC ATCCGGCTGG TCGATTGCAA AACTCGACTC GTGTCTGCAC GTGCTTTCGA ATGTTCCCGG CGTCACTACC CCTTACCTAT ACGCTGGAAT GTGGGCGTCG GTCTTTTGCG CGCATACCGA AGACATGAAT TTACTAAGTA TTAATTACCT GCACGCCGGT GCACCCAAAA TTTGGTACGC CGTTGCGCCC GGAAAGGACG CAGATCGGTT TGCCGAGCTT TGTGCCTTTC AGTACAGTAT GGAGGCCCGC AAATGTAAGG AATTCATGCG CCACAAACGA TGCCTACTCA GTCCGAAAGT ACTACAAAAG GCAGGAATTC GCTATACAAC GGCGGTACAG CGACCAGGTG ATGCCATGAT TACTTTCCCG GGTGGTTATC ATTTCGGCTT CAACGTGGGG TTCAATCTGG CGGAAGCAAG TACGTATGGC ATTGTGACAA ACGTTTTTGG TTTTTTGATT TGGATCAGCT TACACAAAAT TGTACGCTAT TGATTTTTAG CAAATTTTGG GGTACCAGAG TGGATTCCCC TGGGTTTGCA AGCTCATGTA TGCTTATGTC GACCAGATTC GGTTCGAATC GACGTGGAAC GCTTAATTGC GCTCCTGAAA TTGTACCAAC AGGCTGAGAA GCGGGAGGTG GGTTTGTCGT GGAAAACTTG GAGTCAGCGG AGGGAGGAAA AATTGGCTCG ACGAGCACTG TCGGAGCGTC GACGCATGTC ATCGCCGCCT TCTAAGAAGA AAAAACGTTC GAAAGCGCCA CGGACAACTG AATTTTGGGT CGAAGTTAGG AGACCTATAT CCAAAGAGGA AACTGCAAAA AAGAAAGGCA AAAGGCCACT GAAAAAAGCT AAGCGCACTG ACGAAGAAAT ATGGCATCTC GCCAAAGCAA CAACACGAAA AGGTCTCGTT CCCGATGCTC GTGTTCTTTG TGTTTTGCCG GCGAAAGTTG TCTTGGATCG TGTCAAATTT CATTACAGAA CTACTGGGGA CCCCGATAAT CAAGATGAGC AATGCTTTGC CGGTCAAGTA GTCGAGCTAA TTGATGATCA TGTTCGAGTC AGATTGGATG GGCTTCCAAA GTCCAGCGAC GAATGGATGC ATGTATGGAG TCCAAAGCTG TTTCTGGACG GTGGTCGATG GGGCGAGGAT CACGACGTTA CGGTAGAGGA CGAAATCGGG AAGACGTTAT ACTGGGAAGA AGTAGACTCC AAGAGCCTAT GTCTATGAGT TGTTGTAACA ACAGATTTGT TTTTGGAAGC AGTCAGTTTT CTTCGCACTG GGACAGCGCT TTGAAACCGA GATCTTCTTA ATCGGCTCGT ATGAAGTAGA GAAATTCGCT CAAGTTGTCG
|
Protein sequence | MGRDTDFHPS EPPVGAQLSV GRRIQPQSRN HPSGLPVSRR STRLFADLTN RLGEIITTER ALMESAVSTS CTASVSDAAI SGARVVPSNG APIPAEATHG WHNQTSPLLT VSSQTSSPTT TSTQSTGTCS SPLDLLAGVS STEHAKHQTP STTKVRPQSI GAAIKNTNPS ADAVSKTEGV SNRHVAAENQ ATGTSVDVHE SQLQGAETLE QFETKSDDVP SPHLSSFRKA TTTTTTIIRH NTRHKEKRRP VGYVAPKTER TVKPKATPKP RRNRTLQRAS GSFPRACFTH RPSLACLRSQ SLTDYVRDVV LPTASAYEPS TDPDDDDDYD DFNRIHEYRP LEWTEGMAKI TLPEGFCTLD GIARDRTGRG LDWQAGTPLG DYVIQTPIEQ NIRGLAGVYE YTFADKPQVT IASFREQADA YRKVQVGSAV DDGENADSDE AMDKLARKFW QRLGPTMPPA WYGADQEGTL FGDDPASGWS IAKLDSCLHV LSNVPGVTTP YLYAGMWASV FCAHTEDMNL LSINYLHAGA PKIWYAVAPG KDADRFAELC AFQYSMEARK CKEFMRHKRC LLSPKVLQKA GIRYTTAVQR PGDAMITFPG GYHFGFNVGF NLAEATNFGV PEWIPLGLQA HVCLCRPDSV RIDVERLIAL LKLYQQAEKR EVGLSWKTWS QRREEKLARR ALSERRRMSS PPSKKKKRSK APRTTEFWVE VRRPISKEET AKKKGKRPLK KAKRTDEEIW HLAKATTRKG LVPDARVLCV LPAKVVLDRV KFHYRTTGDP DNQDEQCFAG QVVELIDDHV RVRLDGLPKS SDEWMHVWSP KLFLDGGRWG EDHDVTVEDE IGKTLYWEEV DSKSLCL
|
| |