Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47736 |
Symbol | |
ID | 7202726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 683768 |
End bp | 686788 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181955 |
Protein GI | 219123279 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCGA CAGAAAAAAG CTGCGCAGAC GAAATTGATC GAGAACTAAG TCTCATCTCC CTTTATGAGA GTGAACTAAA ACGTCTAAAG CGTAAGCTGA ATGCTCAAAA GGTAGCGCAG CCCTTGCATT CGGCAGCAGC ACCGGAACAC ACCGGTACCG AGTACGGCAA CGATTCCGAG CAGGATCCAC TCGACTGGAT TTTGCAGCAA CAGCAAACGT CGGCAGTTTC AAACTTGTTA GCTTCCATTC AATCACACGA AAGCCTTTTA CGGGATGATA TTGAACCCCT GCAAACAGCA ATTCGCCAGT TTGAGTTTAC TACCGTCGAT AAAGTACCGC CTCGCGATGG CAAGCCGTCA TCTCCTCACT CAGCCGTAAG CTATTCTCTA AAAGGCCACT TCCTGGCAAA TGAGTCTATC CATGCGGACT TTGTGATTGA CTTTCAACTA CAGAATTCCC GCGCCGAAGA CAAGATGACT CAAACGACAA AAATTCTGGG CATTATCTGC AACTTATCCA GCATAAATGA ACCAGATCGA GACTTGTCGT GGTTGGCTCG AGAAGCCAAG CCTACTGATT CCGGTAATTT TGCCAGTTTC GTAACAAAAA TATGTTCCTA CTTGGAATTT GATGTCCGTC GTGAAGCTTG TTTAACGAAA TGGGGCCATG TCACCGTAAC AAGGGATCAT TCGAAATACC TTATTGAGAT CCCTCTTGAT CATGATAATA CCGGTACGAA CTTTCATTTG TCGACGCTTT CCATCGTCTG GGGCTGGAAG TGGAAAGACG AGCATGATGT ATTGCGTCTA ACACAGACGG CTTTTGAGTT GGGGCTCAAA CAACAAGATT TGGACTTCCT TGTACAAGCA TGCGGCACCT GTGAGAAGGC GATTGGAATC GTTCTGGCAC AGACGAGCGG GGACACGATG CTTTTCCCCA ATGACGAGCA AGAGTCGAGG ACGCCAACTT TGCCAGATGT CGAACAAGAC GCTGATCAAG AGGATAATGA CAGTTTGGAC GGCGGCGTTC TGCGCGAAAA TGCTTCACTT GCGTCAAGCA CGCAATCCCT TACTAAATCC CCTAGTGGGC GTCGACGCTC CGACTACGAA GTACAGCGGC TTCTTAAAAT CCAGCGCAAC CAGCAGTTGC TGGAAAAATT GGGTCTCTCA CATTCCTCAA GGTCGAAACC AACCACGGCT GAGAAGAGGC GCCCTGCGGA AGAGCAAGCA AATCAGGAGC TGGAAACGAA GCGCAGGAAA CGAAAGAAAG AATTGGACGG AAAGCGACGG TTGTCGGGAC GAGTTCGCCT GAAGCCTGTC ACCTTTGCGG AAGAACAAGT ATTCCATCGG AAAGGGTCTT GGAAAGATTC GTTCTCGAAA AGGACAAATC AGATTCAAAA CGGTGAACAC AGGAGTTTAT CTGGACGCAT TGCTCAACCG CGCGGACGAC CACCCACCGG GTGCGCATGG GATACCAGAA TTGGAATGTG GGTAAAATTC AACGACACCA GCGATAGGAA GCAAGATGCA GCTGAGCTTA TGAGGCCGTC CTCAAGTGTG GCAGCCCAGA GCTCGTCAAC GCTAGAAGCA AATTATTCCT TGGTTGATGG TGCTTTTTCT TCCACTCGAG ATGGGACTGC TCCTCAGATG AACGAGTACA GCAGCCCGTC GAATGGTACA TTTCCTCGTC CGCGTGGAAG GCCACTTAGT GGGCATTCCT GGGACGAAAG ACATGGTATT TGGGTGCCGG GAACCAGCCC TGGAAAAGTC AGTCAAAACG ATTATGGTCG CATAGCACCA TCGATACGCG GTCCGACAGA AAGATTGCCA GAGACACCCA ACCATCCAAG CTTTGAAAAG TCGCCGTCCA ATCTTCGAAA TCTTGACCAT TCGAAAAGTG GCGTAAAATT CAGCACTCCG TTCGCCAAGA CAATTCCTCG ACCCCGCGGA AGGCCACGTA GTGGGCATTC ATGGGACGAA GTGCATGGTG TTTGGGTGCC CCTGGGAAAA CGGAGTCAAA GCAAGTTTGC GTATGTCATA CAACCGTTAC ACGACAGAAC AGAAAAACTG CCCGAGACGC CCAATCATCC AGGCTTTAAA AAGTCGGTGT CCAAGTCCAG CAATCGTGAT CTTCCGAAAA GTGACGAGGA GATAAAAAGT CGTTCCGCCA CGACGTTTCC TCGGCCCCGT GGACGACCAC CAATAGGATG CTACTGGGAC GAGACACGGG GTTGTTGGGC TACGCAACTG AGTACTCGAG AAGTCGACCA GCCGACAGGG GCATTGGCAC CCAAACTTTC CCGTAATTCC ATATTCGACA CGGGCGCATT GCATGTTAGC GCTCGATCAG CTCATATCAA TGTTACAGAT CATCCAAGAA AACCGACAGA TAATTTTTCC GGTCGCTTTC CGCGACCGCG GGGAAGACCA CGGGCGGGAT GTATCTGGGA CGAGGTCCGT GGGCTCTGGA TCCCTGAGAT GAAAGCCCCA GAGACAAAGC CAGTATTGAC GGAGCGTCCC ACCATTCCAC TTCCGATCCC ACGAACGAGT TCCAAAAGTC CGTCAGTCAC GCGGCTGCCA TTCTCGTTAC GTCCCGACAC TATCCAGCCA AAATCGTCTC CGGTAGCAAA CAGCGACGGT ACGTACGGCC GACCCCGGGG AAAAGCTCCA CTGTACTATG GATGGGATAC CCACCGCGGA GTGTGGGTAT CACAGTCAAA CCCGTCCGTC AGCTCGTCAC GGATAGCAGA CCGTTCTCCC GTTTACGAAT CACCACCCGG CCAGAAATAT AGCAATGCGA TAGCGATGCC ATCTCCAGCA GTCGTGAAGG TCGGTTCCGA GATCGATCGA ACGCGAGCAG CAGAAGGTTT CGTCATTCGG GAGCAAGGGA ATCCTATGGG CACGGGAAAT AACGTGGCTG TTGATAGGAA GCAGTCGGAA ATCGACAAAG AAACTGCACT TTACGAATTG TATCAAGAAA AGCTACGGCA TCTGGAAAAA CCGCGGAAAT CAAATGGGTA G
|
Protein sequence | MAATEKSCAD EIDRELSLIS LYESELKRLK RKLNAQKVAQ PLHSAAAPEH TGTEYGNDSE QDPLDWILQQ QQTSAVSNLL ASIQSHESLL RDDIEPLQTA IRQFEFTTVD KVPPRDGKPS SPHSAVSYSL KGHFLANESI HADFVIDFQL QNSRAEDKMT QTTKILGIIC NLSSINEPDR DLSWLAREAK PTDSGNFASF VTKICSYLEF DVRREACLTK WGHVTVTRDH SKYLIEIPLD HDNTGTNFHL STLSIVWGWK WKDEHDVLRL TQTAFELGLK QQDLDFLVQA CGTCEKAIGI VLAQTSGDTM LFPNDEQESR TPTLPDVEQD ADQEDNDSLD GGVLRENASL ASSTQSLTKS PSGRRRSDYE VQRLLKIQRN QQLLEKLGLS HSSRSKPTTA EKRRPAEEQA NQELETKRRK RKKELDGKRR LSGRVRLKPV TFAEEQVFHR KGSWKDSFSK RTNQIQNGEH RSLSGRIAQP RGRPPTGCAW DTRIGMWVKF NDTSDRKQDA AELMRPSSSV AAQSSSTLEA NYSLVDGAFS STRDGTAPQM NEYSSPSNGT FPRPRGRPLS GHSWDERHGI WVPGTSPGKV SQNDYGRIAP SIRGPTERLP ETPNHPSFEK SPSNLRNLDH SKSGVKFSTP FAKTIPRPRG RPRSGHSWDE VHGVWVPLGK RSQSKFAYVI QPLHDRTEKL PETPNHPGFK KSVSKSSNRD LPKSDEEIKS RSATTFPRPR GRPPIGCYWD ETRGCWATQL STREVDQPTG ALAPKLSRNS IFDTGALHVS ARSAHINVTD HPRKPTDNFS GRFPRPRGRP RAGCIWDEVR GLWIPEMKAP ETKPVLTERP TIPLPIPRTS SKSPSVTRLP FSLRPDTIQP KSSPVANSDG TYGRPRGKAP LYYGWDTHRG VWVSQSNPSV SSSRIADRSP VYESPPGQKY SNAIAMPSPA VVKVGSEIDR TRAAEGFVIR EQGNPMGTGN NVAVDRKQSE IDKETALYEL YQEKLRHLEK PRKSNG
|
| |