Gene Information |
Locus tag | PHATRDRAFT_50058 |
Symbol | |
ID | 7198805 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 270049 |
End bp | 273987 |
Gene Length | 3939 bp |
Protein Length | 1312 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184934 |
Protein GI | 219129518 |
COG category | |
COG ID | |
TIGRFAM ID | |
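The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values listed in this record but are not stated by it. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 270049
end_bp = 273987
protein_length_aa = 1312

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
total_codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
coding_codons = total_codons - 1

print("gene length (bp):", gene_length_bp)
print("leftover bases:", remainder)
print("coding codons:", coding_codons)
print("matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the computed gene length agrees with the listed Gene Length of 3939 bp, and the codon count minus the stop agrees with the listed Protein Length of 1312 aa.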
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTAT TTCCCAAATC CTTCCTGGCG CACACACCAC GGATTCGTGG GAAGACGGTG CGTACCACCT ATAGGAGCCC TCTCCAACTT TTGCACATTC GACAACGCTC AATCACCCAA GCGACTGATT CATCCATTAC CATCGCAGGG GTAACGATTG AACAACCACC TGGCGGCAGC GGTCGGGACG ATCTGATACC ACCCTTCGAT GCTACTTCGT CCCGTTTCGT GCAGAGCTTG CCGCCGTCCT ACGTCTCCAC GTTAAGATGG ATGCTGCAAA AGGATTTAGT CATGAAACAG GATTTCTGTT TACTGGGACA TTCTGCTGCC GATCAACGAC TTTTGGCACT GACGTACGCC GCCTTGACTT CTCGAGAGAT AGAATATGTT TCCCTGACTC GTGATACTTC GGACGCGGAC CTTAAACAAA GAAAGGAAAT GCGCAACCCT TCCGACCCTC TGTCGTCGGC AACATCCAGC GTGTCTTCCA TTTACGTTCC GCAGGCGCCG GTACGCGCCG CATTGCACGG GCGCCTGCTG ATTCTGGATG GCATCGAAAA GGCGGAACGC AACGTTCTCC CGACACTCAA CAATCTGCTC GAAAACCGGG AACTCTCTCT GGATGATGGG TCCATGCTGG TCTCAAGTGC AACGTACGAT CAGCACCGCG AATTCCAACA TGATGACGTA CTCAATAGAA TGCATCGAGT GCACGAGGAT TTTACTGTCT TGGCCCTGGG TTCCCTGTCG GACTCGTCTG GTTTGGATCC CCCGCTACGA TCGCGATTCC AAGCTCACGT GGCGCCTGCA TTACCTCCTG GAGAAATGCT CGACGCGTTG GCATCTTTCG TTGATGCGGA TCGAGTCTCC ATGGAGCAAT TGCATACACT GGTACAGTCA CCACTGGTTT CTTCGACCCG AGAAGCTTCC TATTCTGTCG CTATCAAGGC GGCCAAATTT CTTCAAGCGA ACTCTTTCAT GTCGGTTTCA TCGGTAGTGC GGGGGTTTGG AATTGGGACA CCAATGACAG CGATGGACTG GAAAGCCGCC AAAGGGCAGA GCACGCATCG GACCTTGGAT CGAACAAAGG TCCCGTCATC CTTTATTCCA ACCCATTCCA CAACAGCTAT CACCGAACTT GTTCAAGCTG GTCTCGGAGT TGGGTGTGCG GTTGCATTGG TTGGTCCCAA GGGTAGCGGC AAGTCCGCAC TGATTCATGC AGCTTCTCGA GCACTTAATC AGCCATACGA ACTCTTTGCG GTCGTGCCAG ATATGACATC CCGCGATTTG TTGCTCCGAC GCGCGACCGA CGATGCAGGA AACACAATTT GGAGAATGAC GCCCTTGGCC CGAGCTGTTC AGAATGGTTC CTGGGTTGTC TTGGATGGTA TTGATAGGTT AAATTCGGAT ACCTTGACGA GTTTGGCGAG ACTTTTTGAA ACTGGTCAGG TTGACATGCC AGACGGATCA CGATTGACCG CACATAAAGG ATTTGGCTGT ATTGCTCTCG CCCATCCACC AGCTGCAGGA AAGTCTTGGA TTAATCCTGA AATTGCCAGC ATGTTTTGTT GGGTTGGTGT TGATCCAATG GATGCCGGTG AACTCGCAAT TGTACTGAAC GGTCTTTTTC CAGATATTGA GGAGAGAGAA CTCGACAAGC TGGTTCGCCT ACGCGACCGC TTGGATGCTG CGGTCAAGAG CGGTGCGGCA GATACGGTCG AGGAACGGGA AAGCTTGACT CTGTCTCTGA GAAAGATGAA GCACATTTGT CGCAGACTCC AGCGTGACGG CCGCCACTTG 
TCTAAGTTGG TAAAGGACAC GTTATTGACT AGTCTGATGC CGGAGCGAGA GCGCCAAGTG GTCCAAAGTT GCTTGCATGA TTGCGGCATC TTTGAATACA AGGCCGGCGA TCGTTTCGAA AAAGACATTG AAATCGACCG CAAACTAATA GATTCTTGTC GGCGGACTCC CGAGAATCCA ATTCTTGTAC CAAATCCCAG GTTTCAAGAA AACCCAGGCC ATGCTCGAAT TCTTGGCAAC ATTCTCGAGG CTCACGCCGT CGGCGAACGG GCGCTTCTTA TCATGGGCTA TCAAGGAGTT GGAAAGAACA AAGTTGTAGA TTTTTTGCTC CATCGGCTAC AGCGTGAACG TGAATACCTA CAGCTGCATC GCGACACTAC AGTTCAGTCT GTTTTATCAG TCGCGTCTGT GGAAAAAGGA CGAGTTATAT ATGCGGACTC TCCATTAGTT AGAGCAGCGA CGCATGGGCG AATCCTGGTC ATTGATGAAG CTGACAAGGC TCCTGTAGAC GTTGTTGCTT TGCTCAAAGG TCTTATTGAA GACGGCGAGC TAGCTCTTCC CGACGGAAGG GTCCTACGGT ATGACGATGA CGGTCGATCG AATACTCTGG CAGTCCATGC AGACTTTCGG ATTTGGGCTT TGGCCAATCC TGCGGGATAT CCATTTCACG GTAATAATCT CGCTCGCGAA ATGTCTGACG TGTTTTCGTG CCACACCGTC CCGCCATTGG ACAGTGAAAG TCAAATGCAG ATTCTCCGCA GCTACGCCCC TAATATCAAG AAGAAAAAGC TGTCAAAAAT TATCTCTCTT TGGCAGGATC TCGGTGAAGC GCACGCGAAA GGAACTCTTG TGTATCCCTT TGGTATCCGT GAAGCCGTGT CTGCCGCTCG TCACATGAAC GAGTTTCCTA ACGATGGGCT GACTGGAGCC ATTGAGAACG TAATTGCTTT CGATCGTTCG GATCTTGCTC TCATGAAGCA GCTAAATTCT ATATTTCGAA AACATGGGAT TGAACTCTCG TCTTCTGAGC CGATGAATGA AAAACAACGA ACTGAAGGTG GAATCTCAAC ACCAAGAACC CGAGTTGGCG ACCCGAAGCA CGGCAAAGTA GACCCAGATA ATACTCCACA TATTGGAGGC AACACCTGGT ACGGTGGTAC CGGCGGTTCG GACACTGCCG GACTGGGCGG TAGAGGCGGA CCGTATCGCG TAGACCTGGG GCACCCTGTC CACCAAGTAT CAGACGAAAT GAAAGCCCAA GTATCAGATG AAGCTCAGCA CAGAGCACGA GAAATAGCTG CTATAGAACT AGAAAAGAAA TTGCGGGAAC TGAGCATGGG AAAGTACGAT TGGGAAAGAT ACAAAGGGCT CCGGGAGCGC GTTGCCCTAC AGATTGAGCA GTTACGAGTT CATTTAAAAG ACTTGCAACA TCGTATGGAA GAGCGAATGT GGTTAAACCG ACAATTTTCC GGAGAATTGG ACGAATCGCG ATTGGTAGAT GCACTTGCGG GTGAAAAAGA TGTATTCAAA CGACGCGGAA CAGCAGTTGA TTCAAAGGTA TCCTCCAAAC TATCATCTGA TAAAATGCGA ATAAAGGTCG TTGTAGACGT TTCCGCTTCG ATGTATCGAT TTAATGGCTA CGATGGTCGA CTTGAGCGGT TATTAGAGGC AACCCTGATG ATCATGGAAG CGCTACGGAG CGATGATCGC TTCCGGCTTG AAATTGTGGG GCACAACGGG TCAAGTGCTG TAATCCCACT ACTAAAAGAC GACTCTTCGC TAGACGAAGC CACACAACTC CGAATTCTGC 
AAGGTATGAT TGCTCACACT CAGTACACGT ACGCTGGTGA CAGCACTTTG GAAGCCATTC AAAGTGCCAT GGAGAAAGCT CGCCCGAAAG ACTTGGTGTT AATGATCTCG GACGCCAACC TGGAGCGCTA CCGGATTGAG CCCATACAAG TTGTAAAATT GTTGCAAAAG CCGGAAGTTC ACGCTCATTT AATTTTGATC AGTTCTTTTG GAGAGGAAGC GTACGAGCTT GCAAATGCTG TTCCAAACGG ACGAGCCCAA GTCTGTCTAG ACAGCAGCGA GCTCCCTCTA ATATTGAAGA AGATCATCGT GGCTTCTGCT GATCTTTGA
|
Protein sequence | MSLFPKSFLA HTPRIRGKTV RTTYRSPLQL LHIRQRSITQ ATDSSITIAG VTIEQPPGGS GRDDLIPPFD ATSSRFVQSL PPSYVSTLRW MLQKDLVMKQ DFCLLGHSAA DQRLLALTYA ALTSREIEYV SLTRDTSDAD LKQRKEMRNP SDPLSSATSS VSSIYVPQAP VRAALHGRLL ILDGIEKAER NVLPTLNNLL ENRELSLDDG SMLVSSATYD QHREFQHDDV LNRMHRVHED FTVLALGSLS DSSGLDPPLR SRFQAHVAPA LPPGEMLDAL ASFVDADRVS MEQLHTLVQS PLVSSTREAS YSVAIKAAKF LQANSFMSVS SVVRGFGIGT PMTAMDWKAA KGQSTHRTLD RTKVPSSFIP THSTTAITEL VQAGLGVGCA VALVGPKGSG KSALIHAASR ALNQPYELFA VVPDMTSRDL LLRRATDDAG NTIWRMTPLA RAVQNGSWVV LDGIDRLNSD TLTSLARLFE TGQVDMPDGS RLTAHKGFGC IALAHPPAAG KSWINPEIAS MFCWVGVDPM DAGELAIVLN GLFPDIEERE LDKLVRLRDR LDAAVKSGAA DTVEERESLT LSLRKMKHIC RRLQRDGRHL SKLVKDTLLT SLMPERERQV VQSCLHDCGI FEYKAGDRFE KDIEIDRKLI DSCRRTPENP ILVPNPRFQE NPGHARILGN ILEAHAVGER ALLIMGYQGV GKNKVVDFLL HRLQREREYL QLHRDTTVQS VLSVASVEKG RVIYADSPLV RAATHGRILV IDEADKAPVD VVALLKGLIE DGELALPDGR VLRYDDDGRS NTLAVHADFR IWALANPAGY PFHGNNLARE MSDVFSCHTV PPLDSESQMQ ILRSYAPNIK KKKLSKIISL WQDLGEAHAK GTLVYPFGIR EAVSAARHMN EFPNDGLTGA IENVIAFDRS DLALMKQLNS IFRKHGIELS SSEPMNEKQR TEGGISTPRT RVGDPKHGKV DPDNTPHIGG NTWYGGTGGS DTAGLGGRGG PYRVDLGHPV HQVSDEMKAQ VSDEAQHRAR EIAAIELEKK LRELSMGKYD WERYKGLRER VALQIEQLRV HLKDLQHRME ERMWLNRQFS GELDESRLVD ALAGEKDVFK RRGTAVDSKV SSKLSSDKMR IKVVVDVSAS MYRFNGYDGR LERLLEATLM IMEALRSDDR FRLEIVGHNG SSAVIPLLKD DSSLDEATQL RILQGMIAHT QYTYAGDSTL EAIQSAMEKA RPKDLVLMIS DANLERYRIE PIQVVKLLQK PEVHAHLILI SSFGEEAYEL ANAVPNGRAQ VCLDSSELPL ILKKIIVASA DL
|
| |
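The gene and protein sequences above can be spot-checked against each other by translating short stretches of the gene sequence. This is a minimal sketch, not the method used to produce the record. The Translation table field is blank here, so the code assumes the standard genetic code (NCBI translation table 1). The function and variable names are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
head = "ATGTCGTTAT TTCCCAAATC CTTCCTGGCG"
# Last 27 bases of the gene sequence (in frame, includes the final codon).
tail = "ATCATCGTGGCTTCTGCTGATCTTTGA"

print(translate(head))
print(translate(tail))
```

Under the standard-code assumption, the first call yields the opening ten residues of the protein sequence (MSLFPKSFLA) and the second yields its closing eight residues (IIVASADL), with the final TGA read as a stop codon. The gene sequence is already given in coding orientation (it begins with ATG), so no reverse complement is applied even though the Strand field is "-".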