Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45404 |
Symbol | |
ID | 7200529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 68431 |
End bp | 70338 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179784 |
Protein GI | 219118000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCT TCGCAGCAGT ATCATGCACC AGTACAGTCG TGTCGCGGCA CGTACTGCGA CCCTTCCTCA AGCGGCGGAG GCAACTCGGC GACCCCCCTT CGACAATTCT CGACAGCGTT CGATCGACTC GCAAAGGGAC GACAACATGG TTGCATCAAC AATCGCGCGC GTCCACTACG GCAAGATCCG CAGAAGATCA GCCGCTGGTG TCTCTCCGCA ACGCCAGACT TTCGTACCGT CCCGAAGATA CCTCGGAGGC TCACGTTTCG CAACCCATAT CGCTCGACAT TTGGCATCCC TCCCGAGGCG GTCACCTACT CCTCGGTCGC AACGGTACGG GCAAGTCACT CATTACGCAG ACACTGGCCA CGAACGGTAC AGGGACGTTG GTGGACGGGG AATACGTCGT GACCGCTCCG CAATGGCACA GTCGTACCGT CACCCACGTC ACCTTTCGCT CCCATCAAGA CGTCTTGCAG ACCTCAGCTC ACCTTACCAG TTACAAGGTT ATTGCCGAAG GAGGACAAGT AAGCAAGGCC GCGCAATTCC TCATTGTACG ATTCGGTCTT TATCCGCTTC TGCATAGGGA AATTTCGACG CTTTCCACGG GAGAAATCCG CAAAGTTCTG CTCGTCCGAG CCTTGGCCAC GCGGCCACGG TTATTGATTC TGGACAACGC CTTTGACGGA TTGGACGTGG CCAGTCGCGA AAACTTGCTC GACTTGGTCC GTCAAACACT CCGAGGATTC AAGCAAGACA TTCTCGTCCA AGGCATCGAC GCCAAGAATG CCGCTCGTAC GCAGATCTGC CTCGTCACGC AGCGTCCCGA AGAAGTGGCG GACGAATTCA CCAACGTGGC GTTTCTCGAT CCTCCCCATA CGGATCGGGT GTCGCCGGAA ACTTCGGCAC AGGGTGGCGA TTTGCGTACT ATGGTACGCA ACGGCCAAAG AGCGACACAA ATCTTTGCGC AAAGCTTGGG AACGACTTCG CCACTAGAGG AAGGTGACCT CGACGACTCC CCTTGGGACA GTCGAAAAGA CGAATACTGG AATGCACCGG GGTTACCGAC TTTGACAGAA ATGTCCATAT GGTGGAACCA TGGACGTAAA GATGACGATG ACGGCACTTC AAGTACAAAC ACAACACTCC CACTGGTGGA CGCCCAGGGT CTACGAATAC AGAAGGGATC CACCGTCGTG CTACAAGAGC TAGATTGGAA AGTCTGGCCA TCGCAACACT GGTTGGTGGC CGGCGGCAAC GGAGCCGGCA AGTCAACCCT CAGTCGGCTG TTGGCTTATT GTGAAACCGA TAGTGATACG GAGGGATATT TGCGCGTACT CCATGGAAAA AGGAATCTAC CACAGATCGA TATTGATGAT GGCCAGCAAA CAGTAGTGGG ATCGCAGTTT GTACACCGAA GGCCTGGGGT AGGGTGGGTC TCGACTGAAT CACATTTGCA GCGTGTTCAT GATCAACGTA CGGCACGAGA GATTCTGCTG GAAGAAGCTT CTTCTGATTC ATACATTGTC CAGACGGTTA CGGAGTGGTT TAACTTGACG CACGATCCTA AGCTACTCGA ACAACACTTT GCTGACTTAT CACAGGGGCA ACAGAAGCTT GTTTTATTGG CCGCAGCAAT CTCGTCACGT CCACGTATTC TTGTGTTGGA TGAGCCCTGC CAAGGTCTCG ACATCGTCCA CCGACGACTT CTGTTGGGAT TGGTGGAGCG ACTGTGCCAA GCGACCGACA CGAATGACAC CGACACCAGT AGTCGAAGCA TTACCTTGAT TTACATTACA CACCATATGG AGGAAGTTCT GCCGTCAATC AATCAAGTCG TGCATCTGAA AGACGGACAA GCAGTCTATC AAGGGTCAAG GAAGCTTTAC AACCCGGACT TGCTTTAA
|
Protein sequence | MSTFAAVSCT STVVSRHVLR PFLKRRRQLG DPPSTILDSV RSTRKGTTTW LHQQSRASTT ARSAEDQPLV SLRNARLSYR PEDTSEAHVS QPISLDIWHP SRGGHLLLGR NGTGKSLITQ TLATNGTGTL VDGEYVVTAP QWHSRTVTHV TFRSHQDVLQ TSAHLTSYKV IAEGGQVSKA AQFLIVRFGL YPLLHREIST LSTGEIRKVL LVRALATRPR LLILDNAFDG LDVASRENLL DLVRQTLRGF KQDILVQGID AKNAARTQIC LVTQRPEEVA DEFTNVAFLD PPHTDRVSPE TSAQGGDLRT MVRNGQRATQ IFAQSLGTTS PLEEGDLDDS PWDSRKDEYW NAPGLPTLTE MSIWWNHGRK DDDDGTSSTN TTLPLVDAQG LRIQKGSTVV LQELDWKVWP SQHWLVAGGN GAGKSTLSRL LAYCETDSDT EGYLRVLHGK RNLPQIDIDD GQQTVVGSQF VHRRPGVGWV STESHLQRVH DQRTAREILL EEASSDSYIV QTVTEWFNLT HDPKLLEQHF ADLSQGQQKL VLLAAAISSR PRILVLDEPC QGLDIVHRRL LLGLVERLCQ ATDTNDTDTS SRSITLIYIT HHMEEVLPSI NQVVHLKDGQ AVYQGSRKLY NPDLL
|
| |