Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50075 |
Symbol | |
ID | 7198757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 317396 |
End bp | 320387 |
Gene Length | 2992 bp |
Protein Length | 955 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184860 |
Protein GI | 219129363 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGTCG AGAACGGAGA TGGAACGAAC GGCAGTAGGA GCGAAGAAAT CGGGGCAAAG GCTGCTCCCC ACGCGACTAC GTCGACAGAC AATCGCACAG CATCGTTCAC TTCCACATCG TCACCACCAT CCTTTGCTCC ATTTCAATCC CAACGAAGCG TTCAGCATAA GAGCAGCTAC GATGTGTCCA AGTCCATGGA ACGGCTTCGA TCTTCAATGA TTGGTAATGT ATCGTGGATT CCTGGTCGTC ACGAGAAGGA GGACTACGAT GAACTTGGTC AAGTACCGAC CACAAGGACT TCAACGACAA CTACTGCATC ACCAGCTACG CCCTTGCCTC CAAGTTTACA CCGCGCGTTG GGACAGATAC CAGCAATCGT GTTGATTGGC ATATTTCATC TCATGATTGG AATCCCCTTT GGTGCTTCTT ACTTTCCAAT TGGTTGGAAG GCTCCTGGGA GTGCCGACGA CGAAGACGAG AATGATGACG ACGGGGTCCA TGGAATTTTT CCGTTACCAG GCAAAGAAGC TCTGGGAATT CGAATGTTTT TGTTCTCCAC AATCCTGGGA CAAATTGTCT TTACGGGTCT TTCTGGCTTT CGCAATCCTG TCGGCTTACA AATGGTCGAG AACGTTCCGT TCTGTCACGA GCTGGCCACA ATCGCCGTGC GTCATCAAGG ATACACCCGA GAAGCTCTGT CAACGCTCTT TGCCATGTAC GGATTCTCGA GTCTGCTCGT GGGTGCGGTC TTTTATGCTC TGGGACGATG GAAACTAGGC AACGTTGTTT ACTACTTTCC GAAACACGTT TTGGTGGGAT GCATTGGGGG AATTGGCTTG TATATCGCCA AAACGGGTGT GGAAGTTACC AGGAATGCGG AATTCTCACT ACGGGCAGCC ACGGTCACGT ACGGACTTTT GTTGGTCGTC GTCCTTGCCT TTGAAGTCGT CCTGCGTCTA CTGGAATTTG GAACGCGTGA CGTTAGCGGG AACGCAAGGT ACCCATTGTT GTCGCCGTTA TATTTTTGCA GCATTACACC CGTCTTTTAT ATGGCACTCT TTGTGCTTGG CGTGAACATT GAGACTGCCA CGGAGGAAGG ATTCTTCTTT CCCGCATTGG ACAAATGTAC TATTGGAGGA GGTGAAAATG GTGAAGCCTG CTCCACGTCC TTGTGGGATT CCATTTTTGA TCAGAATCTG TTCAATATTT GGAAAGTCGT AAACTTCTCG ACAGTCTCCT TTCCCGCGCT GATGGACGCC ATTCCGACCT TGGTCGCACT GACTTTGTTC AGTCTCATTC ATGTTCCCAT TAATATTCCC GCCTTCGCAA ACTCCACGGA CACTGATGTA GATATGAACA AAGAACTGAT TGCTCATGGC TACTCCAATT TGCTGGTCGG CATTTTTGGC GGCTTGCAAA ACTACATGGC CTATACGCAG TCGGTCTTGT ACGACAAATC AGGGGGAACG GGAAAGGCCT CGGGCTATGC TGTCGCCGGC ATTACGTCGG TGCTTTTCTT GATTGGGCCC ACCATTGCTT CCTATATTCC CCGGTGTATG GCGGGGACCT TATTGGTCCA CGTAGGTGTG GATTTGTTTC TGGAAGGCGT TTACGAAACA TGGGGAAAGT TCGACGCACT GGAGTACGGT GGTATTTGGC TTATAACAAT GGTCATGACA CTGTACGGGA TGGAGGCCGC CATGATTGCC GGCTTCATCA CGGCTCTTTT TACATACGCC GTGCAAAATA CGACGTACGT TCATATCCTG CGTGGATCCA TGTCCGCAGC TACGTTGCGC AGTAGCAAAT GGAATCGCAG TACCCGGGCC AACGCTATTT TGGCGGACGA GTCGACCGGA CGCAATCGTA TCCTGGTGGT CCAACTCCAG GGACACTTGT TTTTCGGCAA CATGGTGCAA CTCACCCAGA GTGTAAACGA TGTGCTGAGT GAGAAAGCGA AGCCTCGTAC GGAACCTTGG ATTGTGATTA TTGATTTTGG TTTGGTACTG GGGATTGACT CTTCCGCGGC ACAATCGATC AGCAAACTAT CCAAGACACT GCAACACAAG CACGGTGTCG ATCTTTGCAT TTTCGTGACG GGTTCTGGGG AAGGCTTTCC AACGGCCTAC AGTTTGTCCA AGGAATTGTC CACTTTATCA TCGACCACGC CAGTGGTTGT TTCGGATGAA GACGTGCGGA CAACCGAAGC GACACCCTTA TTGGCACCGT TCGCGACACC GAATCCCGAT ACATCATCAT CATTGTACAC GGGCAGTCGT GTATGCACTA CGCTGGACGA TGCGCTGGTG TTTGCCGAAG ACGCGTTACT GGCGCGCACC GATTGGTCGT TGTTGGAAGC AGACCGTCAC ATTGGCGATC CCCTCCGCGG CGGCGTGTAC GATATCACGG ACGAAACGCG AGTGGCTTTG CGGTATTTGG AAAATCTGTG TCCACGCGGG GTGGACCAAG CGCACGTGCG TTTGCTGTGG AAGTGCATGA CACGGGAAAC GTACGTATGC GGCGATTCCG TGTGGTTGCA GGGTTCGGAG AGTGACTGTA TGAAACTGTT ACTGCGTGGA ACCTTATTGG CGTCTCTCGA GAACGAAGCC GGGACGAACG AAAGCATCGC GGCGGGCAAT ACGATTGGGG AATTGGGTTT GGTGGAACAC ACGCCACGGA TGAGTTCCGT CACGGTAGTG TCGGCGGACG CTGTCCTCTA CAGTCTACAC CGCGAGCGGT GGCGGGAATT GAAGGCCGTG TCCCCCCACG CCGCGTCACT GACGGATCGT ATCTTGATTC GTTACCTGTC TGCACGTGTC CAACACGTGA GCAATCGTAT CTACGAAACA CGGTGTTTGC CGATATAGAC GTTGGGTTGG GGGCACCAGG CCATCGTTTG TGGGAATGGC AAAACAAGTC GACCAATCAC GTGGGTGGCC GGGGTTTGCT GGTATCCATC CTTCCATGCC CGTAAGGATA CCATGTCACC CT
|
Protein sequence | MEVENGDGTN GSRSEEIGAK AAPHATTSTD NRTASFTSTS SPPSFAPFQS QRSVQHKSSY DVSKSMERLR SSMIGNVSWI PGRHEKEDYD ELGQVPTTRT STTTTASPAT PLPPSLHRAL GQIPAIVLIG IFHLMIGIPF GASYFPIGWK APGSADDEDE NDDDGVHGIF PLPGKEALGI RMFLFSTILG QIVFTGLSGF RNPVGLQMVE NVPFCHELAT IAVRHQGYTR EALSTLFAMY GFSSLLVGAV FYALGRWKLG NVVYYFPKHV LVGCIGGIGL YIAKTGVEVT RNAEFSLRAA TVTYGLLLVV VLAFEVVLRL LEFGTRDVSG NARYPLLSPL YFCSITPVFY MALFVLGVNI ETATEEGFFF PALDKCTIGG GENGEACSTS LWDSIFDQNL FNIWKVVNFS TVSFPALMDA IPTLVALTLF SLIHVPINIP AFANSTDTDV DMNKELIAHG YSNLLVGIFG GLQNYMAYTQ SVLYDKSGGT GKASGYAVAG ITSVLFLIGP TIASYIPRCM AGTLLVHVGV DLFLEGVYET WGKFDALEYG GIWLITMVMT LYGMEAAMIA GFITALFTYA VQNTTYVHIL RGSMSAATLR SSKWNRSTRA NAILADESTG RNRILVVQLQ GHLFFGNMVQ LTQSVNDVLS EKAKPRTEPW IVIIDFGLVL GIDSSAAQSI SKLSKTLQHK HGVDLCIFVT GSGEGFPTAY SLSKELSTLS STTPVVVSDE DVRTTEATPL LAPFATPNPD TSSSLYTGSR VCTTLDDALV FAEDALLART DWSLLEADRH IGDPLRGGVY DITDETRVAL RYLENLCPRG VDQAHVRLLW KCMTRETYVC GDSVWLQGSE SDCMKLLLRG TLLASLENEA GTNESIAAGN TIGELGLVEH TPRMSSVTVV SADAVLYSLH RERWRELKAV SPHAASLTDR ILIRYLSARV QHVSNRIYET RCLPI
|
| |