Gene Information |
Locus tag | PHATRDRAFT_34010 |
Symbol | |
ID | 7197795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 870058 |
End bp | 871485 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178593 |
Protein GI | 219115595 |
COG category | |
COG ID | |
TIGRFAM ID | |
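
The coordinate, gene length, and protein length fields above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The record states neither, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends in exactly one stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(870058, 871485)
print(gene_len, protein_len)  # prints: 1428 475
```

The result matches the Gene Length (1428 bp) and Protein Length (475 aa) rows.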
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.333756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCCCG GACTCCGCAA GATGGAGTAC GCCGTCCGTG GCAAGGTGGT GATTGCGGCG GACCAAATTT CCGATAATCT CGCTTCCGGA AAGTCCTACC CATTCGATCA CATTGTCTAC ACCAACATTG GCAATCCCCA GTCGGTGGGA CAGCAACCCT TGACGTGGCC CCGACAAGTC CTGGCCTTGG TGGATTTGCC CGACGCCGTC GGCATCCAGC ACCCCGATAT TCTCCGTTTG TTTCCGGCAG ATGCCGTTGC ACGGGCGAAG GAAATCAAAG AAGGATTGGG GGGACACGGA TCGGGCGCCT ATTCGCATTC CAAAGGATGT CGCGCCTTTC GTCGCGACAT TGCGGCCTTT CTCCAAGACC GCGACGGCGG ACTGCCCGCC GAGCCCGAAG ATATTTTCAT GACCAACGGA GCGTCGGCCG GAATCAACAT GATGCTCAAC GCTCTCGTCG CCGACTCTTC CTGCGGAGTC ATGATCCCCA TTCCTCAATA CCCCATCTAC TCGGCGACAA TCGACCTGTT GGGTGGACAG AAAGTGGGCT ACTACCTGAA CGAAGCCAAC GGGTGGGAAC TCAATATGGA AGAACTCGAG CGATCGTTAC AGGAAGCTAC CGAACAGGGA ATCAAAGTGA ACGCCTTTGT TTTGATTAAT CCGGGGAATC CCAGTGGAAC GGTCCTCAGC CGCACAAATT TGCAAGATAT TGTCCGCTTT TGCGCCAAAC ACAATCTCGT TTTGTTGGCC GACGAAGTCT ATCAGGAAAA CGTCTACGAC GAAAACGCCG AGTTTGTTTC GTGCAAGCGC GCAGCCCACC AAGTCGGTCT CTTGGAGGAC GACGGCATTG AATTGGTCAG CTTCCATTCG ATCAGCAAGG GCGTCTTTGG CGAATGTGGC CGCCGTGGTG GCTACATGGA ACTTGTCGGA TTCAACGCCG ACGTCAAGGA CGAACTTTAC AAACTCGCCT CGGCCAACCT ATGTGCAACT GTTTCTGGGC AAATCATGAC GAGTCTCATG GTGCGGGGAC CGGACCAGGG AGACGTTTCG TACGAAAGTC ACCAAGCGGA AAAGAAAGCC ATTTTCGAGA GTCTCCGTCG TCGTTCCAAA ATTGTCAGTG ACGGTCTCAA CAATATTCCG GGGATTTCCT GTCAGACCGC CACCGGTTCC ATGTATTGCT TTCCGTCCGT CGAAATGCCT AAAGGCGCCT TGCCGGCAGC CGAAAAAATG GGAGTCTCTC CCGATACGCT GTACTGTATG AGCCTCCTGG AGCGCACCGG GCTTTGCGTC GTACCGGCTT CGGGATTTGG TCAACGCGAA GGACGCTACG GCTTCCGTAC CACCTTTCTA CCGTCCGAAG ACGAAATGGC GAGAGCAGTC GAACAAATTC GGGAACATTA CCATGAGTTT TGTGAAATGT ACGCGTAA
Protein sequence | MYPGLRKMEY AVRGKVVIAA DQISDNLASG KSYPFDHIVY TNIGNPQSVG QQPLTWPRQV LALVDLPDAV GIQHPDILRL FPADAVARAK EIKEGLGGHG SGAYSHSKGC RAFRRDIAAF LQDRDGGLPA EPEDIFMTNG ASAGINMMLN ALVADSSCGV MIPIPQYPIY SATIDLLGGQ KVGYYLNEAN GWELNMEELE RSLQEATEQG IKVNAFVLIN PGNPSGTVLS RTNLQDIVRF CAKHNLVLLA DEVYQENVYD ENAEFVSCKR AAHQVGLLED DGIELVSFHS ISKGVFGECG RRGGYMELVG FNADVKDELY KLASANLCAT VSGQIMTSLM VRGPDQGDVS YESHQAEKKA IFESLRRRSK IVSDGLNNIP GISCQTATGS MYCFPSVEMP KGALPAAEKM GVSPDTLYCM SLLERTGLCV VPASGFGQRE GRYGFRTTFL PSEDEMARAV EQIREHYHEF CEMYA
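
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The sequences are shown in blocks of ten characters, so whitespace is stripped first.

```python
# Standard genetic code (assumption: NCBI translation table 1),
# built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTACCCCG GACTCCGCAA GATGGAGTAC"))  # prints: MYPGLRKMEY
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values to compare against the Protein sequence and GC content rows.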