Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43490 |
Symbol | |
ID | 7197542 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 599190 |
End bp | 601260 |
Gene Length | 2071 bp |
Protein Length | 630 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177967 |
Protein GI | 219112431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.416955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCCGGCGA TATTTGGTTT ACTTTTTTAT TAGCGTGACT TGCCTTCTAA GCTTGTCGTA GCTCATCTGC TATTCCTTCC ACCAATCGGT TTTCATTACA GTCGCTCTAG ATAGGAACAG AGCGCCCCGG GATGTTCGAA TCTCGTACGG GCAGGACGAG GTTCCTCACT GTTCTACTAG TGAGCGCTGG AATTGGCTCG GAAGGGTTCG AAGCATCCGG CCGTCCTCGA CTACGGAATG GGAAATTGAA GACGCCCCGA CGTTTCTTCG TGCGACACGC TACTATCGCC GAGAATGAGG CACCGGAGAG CTCTCATCTA GGTGCGACAC CCCCGGATAT GGTTGCCTTT GCTTCCGGAT ACACAACCGT ATTTGAGGAG CTAGTATGTA AATCCTGCCA AGCATCGCAG GGAGAGGTAC CAGACGATCT ATTCGGAACC TATTTCCGTT CTGGACCCGC CATGTTCAGT GCTGGCTCAA TTTTACCACC CAAAAAGTCA TTGATCCAAC CAAAGCAACC GCCAGTTTTA GACGGGGAAG ACAAGACGCG GATGGTACCA CACCCCTTCG AAGGCGACGG CGCTGTTCTG GGAGTAACAT TCTCTAGAGA AGGAGATGTC ACCGCCCGTT TCCGATTCAT TCGAGGCACT CCTTTTACGT ACGAACGCAA GAAAGGCAAA CGTGTTTATA CTGGTATGGA TTCTACTCGG ATGGAGGGCC CGTCAGCCGG TGGCGGCTTA GCCAATGACC TCCCTCTTCC ACTTTATCGT CACAACTTGA TGCCAGGGCT GAACAAACTT CGGAAAAATA CCGCAAATTC TCGCGCAATC TATTGGGGGA AGCGTCTCTT TTCGCTATGG GAAGGAGGAC AACCGCACAA ACTGGATGCG CTTGCACTGT CAACAGATGG GAGATCGATG TTGGGAGGTG CTATAAAGAA GGAAGCAGAT CCGTTCGGAG GAAAGATGAT TTACGATCCA TCCAAAAACC GCGCTTTGTT TTATGCGGTA TCTCACGAAT CAAAGGACTC CAGCATTGTT TGTTACGAAT TCGACGACAA GTTCCGCCTG ATAGAAAACG GACGGATCGA AACGACAGTA CCAGGGTTTG CTTTGATAAC AGACTTCGCC GCGACCGAAA ACTACGCCGT GTTTGTACAG CCTCCCATCG CTACGAATGG GATGAAGTTC CTTATGGACA AAGGTCCCGG TAGAGCGTTG AAAGTGGAAG ACCGACCGTC TATCGTTCAC CTCATTCCAC GTCCCGAGTC TTCAAAGCAA CAGATGTCTT TACCTCTCCC GATCGATTCT CTTTCGGATT CAAACTTACA CTTTATAAAC GCATATGAGG ACGGTGGTTT GATCATTTTT GATGCATTCG CTCGGACGGA TCCAAAATAG GCGACAAAGT GCTATCTTGG CCTTGGGGAT CGTCCTTGGA AGAATACCAG GCGTGCGCCT CCAAAAAATC CCTTTGGCGG TACACGATAG ACACGCAGAG GGGATCCGTT TCTAAAAAGC TCATGTTCAA CGACCACTGC TTTTTTGGTG GGATTAACCC TGCTGTTAGT ATGAAAGAGC ACCGGTATAT CTACATGAAT GTCGGGGCTT TGGGAGCGGA TGTGGCTCCA CCTCAAGGGA TTGCACGATT TGACTGTGAA ACAGCGGAAA GTCAAGTCTG GATGCCCGAA AATTTTGAGT TCTGCGGAGA ACCAATGTAT GCTAGACGAG CGACAGAGGA TGGATCAAAC GACCCTGGGT ACATTTTGTC GGTTCTCTAC AATGGAAAGA AAAACGAGAG TGAATTGCTA ATCTTGCAAG CCAATAAGAT TCCGTCGGGG CCAATTGCTC GCCTTCCCTT AGATATTGCC ATTCCACACG GACTTTTCGG ATGCTTCAGT ACAGCTGAGG AAGCTACGTC CTGGTCGACG GAAGAAATCG AAAGGCGGGC CAAACTTGCT GACAAGATGG AATCCAAGGG AAACATGTGG AACGAGGTCC GCAGCGAATT TTCAGGTCTA GGTTTGCGGT TTTCGGACAT GGACGAGTAT GGGTTTGATT TTTTGTTTTA G
|
Protein sequence | MFESRTGRTR FLTVLLVSAG IGSEGFEASG RPRLRNGKLK TPRRFFVRHA TIAENEAPES SHLGATPPDM VAFASGYTTV FEELVCKSCQ ASQGEVPDDL FGTYFRSGPA MFSAGSILPP KKSLIQPKQP PVLDGEDKTR MVPHPFEGDG AVLGVTFSRE GDVTARFRFI RGTPFTYERK KGKRVYTGMD STRMEGPSAG GGLANDLPLP LYRHNLMPGL NKLRKNTANS RAIYWGKRLF SLWEGGQPHK LDALALSTDG RSMLGGAIKK EADPFGGKMI YDPSKNRALF YAVSHESKDS SIVCYEFDDK FRLIENGRIE TTVPGFALIT DFAATENYAV FVQPPIATNG MKFLMDKGPG RALKVEDRPS IVHLIPRPES SKQQMSLPLP IDSLSDSNLH FINAYEDGDK VLSWPWGSSL EEYQACASKK SLWRYTIDTQ RGSVSKKLMF NDHCFFGGIN PAVSMKEHRY IYMNVGALGA DVAPPQGIAR FDCETAESQV WMPENFEFCG EPMYARRATE DGSNDPGYIL SVLYNGKKNE SELLILQANK IPSGPIARLP LDIAIPHGLF GCFSTAEEAT SWSTEEIERR AKLADKMESK GNMWNEVRSE FSGLGLRFSD MDEYGFDFLF
|
| |