Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44889 |
Symbol | |
ID | 7199811 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 551669 |
End bp | 554786 |
Gene Length | 3118 bp |
Protein Length | 892 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178801 |
Protein GI | 219116012 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.423509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACGTAGCA CTAGCATCCA CTCTGCCCTG CGTCGACTCT ATTCCTATCG TGATAACACA CGGAATAGCA CTTTTGTTCT TTGCATTTCG TTCTCAAACA ACCATGGCCG CAAAAGACGA CACGGGAATT CCACGCTTTT CTCCTTCTTC TTGGAGCATG CGCCGGCACT TTGGAGTTTC CGTAGTGTTA GCGCTGGGCT TTGGGATACT ATTGTTGGTA TTATCTAGTG ACCCACACTC CTCGACCAGC TACACACAAC ACACCAACTC TCCGGTCTTT TCCTCCCGCT TGCTGCGCCT CGAGCCCACC TGGGAGCTGC AAACACGAGA CAATGTATCC TCCGTATGGA AAGCACTGGG AGTGTCCAAC GAGCAAAGGG TCCTCCTGAG GGTACATTAC TCGAAACACC AGCCCAAAAA CAAGCGTACA GTCAGCGATG CCAGTGGTGA TTGGTTTGAT GATTCCACAA TTCGTAGTCT TCAACAATTG ACATCCGAAG ACCGCATGGC AGCGGAAGCG GAAGAGCTGG CCGTTAACGT ATCGTTTGAA GACATTTGTA CGTAGAACGT TGAATATTCT GCAAATTGAT TCCGTATTAC TTTCTTACAG ATGCCAGCGT TTTTCTCTCT CAGATAAAAC AATTGTATTT ATCACCGCCA CTTGGGCTTT GGGAAATTTG AGCACCTGGC TGCATATGCC GAGCTTGGTG GGTGAAATCA TGTCGGGATT TTTGCTAGGT CCGCCACTCG CTGACTTTTG TCCCTTCCCT GAGGCCATGG TTCTCATTGG GAGTTTTGGA TTGATTGGGC TCATTTTGGA ATCCGGTATT GATTTGGATG TCGGCCAGCT TAAAGAAACA GGGAAACGCG CCATTCTGAT GGCGTTTACC GGAACGGCCC TACCACTATT AGTGGGAATG GGTCTTGGTC GGGCTGCTGG GCAAGAATTG CAATCCAGTA TCGCAATTGG CGCAACGTTT TCGCCTTCCT CGCTTGGTGT CTCGGCCAAT GTACTGTCGG CCGGAGAAGT TCTCAACACA CCGACGGGTC AAATGATTGT CGCCAGTAGC GTTGTCGATG ATGTCCTTGG GCTTATCATG TTGAGTATAC TGGATGTTTT TGTCAGGGAA AACACCACGG CGTTCGATTA TTGCATACCC TTCATTTCGT CGTTTGGGTA TTTGATTGTA CTTGGATACT CCGGGATCAC TTGGATGCCG TACATAATTG AACACAAAAT TATGGCTAGA TTTCCGGAGG GGTACCGCGA GTTGGTCGCC TTTTGTCTCA TGTTTTTACT CCTCCTCGTA TACCTTCCGC TGCTGAATTA CTCCCGTGCC TCGTACCTGA CTGGTACTTT TTTGGCGGGT CTGACATTTT CCCAGATAAA CTCGGTCCAT GCTGCTTTTG CTCAGCACGG ACGAGGAATT CTCGATTGGC TGTTGCGCAT ATTTTTTGCA GCCACTATCG GGTTCCAAGT GCCCATTACA CGCTTTCAAG ACGGTTACGT CCTCAAGTGG GGCGCGAAAT TTTGTAAGTC CTTGCGCCCG AGCAGCGACT TTATAGTCAT TGCTATGGAT TGATCCCATT CGCTCACGAT TCTGTTTTTA CTCAATGTAG TGGTGCCGAT TTTGGCCAAA ATGCCTTTAG GCCTTTACGT CCCGCGCTCC AACTGTAAAA CGCTCCCGGA GGACTTTCCG TATGACCCGT ATTGGCGCGA TGTATGGATT ACTTCGCTGG CACTGGTCTG CCGCGGCGAA TTCAACTTTA TTATTGCGAG TTTTGCATTA AGTGAAGGAC TTGTCAACCC TGACATCTAT TCGGCGATTG TCTTTTCAGT ACTATGTGCC AGCATCTTTG GACCGCTGAT ATTGGCACGT GTCATTGCCT ACTACAATGC TAAATCACAG GCCTATCTTT CTGGAAGTCA TCCCATCGAG AGAGACGGCG ATACGTCTGA CGGCTTCCGT CCGCTTTATC TAGCGATTCA AGCGAGAACT CCGATCCACT GGGGTTTGCA GGACAGTTTC AAGAAAGCTT TGGACGATGC TGGCCTCATC ATCATTGATC ATCGTTCCTG GCATACTTTG GGCCTCAACG CGATCGATAT TACGGAGCTC TTTGTTCAAG ACACGAGAGT CAAGGTTCGT GTTTGTGCAT GCTTCGAAAC TCGCAAAGCA GCAGCTGCGG CGGCTGCCAA TTCCGGTGAG TCGGTTGCGA TTCTTTTGCC GATTCAAAAA GGCCAAGTAC CTGAGTCAGC TACCATTAAC GGCTCAGCAA CAAGTCAAGA CACAGATGCC AAAGACTCGG AGACTAGTAG TTCGCAAATG GAAAAAGGCA AGACAGAAGA CGAGATCATT CGTGCACGGT GCGATGAAAT AAAACAACGT ACGTAGAACG ATTCTGTCGA GTATGATAAA AGATGCGTGA TGATATCCTA ACCCTTTCCT CCGACGTAGT TCTCTCCAAT TGCCTCCTTC CACACGATAC CGAGGACTAT GTGATTCAGG TGTCGCAGTG GCAGACATAT ACGTTCGACA ATCAAGACTT GAAGGGATCG GATGATGACA AAAAGTTCTA TCGATTCAAC TTGCATCAGC CAACGGAATT AATAGTAGCT CCGAGCGAAG TATCTGCAGA AGATTCTCTT CCAACTGCTT CAGAAGTCGA GCCGCTGCGC CGCCCGGCAT TATATCGGCG GTCGTCAACA ATCACCGTAA CCGATGATCC TACACCTGCT GATGAGCCCA TGCTTTCGGG TCCTGATCTG TGGGAATCAG ATGAAATTTC TCACGCAATG ACTCGCGACG GCTATGTCAT GTCTCCGGTT CCCGGCGGTA TTCACCGTTC GGTGACCGCA GGTCTAGTTG GAGAAGCTGA GCACGGACAT CATCCGGAAC CGTATCACCG TCGTCGCATA ACTTTCGACG CAGCCCTGTT GACAACTCAT GGGGACGAGC TTGAAACAAG CATGATCAAA GAGCGTCTGC ATGGATATGT ACGACCGCAT TTGTAGCTTT TGTGGTCATA CACGCGTCTT CTTAGCCCTG TCATCTCGAG ATACATTTTT GAATAGTATG TAGATTATAG TTTTTGTACT TGACAGAGAG TTATGCTT
|
Protein sequence | MAAKDDTGIP RFSPSSWSMR RHFGVSVVLA LGFGILLLVL SSDPHSSTSY TQHTNSPVFS SRLLRLEPTW ELQTRDNVSS VWKALGVSNE QRVLLRVHYS KHQPKNKRTV SDASGDWFDD STIRSLQQLT SEDRMAAEAE ELAVNVSFED IYKTIVFITA TWALGNLSTW LHMPSLVGEI MSGFLLGPPL ADFCPFPEAM VLIGSFGLIG LILESGIDLD VGQLKETGKR AILMAFTGTA LPLLVGMGLG RAAGQELQSS IAIGATFSPS SLGVSANVLS AGEVLNTPTG QMIVASSVVD DVLGLIMLSI LDVFVRENTT AFDYCIPFIS SFGYLIVLGY SGITWMPYII EHKIMARFPE GYRELVAFCL MFLLLLVYLP LLNYSRASYL TGTFLAGLTF SQINSVHAAF AQHGRGILDW LLRIFFAATI GFQVPITRFQ DGYVLKWGAK FLVPILAKMP LGLYVPRSNC KTLPEDFPYD PYWRDVWITS LALVCRGEFN FIIASFALSE GLVNPDIYSA IVFSVLCASI FGPLILARVI AYYNAKSQAY LSGSHPIERD GDTSDGFRPL YLAIQARTPI HWGLQDSFKK ALDDAGLIII DHRSWHTLGL NAIDITELFV QDTRVKVRVC ACFETRKAAA AAAANSGESV AILLPIQKGQ VPESATINGS ATSQDTDAKD SETSSSQMEK GKTEDEIIRA RCDEIKQLLS NCLLPHDTED YVIQVSQWQT YTFDNQDLKG SDDDKKFYRF NLHQPTELIV APSEVSAEDS LPTASEVEPL RRPALYRRSS TITVTDDPTP ADEPMLSGPD LWESDEISHA MTRDGYVMSP VPGGIHRSVT AGLVGEAEHG HHPEPYHRRR ITFDAALLTT HGDELETSMI KERLHGYVRP HL
|
| |