Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_51609 |
Symbol | |
ID | 7204239 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 872122 |
End bp | 874042 |
Gene Length | 1921 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186145 |
Protein GI | 219113123 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.73669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCTT CGACTCGTAG TAGTACCAAC GCTAGCAAGG AAAAGAACCA GTCTGCGGAC GGTCTTTTGG GGGGAGTGTC GCCCACCGCG GTCGTGGCCT TTCCGTCCTC GCATGTCGTG GATTCGTTGA CCCTGCCACC CCGTGCCACG TCTCGGCAGC CGCTTCCGGC GTCCGACAAG AGTATCCGGA CTAAGAACCC GATCCGAGCC ATTGTGGACC CGATTGTGGC CAATGTTCAG TCTGGACGGG AACGAGGAGA CGGCAAGGAT CTTATTTCGT TGGCGGTACG TACGTTAGAC CACAGACCTC GGGGGCGTGA CACAACAATC GTTGCGGAAG GAACAACCCC ACACGACTCG AAGATTCCGT CAAAAATGAC CGACACCGGA CTCGTTCGAT CTGGGCTGTC GTTGACCACA ATACTCGTGC CCGAACGCAA ACTCGAAACA AAACACTCAC TCGCGCTTGT TTATTCGTTT CGTTCATCCG TTCACTCGTT CTTCTGGGCA ATACTATACA ATAGTTGGGT GATCCCACTG CAGCGGGGCA TTTGACACCG TGTCCGGCCG CTATTCGTGC CGTTCGCGCG GTACTCGACG ACAATTCCTC TACCAAAGCG GCTGGTTACG TGAATGCGTG TGGTACGAGT GACGCCCGTC GAGCGATTGC CGCCTTTCAT TCCGTGCACC TTGCCCCCCG CCAACACGTG GACCATGACC CAACTCTTTC TTCTCCCCAC GGCAAAGGAC TCACGGAAGA CGACGTGATT GTGGCCAACG GATGTTCGGG AGCCTTGGAA TTGGCCTTGA CGAGTCTTTT GAATCCAGAC GACGTCCTAC TCGTTCCTCT ACCGGGATTT CCCTTGTACC AAGTCATTGC CGAATCGCAC GGTGCGTCCG TCCTCCCCTA CCGACTCGTG GAGTCTTCCG GCTGGGAATG TGATTTGGTA CAGATTGAAT CACTCGTACG GATGCCGACG CAGCGACAGC GTACCGGGCA GCAGTCGGCC AGAATCAAGG CAATCGTTGT CAACAACCCA TCCAATCCGA CTGGTGCGGT CTTCTCCAAA GATCATTTAC GACGCCTCGT GGCACTGTGC GAGAGACTCG AAATCGTCAT AATAGCGGAC GAAGTTTACG GAGACTTGAC CTTTAAACCG CACAAATTCT ATCCTATGGC GTCCATTGCC GCCGAACTGG GACACCAAGT TCCCATCATT ACCGCTAGCG GAATCGGCAA ACAATTTCTG TTACCGGGAT GGCGAGTCGG ATGGCTCGTA TTTCAGGACG AGTACGTTTG ATCGGTGCAA CCAACCGTCT TTCTTTTGCA CAGCACACTG GCTCACACCG CATGCATGTA CTTTTCCCCC TCGATTTTTT GCCAGTGTCT ACGGAAGTTT GTCACAAGTG CAAGCCGGGG CCAAGCGGTT GGCGCAAGTC ATTCTCGGTG CCTCTCATTT GGCACAAACA GCCATTCCTT CGTTACTGGA ACCCAAGAAT ATAGAAATCA GACAATGGAA ACACGATTTG CGGACGGCGT TGCAGACACA GGCCGACATT TTGTGTGATC GCCTGAGTGC GGCACCCGGT TTACGGGTTA TACGACCAGG TGGTGCCATG TACGCCATGG TACGTATCGA CGCGGACGTG TGGTGCTCGT CGTCGTCGTC AGCCGATCCG GCCATCACGT CCGATACCGA GTGGTGTCAG GCATTGTTGC GGGAAGAAAA CGTATTTGTC CTCCCGGGGA CGGCCTTTGG TTTGCCCGGT ACGGCACGGA TGGTCTTTGC CGCGCCACCT TCCACACTAA TGGAGGCTGC GTCTCGAATT GTACAATTTT GCCATCGACA CGCAATGGAC GCGCCACTAT CGAACCCTAG ACATAGAAAT GAGAAAATAC AAACCAATGG CGTAGATAGT T
|
Protein sequence | MTPSTRSSTN ASKEKNQSAD GLLGGVSPTA VVAFPSSHVV DSLTLPPRAT SRQPLPASDK SIRTKNPIRA IVDPIVANVQ SGRERGDGKD LISLALGDPT AAGHLTPCPA AIRAVRAVLD DNSSTKAAGY VNACGTSDAR RAIAAFHSVH LAPRQHVDHD PTLSSPHGKG LTEDDVIVAN GCSGALELAL TSLLNPDDVL LVPLPGFPLY QVIAESHGAS VLPYRLVESS GWECDLVQIE SLVRMPTQRQ RTGQQSARIK AIVVNNPSNP TGAVFSKDHL RRLVALCERL EIVIIADEVY GDLTFKPHKF YPMASIAAEL GHQVPIITAS GIGKQFLLPG WRVGWLVFQD DVYGSLSQVQ AGAKRLAQVI LGASHLAQTA IPSLLEPKNI EIRQWKHDLR TALQTQADIL CDRLSAAPGL RVIRPGGAMY AMVRIDADVW CSSSSSADPA ITSDTEWCQA LLREENVFVL PGTAFGLPGT ARMVFAAPPS TLMEAASRIV QFCHRHAMDA PLSNPRHRNE KIQTNGVDS
|
| |