Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23646 |
Symbol | |
ID | 7198557 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 470402 |
End bp | 472508 |
Gene Length | 2107 bp |
Protein Length | 648 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184808 |
Protein GI | 219129252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACAGGAAG AGCGAGATAT GGGGACGGAG ATCACGTTTG TTGATTTCCG CAATACGCTT TTGCACACGG GCATGGTGGA AAAATTGGAA GTCATCAATA AGAAAATGGC CCGCGTTGTC CTTAAACCCA ACGCAAAGGT GTCCAATGTC GTTGGTGTGA ACGAACCGGG CTTGCACAAT AATAGCATTT CCTCGGAGAA CTGGTCTTCG TCTGCCAACG ATAACACCCA AATGGCGTTT GAGACAACTG CGGGATCTTC GAATACCACA AATGGACTCA CGGCCTCTCC CAACTCCAAC GGAAAGAAAG AAAAGTCCTT CTACTTTTTC ATTGGCAGCG TCGAATCTCT AGAAGAAAAA CTGACCAAGG CTCAGGCCCA TGTTCATCCC GAAGACTGGG TTGAAGTTCA ATACATGTCC CGGACCAATT GGACGTTGGA GTTGCTCAAA TCACTTCCGA TGGTGGCCTT TGTAGCTGCA GTGTATTTTG GATCCCGTGG TTTGTCTGGA ATACCAGGAG CCGGCGCTGC CGGTCGGGGA GGAGGGGCTG GTGGGATATT TTCCATCGGC AAATCGACCG CCAAAAAAAT CACCAAAGAA GACGTCAGTG TGACTTTTGC AGATGTGGCA GGATGCCAAC AGGCAAAAAT GGAAATCATG GAATTTGTGG ATTTCTTACA AAACTCGGAG CGCTTCACAA AACTGGGCGC CAAAATCCCA AAGGGCGCGC TGCTGTGTGG GCCTCCTGGA ACCGGAAAGA CGCTGTTGGC GAAAGCAGTC GCCGGCGAAT CGGGTGTGCC CTTCTACAGT ATATCCGGTT CCGACTTTAT TGAAATGTTT GTTGGTGTTG GACCGTCACG TGTCCGTGAT CTGTTTAAAG AAGCCAGAGC AAATGCTCCC TGTATTGTCT TTATCGATGA AATAGACGCC GTCGGTCGCC AACGTGGACG TGGCGGTTTT TCCGGTGGTA ACGATGAACG AGAAAACACA TTGAATCAGC TACTGGTAGA AATGGACGGG TTTTCGCCGA CAACCGGGGT GGTCGTTCTG GCTGGTACCA ACCGAGCCGA CATTCTTGAT CAGGCATTGA CTCGTCCCGG ACGCTTTGAT CGACAAATAA CCGTTGATCG ACCCGATTTG CAAGGGCGCA AAGAAATTTT TGAGGTTCAT TTGCGGGGCA TCAAATTGGA AGGTGAGGTA AAAGAATACG CGGGTAGACT AGCTGGGTTG ACGCCCGGGT TCGCTGGTGC CGATATAGCC AATATATGTA ACGAAGCGGC GATTGTTGCA GCGCGTCGCA AGGCCGAGTC CGTCACAATT GTGGATTTTG AAACGGCAAC AGATCGTATT ATTGGCGGCT TGGAAAGCAA CAAAATCATG AGCACGGAAG AGCGCAGTAT CGTAGCGCAC CACGAAGCTG GACACGCTGT TGCTGGATGG TTCCTGGAAC ATGCCGATCC ATTGCTAAAG GTTACAATTA TCCCTCGAAC GAGCGGTGCT CTTGGATTTG CTCAATACTT GCCAAGAGAA GTATTTTTGC GCAGCCAAGA GCAAATAATG GACCTCGTTT GTATGGCATT AGCTGGACGA GCGGCTGAGG AGGTATTTTT TGGACGCGTG ACAACGGGTG CTTCCGATGA CTTACGACGG GTGACGCAGT TGGTCTACAG CACAATCAAG GACTACGGAA TGAATAGTCG AGTCGGTCAA CTTTCTTTCC CGAGAGACGA TAATGCCGGG CCGGGTGAAA AGCGATACTC CGACTCGACG GCCGAAGCAA TGGACGACGA AGCCCGCGCC ATTGTAGACG AGGCCTACCA GCGGACTGTA GACTTGATGA CGGAGAAGAA AGCACAAGTT GAGATGGTGG CCAACTTACT TTTGGAAAAG GAAACGATTA CGCACGACGA TTTGGTCGAT CTAATTGGGG CCCGTCCGTT TCAGGGTGAC AGCGCGTACC AGGAATACGT AAGCGGTCGA GGGCCGATGA AAAAGAAAGA GCCCCAAGAA GAAATTGATA ATACTGAGCT GGATGGTGTT CTTACACCAG GCCTCGCATA AGAAAATAGT CTACGCGTTT TCGTTTATAA ATCAATGTAC GTCTTTC
|
Protein sequence | MGTEITFVDF RNTLLHTGMV EKLEVINKKM ARVVLKPNAK VSNTTAGSSN TTNGLTASPN SNGKKEKSFY FFIGSVESLE EKLTKAQAHV HPEDWVEVQY MSRTNWTLEL LKSLPMVAFV AAVYFGSRGL SGIPGAGAAG RGGGAGGIFS IGKSTAKKIT KEDVSVTFAD VAGCQQAKME IMEFVDFLQN SERFTKLGAK IPKGALLCGP PGTGKTLLAK AVAGESGVPF YSISGSDFIE MFVGVGPSRV RDLFKEARAN APCIVFIDEI DAVGRQRGRG GFSGGNDERE NTLNQLLVEM DGFSPTTGVV VLAGTNRADI LDQALTRPGR FDRQITVDRP DLQGRKEIFE VHLRGIKLEG EVKEYAGRLA GLTPGFAGAD IANICNEAAI VAARRKAESV TIVDFETATD RIIGGLESNK IMSTEERSIV AHHEAGHAVA GWFLEHADPL LKVTIIPRTS GALGFAQYLP REVFLRSQEQ IMDLVCMALA GRAAEEVFFG RVTTGASDDL RRVTQLVYST IKDYGMNSRV GQLSFPRDDN AGPGEKRYSD STAEAMDDEA RAIVDEAYQR TVDLMTEKKA QVEMVANLLL EKETITHDDL VDLIGARPFQ GDSAYQEYVS GRGPMKKKEP QEEIDNTELD GVLTPGLA
|
| |