Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40578 |
Symbol | |
ID | 7198367 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 329600 |
End bp | 332419 |
Gene Length | 2820 bp |
Protein Length | 860 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184601 |
Protein GI | 219128818 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATGT CCTCCGAGTC TGACGAGCCT CCCAGCTCGC CATCCGCGGT GGCGCCATCG CAGTCGGAAG AAATCCGGCT GGATCCGGAC CAGCAGGCCG CCTACGACGC CGCTTGGCAC GCCTTGACGC AAAGCCTAAC GACCGATTTG GATCCTCCAC CACCCACGGT TGTTTCACTG GATGATCCGA ATCGTGCGGA TCGGGACGCC GACGAAGAGG ACGCTCCGCC CGTAGCCTTG GTCACTCCCA TGCGCGGTCA TCACGCACTC TTCCGAGCCC TCGAACGACT CGAAGGGAAT GACAAACGCG AAGAAACCCT GGTGACGCGG GCCCCTTTGA CGAATACTCT CACGAGCGTG GTTTTGCAGG ATTCGTACAC GGACAGCGCG GCGGCGCTGG CGGCCTTGCA AGCGGCGGGT TCCAGTTTTC CGCGCAAGGT CTGTCAACAT CCGTTTCGGA AAAACGATAT CGTCTGGGTC TGCCGGACCT GTCAAGCCGA CGAAACTTGC GTTCTTTGTC ACAACTGTTT TTCCCAGTCG AATCATGAAG GTCACGACGT TGCCTTTTAT CACGCTCAAG CCGGTGGTTG TTGCGATTGC GGAGATCCGG ATGGTACGTA CGAACAGTAG GGAATTGTGT TTGGAATACT ACCTTACGGT TCTTTGGTGG CGCGAATCGT CCGGACGCCA AGTATCTCAT TGTCTTGTTT CGTCTTGTGG CTCCAGCTTG GGACCCCGAC GGGTTTTGTC CCCACCATGG CCCCAACGCC GTAAACGCCA GTGATGCGCG GCTGCCGGAA GGCTTCACGC ATCGGGTCCA AGGCGTCGTC CCCGCCGTCG TGGATTGGAT GGTCCGGGAC GTGGCAGCTG TGGGGGAAAC CGGTTACGCC CGATGTCACG CTCCAACACC CTTACGGCGT GCAAGCTCGG ATCACGAAGA CCTGCCCCCA TCGTTACCGA GTTTGGCACA ACGCTCCGCG TCCATGCCGC TGGAACAGGA GCTCCAGTCG GTTGGGATGG ACATTGATGA GGAACTCTTT GCGGAGACCG CAGAAAACGA TGAAACGGCC GCCGACGACG AAGACCTCTT GTCGGATCGG TTTCATTCGG TGGTCGATTC CTTGCGTATG TACGGCCCTT CCAACTTCAA TCACCGCACT CACGTCTTTT CGCCGACGGC GGCGTCGAAA AGTGGGGCCG CGGCCGATAG TCTCCGGCGC CACGAAGCCC GGGCCGAAGC TTTGGGGCGT AAAGGTGCTA CGGGTGGTGG TTTGTTCCTG GTACTACACC ACGATGACGT TCACACTTCA CAGTCTTGGC TAGACGCGTT ACGTGAATTT CTGGGATCTA CCAATTATTA TACGGACACT TTGTTGGGTA AACTCATCAA AGCGTTGAAA ACGTACGGGC AGCTGATCGT TTGGGGGACG ATGGAAGTCG CCACGGAGGT GGGCTGGACT CAGGTGCAAC TTTGGATGGA TGGTGACAAG GTGGCATCGA CAAGAATTGG GGCACTCTTG CTGGAGCATG CGAGCCGCTT AACTCGGCAC GGGGCATTTT GTAGTATTTT GACCCGTGAG GAGCTGTGGT TGGAGCAAAA GGCAGTTGGC GTCTTGCAGT GGTTGTCCAA ACTCGCCGAA TCCTGTCACC CGTTTTGCAA TACTGTTGCG GCCTGCATTT TACCCAACCG CCATCTAGTG CCTCTGCTCG AGGCGGATTT CAAAATGAGT GCGCGGGTTA TGAAAGCTTG GTATTCGCTT TTGCTGACTT TGCTGGCGGT CCCGACCTTC AAGTCGCACC TGGCAGCGGC GTATTGCGAT ACATATCGGA GCGTGACTGC CAAATATGCG CGTGGTATGG GCGTCTTGGA ACGCAGTGGC TACCAATTGT CGGTGCAGTT CTTGAACCGT GTGCAGTACG TGGTGGATCT CGTCCAGGGT CGGGACTTGC TGGGAAAGTT GGGCAAGACT CTACTGGACA CCTTGCTGGT TGCTCGCCGA CCTCGGAACC TAAACGGGCG ACTCAATCCG AATCATCATG TTTTGGCTCA TCGCCGATAT TCGCCGTGTA TTTCTGATCT CAAGTGCGTA CTGAATGTCA AGGGGATGCC GCGGCTCTTT GCAGCCAAAG GGGGCACCTT TTTGGGCGAC TGGATTGAAG TCTTGAGCAT TGCACAGTTT ATGGATCCAC AGAGTTGGAG GCACTGGACT TTCGGACACG TCGAAGATGA ATCTCGTGGA TGGGTGGGTG CGTTTAATGC TAGCATTTCG CTTGGTAGTA TTTTTGAACG CTTGCTAAGC TGGGAAGATG AAGAGCCGTC GCCGATCACC GATTCGACTT CCCCTCTTTC ACGAAATCTC ATGCCTTGTC TCGAGCTTAC GTTTCACATT CTGGTGGAGG GGGTATCTCG GTGGCAGAAA TCGGAAGTCT TATTTTACGA TTCAACTCCA AACTCTTCCG TAATCGAAGT CCACAAACGA TGTCCCGCAA GTCTTCCCTT TTCGACCATT GCAGCACAAC GAGGGACAGC ACTGGCAATG AGGCAGCTTC CCATTTCCCA AGTTACCCCT TTTAGCTTTC ACTTGCCTTT GCACCGATTC GTAGCTGGAT GTATACGAGA GCTGTGCCTA CGTAAGCATG ATCTTACGGG CGGGATGGCT GGCCTTATGG AGCTGTTGCG AAGCGAACTG TCTCCGAGAG ATCAAGACGA GCTTTTTCGC GGATTAATGG AGTTTCCTGT ACTCGTCTTA TCCAGGGCAT CCCAGATTCG TGCGTCGCTT TGGCGCCGCA ACGGGCCTGC CCTGGGCGAT CAAGTGTTGA
|
Protein sequence | MEMSSESDEP PSSPSAVAPS QSEEIRLDPD QQAAYDAAWH ALTQSLTTDL DPPPPTVVSL DDPNRADRDA DEEDAPPVAL VTPMRGHHAL FRALERLEGN DKREETLVTR APLTNTLTSV VLQDSYTDSA AALAALQAAG SSFPRKVCQH PFRKNDIVWV CRTCQADETC VLCHNCFSQS NHEGHDVAFY HAQAGGCCDC GDPDAWDPDG FCPHHGPNAV NASDARLPEG FTHRVQGVVP AVVDWMVRDV AAVGETGYAR CHAPTPLRRA SSDHEDLPPS LPSLAQRSAS MPLEQELQSV GMDIDEELFA ETAENDETAA DDEDLLSDRF HSVVDSLRMY GPSNFNHRTH VFSPTAASKS GAAADSLRRH EARAEALGRK GATGGGLFLV LHHDDVHTSQ SWLDALREFL GSTNYYTDTL LGKLIKALKT YGQLIVWGTM EVATEVGWTQ VQLWMDGDKV ASTRIGALLL EHASRLTRHG AFCSILTREE LWLEQKAVGV LQWLSKLAES CHPFCNTVAA CILPNRHLVP LLEADFKMSA RVMKAWYSLL LTLLAVPTFK SHLAAAYCDT YRSVTAKYAR GMGVLERSGY QLSVQFLNRV QYVVDLVQGR DLLGKLGKTL LDTLLVARRP RNLNGRLNPN HHVLAHRRYS PCISDLKCVL NVKGMPRLFA AKGGTFLGDW IEVLSIAQFM DPQSWRHWTF GHVEDESRGW VGAFNASISL GSIFERLLSW EDEEPSPITD STSPLSRNLM PCLELTFHIL VEGVSRWQKS EVLFYDSTPN SSVIEVHKRC PASLPFSTIA AQRGTALAMR QLPISQVTPF SFHLPLHRFV AGCIRELCLR HPRFVRRFGA ATGLPWAIKC
|
| |