Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47512 |
Symbol | |
ID | 7202288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 854246 |
End bp | 856632 |
Gene Length | 2387 bp |
Protein Length | 733 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181818 |
Protein GI | 219122991 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGG TCGATCCCGG GGAACTAGCC TGGCGTCGAC GTTGGGCCAA AGCTTTCCCC ACACTTGTGC CGCACTTGAT GGATGTCGGG CCGTCGCCAG CGACCTTGCC TGTACCGATT ACGAAATTGC CGTATCCGGA AGAATTTACA GCTACAACCG ACAATATGGA TCATTTGTGG TGGCAGCGTC TGGCTTGGCG AACCTTTTGG GACGAGTACA CGGACGACCG ATCGCTCCCG GACGTGGCGT CCGACGATTG TATCCCCAAT ATTAACAGCA ACAGCAACAA CACTGTAAGC AACGCCAACA TCAACAACAT CACAGGCCTT CCTCCCGAAG ATCCCCCTTC CGTACTCCGC AACATTGCGA CCTGGAATCA GAGTTTGGGT TACGGAGAAA TCACCGCCGA AGCGACCTGG ACGTTGGTGC GTACCATCCA ACCGTACCTG CCATCATCCC AATCTCTGAC AGCCGTGGAT TTGGGTAGTG GGAACGGTAA GGTCTTACTG GCGGCAGCCT TTGCCTATCC CTTTCGTAAA CTCGTAGGCC TGGAACTCCT GCCCGGTCTC CACGAAGACG CCCTGCGGTA CCAAGCCTAC TGGCAACAAC GGTGGAAAGG TACAGCCGCA TCCAATACAA CAATCCCGAC AACACTAGGA GCAGCAGCAA TATCGACAAC CACCACAGTA TCGAATCTCG ATCCGTCCAC CCTAGCCACA AAATCCAGCG AGACCGGCGC TCTTTCTCCG GCCGTGTTGG AATTCTACTG CGACGACTTT ACCAATAGTC CAAACGTCGA TTGGATCCAT CAAGCCGACG TGGTCTTTTG CCACGCCACC GTGTTTCAAA CAAAACTCTT GGAGCGTCTG CAAGTCTGTT GCGAACAAAC TCGTGAAGGG ACCTTGTTTT GTATGGTCAC CAAACCCTTG CAGTGCAACG CACGCATCGA AACGTTGGCC GAAATACGAC TCGATATGAG TTGGGGACGA GCGGCCGTCT ACGTGCAACG CCGTAGAATC CATCATCGAC TACCACCAAC CGAGACAGCG ACAGATTTGT CTCTTAATGT CACCACGACG ACTGTGTCAA CCACGTCTCC GAACGATCAA TCGGCCCCAC AGGCCACTGA ACTAAGAGCC AACGCCGATT CCTCCGACCG TGGCACTAAT TGTACCAGAC ACAGCATGGC TTCCCAAAAA CGACCAATGT AAGTATCGTC TCGTGCATTT ATTGGAACTT TTGTTGTACA CGAGAGACAT CCTTTCTCAA ACGGATTCGT TCGGCGATCC AATGTAGGCC TACAAGTTCT ACCGAGACGG CTGTAGCACA AAAGAAGCGT CTAGTGGCCG ATTCCGAAGC AACGCTTGCA GAATCTGCCC TGTCGTTGAA CGAGGTACCG GAAACAATGG AGTCACCGCC TCATTTGGTA AGCTTCGAAG CACCTTGTCC AAGAGGCGAG CCGAGAGAGG AAAAAGTGTG TTCGACAGTC GCGAGACCAA TCGGAGATGA GTACCGAATT TTGCATCTGG ACGATTGCCC CGATCTTATA CACCAAGTTG GCGGTGCCGT CCTTACGTGC CCACCAGATC TCGAATCCAT GCTGTGGGAG ATGGGCGACA GCGTTCCGGA TCCTACCAAA CTGACGGGAC AGCAACAGTA CAACCGAGTA TCGGGCAAAT CTTCCATGAG TGTACGGTAC CGTATGTACG ATGGCAGCAA TTTGCAGCAA TCGTTCGCCG CCAGTAGCAA TACCCGAACG GAAAAACAGT TCCTCGCCCG CTGCGGCCCA TCGCTCGATC TATTTGATGC GGTCCTGCGG CAATTTGTGC TGGAGCAAAA ATTTTGCAAG TCCTGGCACG AGTCATCAAC CATGAAGTAT CGCTTTTCGG TAATGTTCAC CGACAGTCAG GCCGTTCCAC AAAACGCTCA CATTGATTAC CAATGGGATG ATTTGGACGG ATCAGATCCC ATGCCGTATC TCGGATTTCT ACCGTTGACG AAAGCGGGTA TGTTTTTACA ACTGTGGACG GGTAATCCGG ATGACGGTGT GATCAAAATG GGGAACATCG TGTTTGTACC GTGGGGAAAG TTGCTCTTGG TTCCGGGCAA CACGGTCCAC GGCGGGGGCT TCCGTACGGG TAATCACGGC AATTTACGGG CGCACTTTTA CATTCACTTT GGCGTCCTCA AGGTCAACGC CAACAATCAC TACAAGAATC GGTACGGGTA CGATCTTTCC TTGACGCATC TGCACAATCC ATCCAACGAT TTTAGCAAGT TTTGGAACTA GATGAATGGA ATTGCTGTCG CAATCGCTAC GCAAACAGAT TGAATAGATT TTCCCTGGGG AGAAATTTGT AAAATATAAG CTTATGCTCA TTTTTTG
|
Protein sequence | MATVDPGELA WRRRWAKAFP TLVPHLMDVG PSPATLPVPI TKLPYPEEFT ATTDNMDHLW WQRLAWRTFW DEYTDDRSLP DVASDDCIPN INSNSNNTVS NANINNITGL PPEDPPSVLR NIATWNQSLG YGEITAEATW TLVRTIQPYL PSSQSLTAVD LGSGNGKVLL AAAFAYPFRK LVGLELLPGL HEDALRYQAY WQQRWKGTAA SNTTIPTTLG AAAISTTTTV SNLDPSTLAT KSSETGALSP AVLEFYCDDF TNSPNVDWIH QADVVFCHAT VFQTKLLERL QVCCEQTREG TLFCMVTKPL QCNARIETLA EIRLDMSWGR AAVYVQRRRI HHRLPPTETA TDLSLNVTTT TVSTTSPNDQ SAPQATELRA NADSSDRGTN CTRHSMASQK RPMPTSSTET AVAQKKRLVA DSEATLAESA LSLNEVPETM ESPPHLVSFE APCPRGEPRE EKVCSTVARP IGDEYRILHL DDCPDLIHQV GGAVLTCPPD LESMLWEMGD SVPDPTKLTG QQQYNRVSGK SSMSVRYRMY DGSNLQQSFA ASSNTRTEKQ FLARCGPSLD LFDAVLRQFV LEQKFCKSWH ESSTMKYRFS VMFTDSQAVP QNAHIDYQWD DLDGSDPMPY LGFLPLTKAG MFLQLWTGNP DDGVIKMGNI VFVPWGKLLL VPGNTVHGGG FRTGNHGNLR AHFYIHFGVL KVNANNHYKN RYGYDLSLTH LHNPSNDFSK FWN
|
| |