Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47597 |
Symbol | |
ID | 7202653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 224543 |
End bp | 227078 |
Gene Length | 2536 bp |
Protein Length | 810 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182029 |
Protein GI | 219123433 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCA GCTTTTTCTA TGGCTTTCGA ACCCGACTGT TGATGCTGTT TCTCTTCACA ACTCTGTTTG GATTGGTATC AATTCACCTC ATCATTACCA GCTCAAATTC TGTTACGCAA TATGAACCAA GGCCAGCTCC AAGGAGAGAA AGTTCCGGGG TATTTCACGA CGCGAAAGCC CATCCAGAAT TTGCTCCAAA GGAAACAAAG CTGAAAGCGC TAGTTGGACT TCCAAAATAT GCACCCCCGA CGTCTTCTAT TATTCTCATT CGGGCTCTGG GCAACGCCTT GCCGCCAAGA CACAGTACAA ACCAGACTTT GGATAATCTC GACTTTATTC TTGCGCACGA GGATTCGTTT CCCAACACAA CGCGACACTG GTTCGTCAAT CGCTTCGTCG ATCCCGAGAT GGAAAATCAA GTTTTGGATA GACTTCGGAA AGCCCAGGAA TCCTACACTG TCATTCCTTT TGACTTGCAG ATCTATGACA AAATCGAATA TGCATACGAT CAAATCCCCA AAGACCAGAT TCATCTTCCT ACCACAAGGA GAATCACCAA GAAGGAAGTA CATCTTGCTG AAGGGCAGAT ACAGCACAAC AAAATCCTAT ACGTCATCAA CGTAAATGGG GTCAGGAACG CTATGTTAGA CTACGGCCGT ACTCATTCGA ATGCTGAATA CATTCTTCCT TGGGACGGGA ACTGTTTCAT GACGCGAAAT GCGTGGTCTT CGATCCAGTC GTCCTTGGCC GAGAATCCAC AGGCTAGGTA CTTTAAAACG CCCATGGATC GATTGCAGGA GTCAAACGAA GCTCTTCTAT CAGATACGTA CGAGCCCAAA CCTGTTGAAG AACCCCAAAT TATTTTTCAC CGAATGGCGA GGTCCAACTT TAACGAACAG CTAAGATATG GGCGAAGAAA CAAGGTCGAA TTGTTGGTGA GACTCGGAGT TCCAGGACCT TGGGATAAGT GGTCGTGGCT GGATAGTGAG ACGGCCATCT CAGATCGCGC CCATGCTTTC GACGCCGTCG GAGATGTACC AATAGCCGGG TGGATAACGC GGTTGAATTC AGGAACGTTC TTGGCTGAAA GGTCTGCCAA GGCTCGAGGC AGACTTCGAA ATAAAGCAGT CACCTTGCTT CTGGAACGAC TCGATTTTCG AGCTGCTCGG GACTTATATG GCTTGACTTC TTCAACTCTC CTTTTCTTCA ACGAAAAGCG ATTGTTGGTG GAACGCGCCG AGTGGAAGGC CGGCAAGAGA AAACAAATCT TTCGAGAGCT GGTACGGCTG GCTGACCAGG CGCTACTAGC TGGACCTTGG TCGGTCATGG ACAAGAAAAA ATTCGGTTGT GGTATTTCTG GGGATTGTCA TGATTACTTT CACCCCTCAC CGTACATGTG GCCGCAGAGG AATGAATCCG GGCACACTGA CTGGTCGAAA CCCTTCAAGC GACGTGACGG TGTGCGAGCT CCTGGTACAT CCTTATTCAG CTCCGGGAGT GAGCAGTATG ATCGATCCGG GTTGGCTGCA ATGAAGTACA ACACAACACT CCTTGCGTTG GCCTATTCGT TGACAGACAA CAAGGCATAT GTTGAGAAGG CAGCAAGCAA TCTCCGACAT TGGTTTATCC ACAATGCAAC ACGTATGAAT CCACATCTCA CCTACGCCCA GGTAAAATGG AAGGCAGATG CAACAGCAGT GGGATCATCG TATGGTCTTA TTGAGATGAA AGATGTGTAC TTCTTCTTGG ATGCGGTCAA AATTGTGGAA AAATCGGAAG CACTGTCACT ATTGGAACGC GACTCTATGC GTGAATGGTT TGCTGACTAT CTTGAGTGGT TGGTCTCAAG CTTGCAGGGT CAACAAGAAT TTGTCCAAGA CAACAACCAT GGTCTCTTTT ACGATGTCCA GGTTGCACCC ATTGCCTTGT ATACTGGCAA CATAGCATTG GCATTGTCAA GGATGCAACG ATCAGCTTCG CGTCTCTTGA CACATATAAA TACCACTACT GGTGTTCTTT CCCAGGAATT AATTCGTCCA ACGTGTGAGC ACTATCAAGC CTTTACTCTG CAAGGATGGG CTAACATGGC CCGCATGAGC AGAAAAATTG GCCTGGACTA CTGGGGCCGC TTCCGTGACA AGGCAACCAA TCAGAGTATC CTGTGTCAGG CAATGCGGTA TGCAAACCCT TACTTGCAAA AGCGAGAGAT ATGTCCCGGG AACTCACACA GCGAGGACGT GCGGCGATGG TGGCCTCTGC TTGTAGACTT CTCTCAGCAT TGCCAACAGC CCTCCAACGA AGGGTTGCTC AATGTCAGTG ACTGGATCCC TTCAGCGCTT CGGAATCCCG ATATAGATCG GTATTTGATG CCCCCTATGT ACGACTATGG AGATGGAATT GCTCCATTTT GGAATCTAGG GTATCATTGG TAAACACAGT TCACGTAGCA AGCTACCTGC AGTTGATGAT GTGATATTTG CGGGCACCAC CACTAGACTC CCGTAACAAA TAGATGAAAT TTCAATTTTC TTCTCT
|
Protein sequence | MARSFFYGFR TRLLMLFLFT TLFGLVSIHL IITSSNSVTQ YEPRPAPRRE SSGVFHDAKA HPEFAPKETK LKALVGLPKY APPTSSIILI RALGNALPPR HSTNQTLDNL DFILAHEDSF PNTTRHWFVN RFVDPEMENQ VLDRLRKAQE SYTVIPFDLQ IYDKIEYAYD QIPKDQIHLP TTRRITKKEV HLAEGQIQHN KILYVINVNG VRNAMLDYGR THSNAEYILP WDGNCFMTRN AWSSIQSSLA ENPQARYFKT PMDRLQESNE ALLSDTYEPK PVEEPQIIFH RMARSNFNEQ LRYGRRNKVE LLVRLGVPGP WDKWSWLDSE TAISDRAHAF DAVGDVPIAG WITRLNSGTF LAERSAKARG RLRNKAVTLL LERLDFRAAR DLYGLTSSTL LFFNEKRLLV ERAEWKAGKR KQIFRELVRL ADQALLAGPW SVMDKKKFGC GISGDCHDYF HPSPYMWPQR NESGHTDWSK PFKRRDGVRA PGTSLFSSGS EQYDRSGLAA MKYNTTLLAL AYSLTDNKAY VEKAASNLRH WFIHNATRMN PHLTYAQVKW KADATAVGSS YGLIEMKDVY FFLDAVKIVE KSEALSLLER DSMREWFADY LEWLVSSLQG QQEFVQDNNH GLFYDVQVAP IALYTGNIAL ALSRMQRSAS RLLTHINTTT GVLSQELIRP TCEHYQAFTL QGWANMARMS RKIGLDYWGR FRDKATNQSI LCQAMRYANP YLQKREICPG NSHSEDVRRW WPLLVDFSQH CQQPSNEGLL NVSDWIPSAL RNPDIDRYLM PPMYDYGDGI APFWNLGYHW
|
| |