Gene Information |
Locus tag | PHATRDRAFT_45785 |
Symbol | |
ID | 7200799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 264259 |
End bp | 266837 |
Gene Length | 2579 bp |
Protein Length | 818 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180205 |
Protein GI | 219118877 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0670392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GGATTATCTC GTAGTAGGAT CCCCAAGGAT GACGAGACTT TCACTACACT ATGTAACAAC ACAGCTATAC ATTGTGTCCG TAAAGAAACG ATGCGTTTTC GAACGAAAAG ACGAGGTCTA CAATGAATCA CAAGCGCCAT CGTCGACGGA GTCCTCACAG CCCACCCTCT TCTCGCTGGG CACCGTCCAT ATCCATGCCG ATGCTTGGCT GGGTCCTGGC GGTACTCTTG GTGCGATCGG TTGATGCTTT CTTGCCACTT TTCTATCCTA CGAAAATCCG GACCACGAGT CGGAATGGTC TAGTGCGTTC TGCGGAACGA TGGCAACAGG GTGATCGTTC TGCCCGTTCC GGCTCGCCTC CCGTTGGTAC CACAACGAAA CCCCGTCTCG CCGCTGTAAA CCGACCTACC CAGCTTCGAT GGATCTTCCA GAGTATTGTA AAGTTACAGC AGCAAAGCCA AAGTAATCCC GGACCCGAGG CACAACCCTT GCTGGACGCT GTCGCTGGAC TGGTCCAAGC GCATTCACCC CAACAGGTGT TGTATGCCGG AACCATGTTG ACCGCACTGG ATCTCTCCAG ACAGCCCTTG GCGATCCAAG AGCGTGTCGC GAAGGCTACA GCCATGACGG GGTTACTCCA TATTGCTTTA AACCTTACCG ACAACATGTT GAAGGAGAAT ATACTGCCCT CGGAGGTGGC ACAGGATGCC GTCAGCAGCG GATTACGTCA AACGGGTCGC GTACATCGGT TGGAACGATT TTTACTGGAA GTGGGACGCG TTGCGATATT ACACAAGCAA CACGTCAGTC AGTCATCCTT TAACCTATAC TTGGCTGCTC TCTGTGACGC CGCAACGGAC AAGGACCCCG AGTCGAGAAG GGCTCCTCAG AATGGGGTGG TGTTGATGGA CCGAGCCTGC CAATGGCTTT CGCCGAAGTC TTCGTCGCAC CAAACGCAAG CTCTCCTCGG TTTGATGCCG GATGCGGTTT CGTATGCTAC GGTGCTGCAC GCCGCCGCGA CGTTGGACAA TCACACACTC TCGCGTCACG TTTGGACGTA CGCGCAACAG TGTGGGGTTT CTCCGAACAT TAACGCCTAC AACGCCCGGC TAGCCGCCGT TCCTAATGAC ACACAAGCCT TGCAACTGTA CGAGACAATT CTTCAGGATC GACGTGTGCG GCTGGATCGC TACACGATTG ATCTGGTTTT ACTACCTTTG GTGCGAGCGG GTCGCATCGG AGATGTAGAG TCTCTTTTGG ATAGATTCGT TTCGAATAAT TCGGAACACG TTGTGTCTCA AGGCTTCAGC GCCTTTTTGT CTACCATCAT CCGGGGAGGA GATGTGGCGT CGGCTCGAGC CATTTTTGAT ACGTACATGC TCCCGACGCT TGCCGCCGTC ATTGAAACGG ATGCCGGTTC CATGCGATTG GTGCGGCCGC AAACGCGACA CTTCAATATA CTATTGGAAG GATACCGGAA GAAGTATGTC GGATCTCGCG AGAGCTCTGC CTCAAGTGAA GTGGACGATG CAACGGCCGT CATTTGTGAG GAAGGCTGGC AGCTCTACGC ATTGATGGTA AACTCTTCTA GAATTGTTCC CGACCCTTTC ACAGTCACGT CGATGATGGG ATTGAGTCGT TCTCCTTCAG AATTGACAGA TATTCTGGAC AACGCAATTA ATGGCTTTGG CTTGAACTGC TCTTCCGTAG TCCTTCGTGC TGCTGTCACA GCGTATGGAG AACTAGGCGA TCCAGCCAGC GCTTGTCAAT TATTCGCGGA TTATGTTGAG CAGCCAGTTT 
CCACTAGAAA CTGGAATGCG TTGCTCGGCG CAATTGCCAT GGGAGCTGGC CACAACGCGA CAAAACGCCT TGACCTTCTT TCAGCGAATG TTGCAACGAA AGTTGGAAGA GACAATTCTG GTAAATCTAA AAACAAGATC TCTGATTCCA TCCAAGGCCT TTGGTGCTCG GAGGCAGTCA AACACCTGCT CGGGGAAATT CCTGATCCGA CTTCACAGAC ATACTGCATC GCCGCCAAGG CACTTCAATA TGGATCAAGC AATGCCATTA CAGCCGAACA GTTGTTTCGC AACGCAACCT TGGCCGGTAT CTCTGCGGAC GGACGCTTTG TAAACGCCGT CCTAAGATGC TTTGGCGGCA ATGTTGATGC CGCTTTGAAG TTCTGGAAGG ATGACTGTAG GGGGGCCTGT ATTCAACACG AAAGTCGAGC ACGTTTCAAG TCGCCATCCC GGAGCAAAGG AAAGAACCTA ATTGCTGCTT ACAATGGTCT TCTGTACGTC TGTGGTAGGG CGCTTCGTCC CGATGTTGCA CTTCGCCTTG TCTATGCCAT GTCAAAGGAG GGACTTGAAC CTGATGAATT GTCTCTAAAT TGTTACAAAT CGGGCAAGAA AATACAGCAG AACATGCCAG CAAATACGAG GGCAAAACTT GCTCAAAGTT TGAAGCAAAC GCTGAACTTG GTCGATTCAT ATGAGGCTCT ACTCTACATA GAGTGTATGA AGTATGACCA GAACGACAGA CGACGGACAG GAGAGAAAAG AGTCCGAATT ATTGTATAA
Protein sequence | MNHKRHRRRS PHSPPSSRWA PSISMPMLGW VLAVLLVRSV DAFLPLFYPT KIRTTSRNGL VRSAERWQQG DRSARSGSPP VGTTTKPRLA AVNRPTQLRW IFQSIVKLQQ QSQSNPGPEA QPLLDAVAGL VQAHSPQQVL YAGTMLTALD LSRQPLAIQE RVAKATAMTG LLHIALNLTD NMLKENILPS EVAQDAVSSG LRQTGRVHRL ERFLLEVGRV AILHKQHVSQ SSFNLYLAAL CDAATDKDPE SRRAPQNGVV LMDRACQWLS PKSSSHQTQA LLGLMPDAVS YATVLHAAAT LDNHTLSRHV WTYAQQCGVS PNINAYNARL AAVPNDTQAL QLYETILQDR RVRLDRYTID LVLLPLVRAG RIGDVESLLD RFVSNNSEHV VSQGFSAFLS TIIRGGDVAS ARAIFDTYML PTLAAVIETD AGSMRLVRPQ TRHFNILLEG YRKKYVGSRE SSASSEVDDA TAVICEEGWQ LYALMVNSSR IVPDPFTVTS MMGLSRSPSE LTDILDNAIN GFGLNCSSVV LRAAVTAYGE LGDPASACQL FADYVEQPVS TRNWNALLGA IAMGAGHNAT KRLDLLSANV ATKVGRDNSG KSKNKISDSI QGLWCSEAVK HLLGEIPDPT SQTYCIAAKA LQYGSSNAIT AEQLFRNATL AGISADGRFV NAVLRCFGGN VDAALKFWKD DCRGACIQHE SRARFKSPSR SKGKNLIAAY NGLLYVCGRA LRPDVALRLV YAMSKEGLEP DELSLNCYKS GKKIQQNMPA NTRAKLAQSL KQTLNLVDSY EALLYIECMK YDQNDRRRTG EKRVRIIV