Gene Information |
Locus tag | PHATRDRAFT_41473 |
Symbol | |
ID | 7199233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 253785 |
End bp | 254954 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185405 |
Protein GI | 219130507 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0185557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCCTATC GACTACCGCC ACCGTTATTT TCTCTTATGA GACAGGGTGT AGTCCTAGAG ATCATTAACG ATAATCCTCG AAGTCCGCAA AAGTCGGCGA CGGCAGCTTC ATATCTCAGA AGAAAACTTT TGCCGACTTC TGTTCGCTCC TGTGAAGGAG CTTACATTCC AACACGGCTC CCTCGACGAA GCAATAGAAG TAGCGCGAGC GATACCATGT TGTGTGGCGC TCCTCGCAAT AGCAAGGATC GGAACAAGCA ATCTCGTTGG GCCGCCTGCG CAATACATCC TCTTCTCCTG GAAGAACGTC GATTGTTACT GTTGAGGGAA GTTCAAAGAA CAGACACAGC TTTTCCATCC GATCACGACG ATGGAGGTTC TAATTCCTCG AGTAGTCTAG CATCTGTGTT CCAAATAGAA AGTGGTTGTA TAGACATGCG ATTCCGCCGA CGATCACTGC GACACACCAG CGCTTTGATT GAGCACATAC TTTCCCAACG AAATACGGCG CAAGCCCAAA GTCCGAAGGT TCCCACCCGG CGTCGGTCGG TAGAGCAAGA AAAAGAGCGA CAGCAAAGTC CACACAAAAA CGAAGCCCAT CAAATTACTG CGGATTTTAT TCTTAATAAG ATGCAGGTTC TGGATTTTGC GGGACGACGC GAAAGCCCAC CCTGTAAACC TACCCGCCGT TGTAGCGACG AACATCAACG ATATATGACG CAGCAAGCGA TTGCCCAAGT GTTGGACGAA ATGGAGGACG ACAATGACTG CAAAAAAGAC GACGATGACG ACGAAAATGA CAACGAGGAA GGAATACTCG AGAAAAAGTC GGCGAATGAA AATCGGCGTC TCAGCGTACA ACGAAACAAG GAGACTGCCG CTATCGCGGA AGCCTTGTCG GAAATGAATC TGGATAGTGA TGAAAACGCA GCGCAGGAAG ACTCTACCTT TACCAGGGGA GGTCAACTCC ATCACACCGC AGGATCCGAT ATACAAACAG GCCACACCTC TTTGAGAACA GCGCTTCGTA GAGCCAGCTT TCGTCCCAGT TTTACCCCTG AAATAACCGA AACCGCCTTA GCACAAGCCC TGGTAGAGCT TGATGACCAA TCGGAAAGTG GCGGAGACAC CAGCCCCGAC TGCGTTGCGC TCGTCCCCGC AGCAATCTAA
Protein sequence | MAYRLPPPLF SLMRQGVVLE IINDNPRSPQ KSATAASYLR RKLLPTSVRS CEGAYIPTRL PRRSNRSSAS DTMLCGAPRN SKDRNKQSRW AACAIHPLLL EERRLLLLRE VQRTDTAFPS DHDDGGSNSS SSLASVFQIE SGCIDMRFRR RSLRHTSALI EHILSQRNTA QAQSPKVPTR RRSVEQEKER QQSPHKNEAH QITADFILNK MQVLDFAGRR ESPPCKPTRR CSDEHQRYMT QQAIAQVLDE MEDDNDCKKD DDDDENDNEE GILEKKSANE NRRLSVQRNK ETAAIAEALS EMNLDSDENA AQEDSTFTRG GQLHHTAGSD IQTGHTSLRT ALRRASFRPS FTPEITETAL AQALVELDDQ SESGGDTSPD CVALVPAAI
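The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source database. It assumes 1-based inclusive coordinates, an unspliced CDS (the record lists "Is gene spliced | No"), and a gene length that includes the terminal stop codon. All function names are hypothetical.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks."""
    return "".join(seq.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = strip_blocks(seq).upper()
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(253785, 254954)
    print(length)                           # 1170, matches "Gene Length"
    print(expected_protein_length(length))  # 389, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` should give a value that rounds to the listed GC content, provided the sequence was copied without loss.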