Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44300 |
Symbol | |
ID | 7197962 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 157355 |
End bp | 159294 |
Gene Length | 1940 bp |
Protein Length | 603 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178440 |
Protein GI | 219115289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTTT CGTTAAACAT TACGATAGCC GTTGGAGACA ACAGTTTTTC TACCAGGTCA GTTTCCCACA ACGGTCCCGA TCCCCGGTGG GGCAAGGGAA GGTCAGAGGG TTTTTCTTCA CGAATCCGTC TCGTTGGCGA CTCCCATCAT TCGGTGTTTG CTTCAACCCA TTTCTCGTTG AACAAGCATA CCTCGTACCC GCCCAGAACC GCGCCGCATC TTATTGCCAA CCTCCTTATA TTTTACCGGG GTACACTGCG GTGTTGTTTG TTTACAGTAA ACTCGTTCGA ATTTGGAGCT GCTTCGGTGC GGACCGGAAG GCTCGGCACT TTTCCTTTGG GCCATTGGCA CGTCACGATC ATGAAAGTCA AGTCAAGAAA GTATCGTCAA GGAAAAACGC CAGCTACATT ACTAGCACTG GCTGCTTCGA TGATTGCCAT TTCGGGCCGT GTTATTCTTA CGAGTACGAG AATGCCCACG ATGGACTACG ACGGAAGCGG GGAAACGCGT GTACTGCTAC GGAAGGAACC AACGCAAGGG CCAATGGCGC CATCGCAGCG TCGGAATCTG TCGACATTAG CCGTTGCTGC GCCGGCTGAA AACCACAGCA CAAAAACTAC CGAGACTAGT TTGTGCAACA CGCAAAGCCG GATTTCTTCC GCCGTTGACC CGGGCGATGA ATCGTTTTCG GATCCGATGG CACAGCAGGC CACAGACAGC ACGTTCTCGG CCTGTATTCT CTTCATGGAC GATAACCCTC GTCTTGTGGA ATGGATGGCG TACCACTACT ACGCATTGAA CCTTCGCGAA GTTGTCGTTG CCGTCGATCG TCGGAGTAAG ACGAGTCCCT GGGACTCTCT GAGGAGGTGG ACTCCCTACA TGAACATTAC TGTCTGGAAC GACGCCGACT ACGGCTATGT TGTCGACGAC GACATTGCAA TCAAGGGATC GGTTGCCCAA AAGACAGATG CACACAGAGA CCGACAGCGA TTCTTCTACA GACAATGCAC CAATCACTTG AAGCGTCGAC ACCGTACATG GACGGCTTTT TACGATGTGG ACGAGTACAT GACCATCGAC GAACGCTTGG TAACGAATGC CGAAGAACGA ATGGCCAAAC CAGGTAGCGT ATTGCAGATG ATCCAAGAGG TGCAGAAACG AGATCCAGTT CCGTTCGGTT GGAGAAGGAA TTGCGTCCTG ATCCCCCGAC GGCATTTTTC GGCCGCGCCG AGCCATCCCG AAGAAGTGAG CAAGCTTGTT CCATCTGTCG TAAACGCAGA CCAGCTGGAA ACACTGCGGT GGAGGTATCG CACTGAAGGG GGGACGGACG GCTATCCCAA GTCAATAGTG GACGTGTCAA AGATTACGGT TAAAGAAACA ACGAAGTTTG ACATTCACAG GGTCGTCAGC GATGCGTGTC CCAAGCCAAG GGCCGGTCAC ACATTCCTTA CCATTCATCA CTATTTGGGC GACTGGAATC TATAGTAAGT GAAGCGCGTT TGTATCACAG CAAAGCGGGA AACGTATCAC GTGTCCTAGA CCATGCCGAT ATTGCCGTTG CTTTTATGTG ATTGGTTCTC ACCCATGTTC AACACAATTC TTCTGGCCTC AGCTCGTACC GAGACGACGC CCGAAAAGGA GCCAGGAGAA GTAGACATGT GTGGGAGCTT CGAGCATTTC GAACCGAGGG TGGGACTACT GACCAAATCC GACCATGGAT CAGCGGTTTT GTAGCAGCCA TGGGAGAGGA TCGAGCTTCG TTGCTCTTGA AAGATGCTGG CGTTCTGGCC ATTTTGAATG TCACCAACGA TGAAGATTCG AAATGGGATG TTGAAATTGG TCAGCTTATG GACGATCTTC CCAAGAAGGA TCGTGATTTC CTTCAGCGTC TTCACACACG TTTCAATGTG TCCAAATCCG TCGTCTCTTC GGTCAAATGA
|
Protein sequence | MPLSLNITIA VGDNSFSTRS VSHNGPDPRW GKGRSEGFSS RIRLVGDSHH SVFASTHFSL NKHTSYPPRT APHLIANLLI FYRGTLRCCL FTVNSFEFGA ASVRTGRLGT FPLGHWHVTI MKVKSRKYRQ GKTPATLLAL AASMIAISGR VILTSTRMPT MDYDGSGETR VLLRKEPTQG PMAPSQRRNL STLAVAAPAE NHSTKTTETS LCNTQSRISS AVDPGDESFS DPMAQQATDS TFSACILFMD DNPRLVEWMA YHYYALNLRE VVVAVDRRSK TSPWDSLRRW TPYMNITVWN DADYGYVVDD DIAIKGSVAQ KTDAHRDRQR FFYRQCTNHL KRRHRTWTAF YDVDEYMTID ERLVTNAEER MAKPGSVLQM IQEVQKRDPV PFGWRRNCVL IPRRHFSAAP SHPEEVSKLV PSVVNADQLE TLRWRYRTEG GTDGYPKSIV DVSKITVKET TKFDIHRVVS DACPKPRAGH TFLTIHHYLG DWNLYSYRDD ARKGARRSRH VWELRAFRTE GGTTDQIRPW ISGFVAAMGE DRASLLLKDA GVLAILNVTN DEDSKWDVEI GQLMDDLPKK DRDFLQRLHT RFNVSKSVVS SVK
|
| |