Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44190 |
Symbol | |
ID | 7204106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1242962 |
End bp | 1244947 |
Gene Length | 1986 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186213 |
Protein GI | 219113259 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.694416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCA ACCGGTATTC TTTATCCGGT AAACGTCTCG ATGGCCTTCT CCCTTTTATC GAAAGCGAAA TGATGGTGAA GGTAGATATT TCAAAGGAGG CATTTTTAGA TCGCTGTGAC TTCAGCGAAG CAAGCTCCCA ACCTATCGAT GTAGACTTGT TGGGAAAAGC AAGTTCTTTT GTTTCCATCT CCGTTGCTGA AAGAATTCGT CATGGAACAG GGCACTGTCA AGAAGACATA TACGCTATTC GTTCTGGCGA AGCCCTTCGG ATCCCTGATG CCGTTGCGTG GCCCATCAAT GAAGATCACC TTTGTGAAAT TGTATCGCTG GCAAAGGCTC ACCTTTGGTG TGTTATACCT TTTGGGGGAG GAACAAACGT ATCTCAAGCT ACTCGTTGTC CTCCTATACA GGTCGAACCG CGCCCCATTA TTTCAGTTGA TATGACCCAG ATGAATCGAA TCTTGTGGGT GAGCGAAGAG AATGGACTAG CCCATGTAGA AGCTGGGATC ACTGGCCGAG CTCTAGTTGA AGAAATGAAT CGACGGGGAT TTACGATTGG GCACGAACCA GATAGCTATG AATTTAGTAC GCTCGGTGGT TGGATAGCAA CGAAAGCCAG CGGAATGAAA AGGAATAAGT ATGGCAACAT AGAGGATATA GTCAGGGGAG TTCGAGTTGT TGGCGCGAAC GGGGTCTTAG CTCATGGATA CAAAGGAGCA AACGGATGCG GCCGCGAAGC AGGCAGCTTT GACATAACTT CATTGATCAT TGGATCAGAA GGTTGTCTAG GTGTCATTGG GAGCGCAGTC ATTCGTATTT GGCGACTACC CAAGAAAAAG GACTTTGGAA GTGTTCTCTT CCCCGACTTT GAGCACGGCA TTCGTTTCAC GAAAAACGTC TCAGAGCTTG GTCGAATGAT ACCTGCAAGT TGCCGCTTGC TGGACAATGA GCATTTTCGC TTGGGACACG CCCTTAGACC AGAGTCAGAA TCGATTATTG ACACAGCCAA GCGAGCAATA ACTTCTTTCA TCGCAAGTAT TTCGTTTTCC TTGGACCCTA AAAAGGTTGT CTGTGCCACA ATTTTATATG AAGGATCGTC AGAAGAAGTG AACAATCAGA AAAAAGCCAT CGGAAGACTG TCTCGCATGC ATGGAGGAGT TCAGGTCGGA GCAAGTATTG GTCGAGCAGG GTATGAATTG ACTTTCATGA TCGCCTATCT TAGGGACTTC GCTATGTCAT ATCATTTCCT TGGAGAGTCC TTTGAGACTT TTGTTCCTTG GTCGGGACTG TTGGCTCTTA TTACCGCCAC GAAAGAGCGT ATTTTGAAGG AACACAAGGC TCGTGATCTC CCTGGGAAGC CATTCGTTGG TTGCAGAGTG ACCCAACTAT ATCACGAAGG AGTATGTTTG TACTTTTACT TTTGCATGAG CTTCAAAAAT GTTGCCAGTG CAAGCGCTGT CTTTACCGAG ATTGAGCATG CCGCTCGTGA AGAAATCTTG GAGCACGGAG GATTGCTCAG CCATCATCAC GGAGTTGGAA AGGTTCGGTC ATCGTTCCTT CAAGCCATCA ACTCACCCGC CCTACAGGAA GCCGCGCTCT TGACTAAACA CGCGTTTGAC CCAGAGAATA TATTTGCTGC CCGCAATGGT GCATATGCCT CTTAGGTCTG AACGTTTCAG CAAGGATATC CAGAATTACC GAATCTTGAA ACATAAACAG AAGATTTAAA ATATTATCCC TTGAGGATTA TACAAAGATT GAACGTCGAA AAGAAACGCA TTAATGCACT CGAGATCTGA ACAGATGCTT GCGAGAGTGG TGCTTTCGAT TCTCGCTACT CTCATCCGAT GACAAATCGT CGTTGTAATC CCTTCGTGTC GGATTCACTT CGTTAATTAC CATCGAACTT TCTCCTGTCT CCTTTGTTTG CAAGACTTGC TGCTGTTTGT GGACATCTTC CATGGGATGG ACTGTTCGTT TGTAAG
|
Protein sequence | MKGNRYSLSG KRLDGLLPFI ESEMMVKVDI SKEAFLDRCD FSEASSQPID VDLLGKASSF VSISVAERIR HGTGHCQEDI YAIRSGEALR IPDAVAWPIN EDHLCEIVSL AKAHLWCVIP FGGGTNVSQA TRCPPIQVEP RPIISVDMTQ MNRILWVSEE NGLAHVEAGI TGRALVEEMN RRGFTIGHEP DSYEFSTLGG WIATKASGMK RNKYGNIEDI VRGVRVVGAN GVLAHGYKGA NGCGREAGSF DITSLIIGSE GCLGVIGSAV IRIWRLPKKK DFGSVLFPDF EHGIRFTKNV SELGRMIPAS CRLLDNEHFR LGHALRPESE SIIDTAKRAI TSFIASISFS LDPKKVVCAT ILYEGSSEEV NNQKKAIGRL SRMHGGVQVG ASIGRAGYEL TFMIAYLRDF AMSYHFLGES FETFVPWSGL LALITATKER ILKEHKARDL PGKPFVGCRV TQLYHEGVCL YFYFCMSFKN VASASAVFTE IEHAAREEIL EHGGLLSHHH GVGKVRSSFL QAINSPALQE AALLTKHAFD PENIFAARNG AYAS
|
| |