Gene Information |
Locus tag | PHATRDRAFT_32848 |
Symbol | |
ID | 7197476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 930098 |
End bp | 932155 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178036 |
Protein GI | 219112569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
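The Gene Length and Protein Length fields above are consistent with the Start bp and End bp coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS includes its terminal stop codon (the function names are illustrative and not part of the source database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(930098, 932155)
print(length)                     # 2058
print(protein_length_aa(length))  # 685
```

The strand does not affect this calculation; it only determines which genomic strand the coding sequence is read from.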
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.270203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCATC TGCTAGCATT GATGGAGGAC GAGCCTCTGG AAGACAACAG CAAAAGTTCC CACTCAAAGA TACCAGAACA TATCGTCCGA CAAGCCCGCA ATGTTTCCCT AGAACAAAAG ACCACACCCG CGCTAGCAAC TGACATTCAG GATGCCGTGC ATCTTGAACC GCAGCACCAA TTGTCTCTGA GTGTGGACGG TCGAGTTGGT ATTCGTATGA TCAAGCGAAA GATCTCGGGG TCGCAGTTGT TGGACATTCT AACAGCACAT CCTTACCAAT CACCGGCAAG CTTGGCGGCC ATGAGTCTCG AGTATTTGAA TCGTTTGCTG CTAGAGCCGG CCTCAATAAT AGACCAAGCA ACTGTTTGCG GGCGTACCAA CGTGATGACG GTCGGCATCG TGTTTTCCAA CTCGGGTACA CGAGTTGCCT CTTCGGGAAA TGCTTATTGT GTTGTTACTC TCGGATCACT AAATACGGGT CCAGCTGTTT CCGTATTGTT GTTTGGTTCC GCTTACGGTA AGCACTGCCG TAGCTGTATC CCTGGTAAAG TGGTGGCCTT GGTCAATCCT CGCTTAATTC CTGCTAAAGG TGCAGCCCAG GGAGACACCT CCATTTCGTT TTCTGTCAAC GAGGAACGTC AGCTTTTGGA CGTCGCTGAT GCCCGTGACT ACGGAACTTG CAAGGCTGCG GTCCGGGGGA AAAACGATAA CGGTCATTGG GTTGCTGGTG GTAAATTTTG CGGTCACTTC GTGGATAAGC GCATCAGTGA GTATTGCAAT CCACACCGGA AACAGGCCAA CGTCAAGACG GGCACAGCCA ACCACACCTC CACATCCGCC CTCCAAAACC TCCGAAATCA AGCCGTCGCT TTCCCTAGAA TTCAGACTAG AGTGATGATA CCACCTGGTG GTGTTAAACC CTTTCAGACA CCAAAGCAAC AAACGAAATT GCGATCCGCC CAAATGATGT CTGACTTTCT AGCTCAGTCT ACCGCCCCTG GGGCGGGATG CTTACTTCCT TTCCAGCAGC CAACACTATC ACGCAGCCAA CCTGCTATGA ACAAAAATAC TATTTTGAAT CCGAAACAGT CAGCTACTAA GTCAGTCGAA ATTCCAGGGA GAGGATTGCT CAATCCGTAC GCCAAAGGAG CATCTTCCAC CGCATCGTCG CACGCGAGGG GTGGGTTTTC TCCACCCAAT TCCGTACGTA TCCAGGACAC CACCATACGA AAACCGTTTA CAGTCAACCG GGTTAGCCCG CCGGCTTCAA CCTCGCACTC GGTCACAGAA GATTGGTTAC AAAAAGCTTC CAAGAAGCGC ACTCCACTTG GCAACGCACA TAACCAAAAA CGCCAGCGCA GATTTGTCAA TACCGACACA AGAAACTTCA ACGGGTCCGT ACCAGTTCCT AAGCCCTCAC AGATGTTTCA GACTGCACGC ATAACCAAGC GAGTACCTAT GATACAGACG AGCGATGTCA AGGAAAAGGC AAGGGCGGCC CAGGTTCTGT CGCAACAACA GATCTTGGCG TGTCGACTGC GGGAACAGGA TGGCGGAGGT AGCCAGAGTT CCGTTTTTAA AGCGTCTCGC CCATTATCAG GGGCCGATTG CTCAGCTGTA AGACAGCAAG ACCGAAGGGA GGAATTTTAT GCGTTGCTTG ATGACATCGA CATTCAAAAT GCTTCCGCCG CTACAAGCCA ATTCGCGGAC GAGGTCAGTG CAGAAGAATA TGCCAGAAGC CGGCGCGTCG TTACTGAACT CGAGGAACAA GAAGGAAAGA AACAGAGCAA GGTGGCCAAG 
TCGAAGACCC TCGGTGACAA GGATAAGACA GCTATTCGAA AAGAATGGTA CTGTCAGCAA TGCCGAAAAT CGTCCCCGTT CAAACCAGCT GGATGTGTAC GCCGAGGACA CAGCGTTGCC ACGAAGCGAG AGATTGTTCA AGCCAAATCT ACATCTGAAC GACGCCTGGA TTTGGCTAGC AAGGATGCCA ATGATGGCGG CTTAACTCTA GGCAGTGGTA TTGAATGGTC CGCAAATAGA TGGAGTCGCT TCAACTAA
|
Protein sequence | MDHLLALMED EPLEDNSKSS HSKIPEHIVR QARNVSLEQK TTPALATDIQ DAVHLEPQHQ LSLSVDGRVG IRMIKRKISG SQLLDILTAH PYQSPASLAA MSLEYLNRLL LEPASIIDQA TVCGRTNVMT VGIVFSNSGT RVASSGNAYC VVTLGSLNTG PAVSVLLFGS AYGKHCRSCI PGKVVALVNP RLIPAKGAAQ GDTSISFSVN EERQLLDVAD ARDYGTCKAA VRGKNDNGHW VAGGKFCGHF VDKRISEYCN PHRKQANVKT GTANHTSTSA LQNLRNQAVA FPRIQTRVMI PPGGVKPFQT PKQQTKLRSA QMMSDFLAQS TAPGAGCLLP FQQPTLSRSQ PAMNKNTILN PKQSATKSVE IPGRGLLNPY AKGASSTASS HARGGFSPPN SVRIQDTTIR KPFTVNRVSP PASTSHSVTE DWLQKASKKR TPLGNAHNQK RQRRFVNTDT RNFNGSVPVP KPSQMFQTAR ITKRVPMIQT SDVKEKARAA QVLSQQQILA CRLREQDGGG SQSSVFKASR PLSGADCSAV RQQDRREEFY ALLDDIDIQN ASAATSQFAD EVSAEEYARS RRVVTELEEQ EGKKQSKVAK SKTLGDKDKT AIRKEWYCQQ CRKSSPFKPA GCVRRGHSVA TKREIVQAKS TSERRLDLAS KDANDGGLTL GSGIEWSANR WSRFN
|
| |
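The gene sequence above is printed in space-separated blocks of 10 bases, already in coding orientation (it begins with ATG). The following is a hedged sketch of how such a field could be cleaned, checked against the reported GC content, and translated. It assumes the standard genetic code, because the Translation table field of the record is empty, and the helper names are hypothetical:

```python
# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record does not state a translation table).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
head = clean_sequence("ATGGATCATC TGCTAGCATT GATGGAGGAC")
print(translate(head))  # MDHLLALMED
```

Running `gc_percent` and `translate` on the full cleaned gene sequence allows a comparison with the GC content and Protein sequence fields of this record.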