Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44534 |
Symbol | |
ID | 7198067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 829471 |
End bp | 830881 |
Gene Length | 1411 bp |
Protein Length | 417 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178313 |
Protein GI | 219115035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.058235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAATGAGCA GTTGCACACG TGCGGGCTTT CCGATAAAGC GGACAGTCAT TTCATCAATT ACAAGCTTGA TTTGACACAA GTGAGCACTC TCCGTTTGAC TCAACGAGTA TGAATGAAAC ATATTCGCCT TTTGAGTGCT TACCAGTGGC TGACTTGACT TCGGCTGGTC TACTACAGAA GCAGGATGAT ATTGAAGTCG AGCTTATTGC CCGAGAAAAC GATTGCGAAT TGCGTAGTGG AGTGGACGAT CCCATGGTAG CAATAGGTGC CAACGGACAA GCGGTATCCA CACATGGGAA GGCTTGGGTG GATCGAAAGA TGGGCTTGAC TGTCACTATC ACAACCCACC GGATCGTCCT CATGCAGCAA ACGTCGGACA AACGAGTCAA CGCTCGTTAT ATCCACCTTT CTCACGTTCT AGCCGCTGTG ACTGAAAACC AACTTTTTAA GAGTCCTAAA ATAATATTGG ACTCCTACAG CGGGGAGTTT CTTCTCGTGT TCAAAGGCAA AGAGGCCAAT AAAGATCGAG ATGCCGTGCT CTACCACATA CAAAAGGCGC TTTCACGTCA AGATTGGGAG ACAGCCGACC GGGCAGCGCA ACACCGAAAG GCTGTAGCAA ATTTGACTTC CCGCAAGGTA GGCGTCGACG CGGTTCTCGC CAAGCACAAA ACTCGGCACG CTCAAGCGGC TCGTCTCACG GACTCCGCTT TCGATGGAGA CGCCGAAACG TTGCTACGGG AAGCCCATGA ACTCGTCGCT GTCATTCACA AATACGTGGC AACGCTCGAT AAGCAAAAAG AAGTTTCCTC ACAAGACGAA CAGGATGCAA CCCGTTTGGC AGATTTGCTG CAAAACATGG GAATGACGTC GGCCCTGTCC AAAGCGAACT TTCTAGGCTC GGAAGATGCA TACTATACGC AATTGGCCCG ACAGCTGGCC GACTTTTTAG AACCCCATTT ACACAAGGCT GGTGGTATAC TAACACTGAC GGATGTGTAC TGCTTGTTTA ATCGTGCGCG TGGCACAAAC CTGATTTCGC CCGAAGACTT GACCAAGGCA GCGTCTCAGA TGGACGCATT GTCCATCGGG ATGTCTCGAC GGGTTTTTCC AAGTGGACTA ATTGTTATTC AGGATGACTC CTTTGACGAT CACGCTATGG CAGAGAAACT GCAAGCTTTG GCTTTGGACG CCCCACAGGG TTTGACGGAA ACGGAAGCCT CACGACAGTG TCAAATCTCA GCCTTGCTGG CTCACGAAGA ACTACTGGCG GCTGAACGCA TGGGCATTTT GGTGCGGGAC GAAACATTGG AGTCGACGCG ATTCTTTCCT AACCGATTTG AAGCTTGGGC AGACATACAA TAGTCTTTCC GAAAAAGTTA AGCCGTACAG CAATAGAGCC AACTGATTTT G
|
Protein sequence | MNETYSPFEC LPVADLTSAG LLQKQDDIEV ELIARENDCE LRSGVDDPMV AIGANGQAVS THGKAWVDRK MGLTVTITTH RIVLMQQTSD KRVNARYIHL SHVLAAVTEN QLFKSPKIIL DSYSGEFLLV FKGKEANKDR DAVLYHIQKA LSRQDWETAD RAAQHRKAVA NLTSRKVGVD AVLAKHKTRH AQAARLTDSA FDGDAETLLR EAHELVAVIH KYVATLDKQK EVSSQDEQDA TRLADLLQNM GMTSALSKAN FLGSEDAYYT QLARQLADFL EPHLHKAGGI LTLTDVYCLF NRARGTNLIS PEDLTKAASQ MDALSIGMSR RVFPSGLIVI QDDSFDDHAM AEKLQALALD APQGLTETEA SRQCQISALL AHEELLAAER MGILVRDETL ESTRFFPNRF EAWADIQ
|
| |