Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39374 |
Symbol | |
ID | 7195119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 326262 |
End bp | 327362 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183342 |
Protein GI | 219126183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCGT TCGATCCCGT GGACTCGGAA AAACCCGCGG AACTCCTGGA TCCCCAAACC TTGCACAACG GTACCATTCC CTATCCAACC GCACTTTCTC CTTCGGCCAT TTTGGAATTT CAGAAATGCC CGCAATCTTA CCTATTCCAA TACCTGTACA AACTCAAGCA ACCCACCAGT CTCGCCTTGG CGAAAGGTTC CATGCTCCAT CACGCCTTGG AACAAGTGTA CGATCTGCAA CCCGCCGAGC GTGATTTATC CACTCTGCAG AATCTATTCC GTCGCGTCTG GAGCCAAAAT CGCGAATCGG ATGTGTACCG GGACTTGTTC GCGACTCCCG AAGCCGCCCG GGATCTCGCG ATGGAATCCG TCTGGGGTCG GGAGGGGCTC CAACTGTTGC AGAATTACGT GCGCTTGGAA AATCCGCAAG CCGTCACTCG CCCTAATCCA GTCCAACGGG AAATATGGGT CCGGGCTCAT CTCACCATCG ATTCCTCACA GGGTGCGACG GGGTACGTCC TGCCCGGTCG CAAACCCCAG GCCACGTCCA CACCGAACAA CAAGGACGCA GCCGCCTTTT TAGTCCGTGG CATTGTGGAT CGGTTGGATA TGGTCCGGAC GCCTGAATCG AAACAAGCAG TCCTACAAAT CGTCGATTAC AAAACCGGCA AGGCGCCGCA TCTCAAGTAT AGTGCGGGCA TGAATCAGAA AATTCGGGAC GAAGCGCTGT TTCAACTGCA AATCTACGCA CTCCTGTTGC GCGAAAAGCA ACTCCAGAAA CAACAGCAGT TCGAAGACGA TGCGGCCGCG TCCTCGCAAA CCTTACCCGT GCGATTTCTG CGTCTCTTGT ACCTAACAAA CGTCAACGAT CAGGCCGAAA CGCTGGACAT GGACTTGGGA GCGACGCCGC TTGAGCGGGA CTCACGACTC CAAGACGTGC ACGCGCAAAT CTCTACCGTG TGGAACTCCA TTATCGATAT GGTCAGTCGA CAGGACCCTC ACGCCTTTGT CGGCTGTGAC CGGTCGTTTT GCTATTGCCA CAAGTGCCGA TCGCGATTCG TGCCGGGATC TGTATGGGAA CCGCCGGTGG AACCGACCTA A
|
Protein sequence | MAAFDPVDSE KPAELLDPQT LHNGTIPYPT ALSPSAILEF QKCPQSYLFQ YLYKLKQPTS LALAKGSMLH HALEQVYDLQ PAERDLSTLQ NLFRRVWSQN RESDVYRDLF ATPEAARDLA MESVWGREGL QLLQNYVRLE NPQAVTRPNP VQREIWVRAH LTIDSSQGAT GYVLPGRKPQ ATSTPNNKDA AAFLVRGIVD RLDMVRTPES KQAVLQIVDY KTGKAPHLKY SAGMNQKIRD EALFQLQIYA LLLREKQLQK QQQFEDDAAA SSQTLPVRFL RLLYLTNVND QAETLDMDLG ATPLERDSRL QDVHAQISTV WNSIIDMVSR QDPHAFVGCD RSFCYCHKCR SRFVPGSVWE PPVEPT
|
| |