Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_9509 |
Symbol | |
ID | 7196442 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1307846 |
End bp | 1309180 |
Gene Length | 1335 bp |
Protein Length | 404 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177264 |
Protein GI | 219111025 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.363857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTGA AGGACCGCTT AAAGAGGTTT GATACACACT CTCCAGTATC CAAAGAATTT CGTGTTTACA CTGTTCAGGG TGCTGTTCTA TCCATCGTTA CTTTAGTCTT CGTCGGGTAT CTTGTCACCG CTGATTTTTT TTTCAATTTC CAAGTAACGC TTCAGGAAAA AGTTCATGTG AATGCAAGCA GTCCGTCGGG AATTGAGCTG GAGTTCGACG TCAGTTTGCC GGATGTCCCG TGCTCCAAGC TTAGTATCGA TGCCAACGAC CCGAATGGAC AAAAGCAATC CCTTCATTTA GACACAGATC ATCATGTATG GAAGCATCGT ATTACTTTAT TGCCTAACGG GCACCGCCAA CTATTAGGAG AGCGATCTAA ACTTGAGCTG GGAAGCACTT TGCTGACGGA AAAAGACTTA GAAGTGAAAG CAGAGGAGCT GCAGAATGCC AAGGACAATT CTGAATCGAG GACTGAAATG ACACCGTGTG GTGATTGTTA CGGCGCAGGA GAGGAAGGCG AATGCTGCAA GTCTTGCGAG GACGTGAAAA GAGCCTACAA AAGACGGGGG TGGTCGTTGC GAGATACATC GGGAGTGTCG CAGTGTCGAA GAGAGTCGGG AATCGCGGAA GCAGAAGGTG AGGGATGTAA TGTACACGGA GTCGTGGCAT TGTCAAGTGG AGGAGGAAAC TTGCACATTG CTCCGGGACG GGATACGGAG GCCAATTTTC CTGGAGGAAT GAATATTTTC GACGCGCTTT TGCAATCGTT TCATCAGTGG AATGTGTCGC ATCAAATCCA CAAACTGCGC TTCGGAAAGG ATTATCCTGC TGGTGTCTAT CAGTTGGATG GCGAAACGAG AACAATTACA GATGGATACG GCATGTATCA ATACTATTTT CAGGTATGTT AACCAATTCA TCTTTCTTAA GCGCGGCTAG CTCGTTTCTG AAGACCGCTT TTGATTTTTT TCAGGTCGTC CCTACCCGAT ACACTTTTTT GAATGGCACG ACAATCCAGA CACACCAATA CAGCGTGACA GAGCATCTCA GGCATGTAAG CCCTGGATCC AATCGAGGGT ATGTCTGTAG CGAAAGTGAT CACATTTCCA CCAGCGCTGT TCTCTCACTA GTCCTTAAAC TCCAGAATGC CTGGCATTTT CTTCTTTTAC GAAGTCAGTC CGCTACACGT CGATATAATG GAAGTCTATC AAAAGGGCTG GATTGCATTT CTTACAAGCG TCTGTGCCAT CGTCGGCGGG GTAGTAACCA TCGCTGGATT GATCGATCAT GTCATTTTCA GTAGACAACA TTCGTCCAGA GAGCTGATGC GATGA
|
Protein sequence | MDLKDRLKRF DTHSPVSKEF RVYTVQGAVL SIVTLVFVGY LVTADFFFNF QVTLQEKVHV NASSPSGIEL EFDVSLPDVP CSKLSIDAND PNGQKQSLHL DTDHHVWKHR ITLLPNGHRQ LLGERSKLEL GSTLLTEKDL EVKAEELQNA KDNSESRTEM TPCGDCYGAG EEGECCKSCE DVKRAYKRRG WSLRDTSGVS QCRRESGIAE AEGEGCNVHG VVALSSGGGN LHIAPGRDTE ANFPGGMNIF DALLQSFHQW NVSHQIHKLR FGKDYPAGVY QLDGETRTIT DGYGMYQYYF QVVPTRYTFL NGTTIQTHQY SVTEHLRHVS PGSNRGYSLN SRMPGIFFFY EVSPLHVDIM EVYQKGWIAF LTSVCAIVGG VVTIAGLIDH VIFSRQHSSR ELMR
|
| |