Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50611 |
Symbol | |
ID | 7199429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 80271 |
End bp | 81988 |
Gene Length | 1718 bp |
Protein Length | 445 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185563 |
Protein GI | 219130841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGGGCAGT AGTGCGACGG AACCTACCAT CGGCGAGTAT CAGTGGACGG AGGGACACTT CTTCTCCTTA CTACTCTTAC TCGTCTTGCA ATAAGCCCTC TCCACTAGAG TACTACTCAC AGTCGCCAAC CATTGTTGAG GAGTAGACTG CCGAGCGGCA AGGGTGTGTT GGTGGAGTGT TGGTGCGATT GCGTTTCTCC GTAGGTACCG TACCGTACCC GAGAGACGGA GGTTCTACAC TAGGATAGTG TAGGACACGC GCGAGAAGAA GTAGTCTTGC TCGCGCTGTC ACTCACACCA AGTCAACAGG ACAACAAAGA GTGTGTTTGT GTGTGTACCA CAACAAATTT GAAATAACCT ACAGCTGCAC ACGGCTCATC ATGTTGCCGA CGGGTTGGTT TACCAAAACG AATCGTTTGC GTCTGTCGAC TTTGGAGCAT TGGCTTCCGG ACGGTGGTGG CGCTGGAGCT CCGGTAGAAG GCCATCTGCA CCTAACCTTG CCGGATCTCG TGACGAACGG ATGGACTTGG TCGACGGTCT ACAATTTCTT GGACGATGAT TCCACACTCC GCGTCGTCTG GGTACAACCC CACGATGCCT TTCTGATACC CGGCGGTTTG GACGATATCT TTGGACAACA GCCGCACTTT CACGGCTACA CCCGACGAAT TTCCGTCCAG GTACTGACCG TCACGGGCTT GTTCATGAAT ACCTACTCGG ACTCGTACGA AAACTACATG ACCGTTTCGG TCTACGCAAA CTCGCCGGCC CAAGCCGCCT TGACCATTGG CGATCTCTTT CGGCTCATTC CGTCGAGTCG TCTGGAAGGA CTCCGCCTCG TCAGTAACCA GGGCGAAGAA TTTCCTCTCG ATAGTCACGT CACCCGACAA TTTCTGGAAG CCAACCAGCA TCTGAAAACG CTGCAGTTCC TTTACGGAAC CTTCAGCGAA GAACACTGCC GCGTTATTGC CAATACCATG CGGACGACCC ATTTGCATTT CCGGGAAGCC CAATTGCGCG ATGATGGACG GGCCCTCATT GAATGCCTCC GGAATAATCA GGGACCAACG CGCTTGACGC TCGACGATAC ACAGATCTCT GAACACAACC TGGAGGCAAT TGTCGATAGT ATACAGGCCA ATCAACAACT GAAACGCTTG GGACTTTCGG ATATGAAGTT GACGAACCAG TTGGTACAAA CTCTGTCCGC CACGTTACGG AAAAATCGGT GGTTGGTCGA ATTAGATTTG ACCTGGAACA GCATCAGTGA CGAAAACTGG AGCGATTTGA ATCTTACCAT TCGCGATCAT CCGACCCTGC AAGCTTTGAA TCTGTACGCC ACGACCAATG CGGGCGTCGG AGAAATCCCC ACCTCGCGCA AAATATCGCG AACGCTCGCG ATTTGGGATA TGGTCCGCAA AAACGAGATC CTGCAGGAAA TCAACCTATC TTCGACTGAA AACGATGAAA ACCTCATGCC GAATATCCGC GGCTGCCTCG TCCTCAATCA GCATCGTCCT AAAATGGAAC GGCTTTGTGC CGAAACCGTG TACCAACGGC AAACGCTCCT GGGACGAGCC TTGGTGGCCG CCGGACGACA GGAAACATAT GGGCCGGATC TACAGTTTTT GCTGGTCTCG CGGAATGTGG AAACCATCTT GGCCGCGCCG AGGCAATCCC GCCCCTTCCC CGAGAACAAG CGTGCACGCG TGATATAG
|
Protein sequence | MLPTGWFTKT NRLRLSTLEH WLPDGGGAGA PVEGHLHLTL PDLVTNGWTW STVYNFLDDD STLRVVWVQP HDAFLIPGGL DDIFGQQPHF HGYTRRISVQ VLTVTGLFMN TYSDSYENYM TVSVYANSPA QAALTIGDLF RLIPSSRLEG LRLVSNQGEE FPLDSHVTRQ FLEANQHLKT LQFLYGTFSE EHCRVIANTM RTTHLHFREA QLRDDGRALI ECLRNNQGPT RLTLDDTQIS EHNLEAIVDS IQANQQLKRL GLSDMKLTNQ LVQTLSATLR KNRWLVELDL TWNSISDENW SDLNLTIRDH PTLQALNLYA TTNAGVGEIP TSRKISRTLA IWDMVRKNEI LQEINLSSTE NDENLMPNIR GCLVLNQHRP KMERLCAETV YQRQTLLGRA LVAAGRQETY GPDLQFLLVS RNVETILAAP RQSRPFPENK RARVI
|
| |