Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19604 |
Symbol | |
ID | 7200309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 227981 |
End bp | 229307 |
Gene Length | 1327 bp |
Protein Length | 363 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179388 |
Protein GI | 219117187 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACCTCGAG CAACTCTCCG GCAGTGTACC CCAAAATTCC TTACTCGCGG ACTTGACCAC CGCACTCCAC GACGTTGCCA CCGATTGGCA ACCCTCCATC GCCCTCGCCG AAGCCGTACT CGACCTCGAC GCCGCTCCCC GCGAATTCCT CGTACAAGCC GCCTACCGGG AAGAACTCGG AACTCTACAA CGGGAACTCC AGGACGTCCG GGACCAAGTC CGGGACTGTC ACGCGCACAT GAACGACTTG TGGGCCTCCA CCACCGGCAA CGCCCAAGCC ACGGTCAAGC TAGAAACAGC CGACGACGGT TTCCTCTTTC GACTCACCAA CACCAACGAC ACCAAACTCT TGCAGAATCA ACTCGGGAAC GTGGTGCAAA TCCACAAACT GCTCAAAAAC GGCGTCTCCT TCTCCACCAA GGAACTCCGC CAGCTCGCCA CCGCCCAGCA GGATTTAATG GCCGAATACG ATCGCCAGCA AAAAGTCGTC GTCCAAGACG CCCTCAAAGT TGCCGCTACC TACAGCGTCG TACTCCAACG CGCCTTTGAC GCCGTTGCCA CCCTCGATGT CCTAGTCGGA CTCGCCCACC AAGCCGCCTA CAGTCCCCAC GGATACTGCC GACCCACTTT GATCGACGGC GACGACTGCG CCGGTCACGG CATTCAGCTT CAAGGCGCAC GCCATCCCTG CGTAGAAGTA CAGGAATCCG TCTCCGACTA TATTCCCAAC GACGTCGATC TCACCCACGA CCGTTCCAAC GTACTCCTCG TCACGGGCCC CAATATGGGT GGTAAGAGCA CGTACATTCG CGCCGTCGGT GCGATTGTCC TGCTCGCGCA AATTGGGGCC TTTGTGCCCT GCCAATCGGC CACCATTCAC ATTCGGCACC ACATTCTCGC CCGCGTTGGG GCCGGAGACT GGCAAGATCA GGGCATTTCC ACCTTTTTGG CGGAAATGCT CGAATCGGCC GCCATTTTGC GGACCGCCAC GGCCCGATCG CTCATCATCG TCGACGAACT GGGGCGCGGC ACGAGTACGT TTGACGGATA CGGCCTGGCC CGGGCCATTG CCGAATATAT GGTCCGGAAC GTTGGTAATC TTTGTGTCTT TGCCACTCAC TTCCACGAAT TGACCAGTCT GGCCGACGTC TTTACGAACG TTCGGAACTG TCACGTCACG GCGCAGCGGG ACGTGCAGGG GTTGACCTTT CTGTATCAGA TCCAACCCGG TCCGTGTCTA GAGTCCTTCG GCATTCAAGT CGCCGAGCTG GCCGGCGTAC CCGCCGTCGT CGTGCAGGAT GCCCAACGCA AGGCGCGAGA ACTGGAA
|
Protein sequence | MNDLWASTTG NAQATVKLET ADDGFLFRLT NTNDTKLLQN QLGNVVQIHK LLKNGVSFST KELRQLATAQ QDLMAEYDRQ QKVVVQDALK VAATYSVVLQ RAFDAVATLD VLVGLAHQAA YSPHGYCRPT LIDGDDCAGH GIQLQGARHP CVEVQESVSD YIPNDVDLTH DRSNVLLVTG PNMGGKSTYI RAVGAIVLLA QIGAFVPCQS ATIHIRHHIL ARVGAGDWQD QGISTFLAEM LESAAILRTA TARSLIIVDE LGRGTSTFDG YGLARAIAEY MVRNVGNLCV FATHFHELTS LADVFTNVRN CHVTAQRDVQ GLTFLYQIQP GPCLESFGIQ VAELAGVPAV VVQDAQRKAR ELE
|
| |