Gene Information |
Locus tag | PHATRDRAFT_35954 |
Symbol | |
ID | 7201321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 171724 |
End bp | 173160 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180588 |
Protein GI | 219119666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0465172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCGGA CTTCCCAAAG CTTGGCGGAA GGCGATGACC GTCTCTTGGT AGCAGTTTGG TCTCCGTCTC TACAAAACGA GGTTGATGCC GTCGGCAACG GCCGCGCTGC TGCTAGTGCC AGCCTTTGGA GCAGCGCAGA CGTTGAGCTT TCGTCAACTT CGATCGGCGT TCGTACAGGT GTTGTTGTAC GCCACTTGCC CGATCCAGCT ACGGGGGACG ATCGTGCCTA CCTGATTTCT TACCCCAGCA GCAACAACGG TGTTGGTAGC AATAACGGGG GCACGTCAAC CGTTACAACT TCTTCCCTAG CGTCTACAAC CAAAACCACT ACCGACCGTA CTAATCCGCA CGTTGCCGTG GCGGAGTTAC AGTCAATTTC GACACGGTAT GGCTCCTACT TGGTTGGAGA CGACTATGTT GTGGGAGACG GATCTTTGTA CACGGCGACT CCAGTGGATC CGCTCTTTTG GTGTTTGACC GACGACAATG CAACTGCCAA CACTACGACC TCCGGGAACC ACTCTGCAGC TCAGTCGCAA TCCTGGCAAC CCCTCGAACA GATTGTGGCC ACAACATTTC CTCCCCACGT AAGCGACCTG GTCATGGATA AGGCGCAGTA TCGGCATGTC TTGGCCAGTA TGAGTCTGGC TATGGACAGC GCATCGTCCG ATCCGAACGA ATGCTTTTAT AGATTCTCCG TCGCCAAGGC ACTCGTCTGG TTGACGCGTA AGCAACAAGC CGTAGAGGCG GTGCTGTTAC AGCAAGCGGT CAACGAAACG GCCACCTCAA ACGCCGCCGC AGCTCTGGCT AAGGAAGCGG TTAACGGCGG TGCCTTTAGC GATTCCTTCC AGTTGGGAGG CCAGCGCGAT GAGTGTGATC CGAAGAGATC CAAATCACCA GTAGCGGGTT CACTTGAGCC GCCACCAGCC GCACACTTTG ACCAGCCTCT GTCACAGACG GGCTCGCCCG AAAACGCCGC AACGACCACG CCGACGTGCG TCAGTACCCT CCAACCGCAG TCTCTATCGG TGAGTGCACA AATACAAGCC AAAGAAGAGA GCATTCAGGT TGTATGTCAG TATCTCCGTC CAGTTTGGCA AAGCCGTTTT CTCGAGCATC TAGAGGTGAC CGAGTCGGTG CTCGAAACGA CAACCGAACG GCAAAAACGG CGACTCGTAG CCCAACAGGC TAGTAACAAC CCGGATGCTG GCCACTCGGG AGCACCAACA ATGACGCTGC CGACCGTTTC CACGGCCGAC TGGAACGAAC GTCTCACACA AACCGTATCG GAAGATACAA GTGGCAACAT CAACACGATT TCGACTAAAC GAGGGCCGCC TTCGCAAACT ATAGGAGCAA AAAAACTGGC CAAGGTGAAC ACCAAGGGGA TGAAAAAAAT GAGCGCCTTT TTTGGGGCGG CAGCCAAGAA GAAATAA
|
Protein sequence | MIRTSQSLAE GDDRLLVAVW SPSLQNEVDA VGNGRAAASA SLWSSADVEL SSTSIGVRTG VVVRHLPDPA TGDDRAYLIS YPSSNNGVGS NNGGTSTVTT SSLASTTKTT TDRTNPHVAV AELQSISTRY GSYLVGDDYV VGDGSLYTAT PVDPLFWCLT DDNATANTTT SGNHSAAQSQ SWQPLEQIVA TTFPPHVSDL VMDKAQYRHV LASMSLAMDS ASSDPNECFY RFSVAKALVW LTRKQQAVEA VLLQQAVNET ATSNAAAALA KEAVNGGAFS DSFQLGGQRD ECDPKRSKSP VAGSLEPPPA AHFDQPLSQT GSPENAATTT PTCVSTLQPQ SLSVSAQIQA KEESIQVVCQ YLRPVWQSRF LEHLEVTESV LETTTERQKR RLVAQQASNN PDAGHSGAPT MTLPTVSTAD WNERLTQTVS EDTSGNINTI STKRGPPSQT IGAKKLAKVN TKGMKKMSAF FGAAAKKK
|
| |
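The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database, and it rests on stated assumptions: that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above.
length_bp = gene_length(171724, 173160)   # 1437, matching "Gene Length"
length_aa = protein_length(length_bp)     # 478, matching "Protein Length"
```

Passing the Gene sequence field to `gc_percent` gives a figure that can be compared with the reported GC content. The 10-base blocks separated by spaces can be passed as they are, since whitespace is removed before counting.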