Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45973 |
Symbol | |
ID | 7200845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 818866 |
End bp | 820346 |
Gene Length | 1481 bp |
Protein Length | 475 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180323 |
Protein GI | 219119113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGGGA AAAAGCGCCG CTCAACGAAA ACGCTCGAAT CTTTGTCGAC GACTCCATCC GATGGTGTCA ACGGAGAGAA ATCTGAATGT GTCGATGAAT CGTATTCCAG TCTTTTGCAT CCCCGACCCC TCGCAACAGT TTCATGGAGT ACATCTTTTT GGGTTGCTCT CGTTTTTAGC TATCTAATCA TTCTGTTCCT CATCTCTAAC TGGGCGAGCT CTCAGTTGCC AAATCCCTCC GTTTGTTCCG TGGCTGCTCC CGCCGTAGAT CCGTCCCCGA AATCCCACCA TAAGTGGCTG AAGGCCTGGA AAGCTTCTCC CGTGGTCAAG GGTTGGAAAG GAGCCCGCAG CTTTACTCGC TTTGTATTAC GACGCAAGTC CAAAACAACT CCCTTGTCTG CTCTTAACAG TAGATACAAT TCGATCGCCA TTCCACAGGC TTCCACGTCG CCTCCAAAAA TATCGAGCTT GGAACAGCAG CTTTTGACCG AGCTGGCTAC TCGTGTGCAC CAGAAATGTC CGGATGTGCT TGATCGAGCC AAAAAAGTGC CCTGGGGCGG GCATGGTGAT GCCTCTTGGT GGTTTCCGGT AGGAAATCAT ACAAAACCTA AATCATTGAC ACCGCTCGGA CAAAAAGACG GGGGCTTTCT TTTATACGGC CACTACAGGA TCTTGTCCAA GTCCCTGCGA AATCCTCGCG ATCTTTCCTT CACCCACTTT CCTTTCCGTT TATGCAAAAC GGGGTGTCCC GCCGAGCAAG GCGTACTGCA CACTTTGCAA TGGCGTGAAA CGTACCGACC TTGGATGATG TCACCGTCAG GCATTTCGGA GAACCGTATT GGATGGGTAT ACACGAGAGG ATTCGCCAAG GCCTCGCCCC AGAATTCTCG CTATGGTCGT CATGCCATGA TTTGGGTCCG TCCCGGGATG CACCAAACTG TTGATGGCAT GGCTTATTTT CGTGTAATTC TCAACACAGT TGACGCAGCC ATTGCCGCCG CGCTCCGCGA TTCCCACGGG CGTGTCGGCA AGTTTAATGC GGTCATCGAT GCCACCAATT ACGAGTGGTC AAAAATGCCA AACATTGCAC ACATTAAACA GCACGTCACT ATGCTCCAGG ATCATTACCC GGACCGGCTT GGAGTGCTAC TTTTGATCAA TCTCTCGCGA TCGGCCGAGT TTTTCGTCAA TATTGTCAAA AATTTATTGA CCAAAGAAGT CAGAGAAAAG ATCATGGTGT TGCCGCATAA TAAAGAAAAG GCTCTTGCTC AGTTGGGCGC GGTAGTTGAA AATGAATACA TACCAGACTG GCTAGGCGGG CCAGACAGAT TCCGATTTGA TGGTTTACAT TACTATGCAA AACAGCAACG CATGAGTGAA GTTGATAGCC GCTCCTTCCT TGTCGCTATG CCCTACCACG CAAATTAAGT ATGAAGCACA AAAACCATGT AAAATATGAA AACAAAGCTC TATTATATTA C
|
Protein sequence | MVGKKRRSTK TLESLSTTPS DGVNGEKSEC VDESYSSLLH PRPLATVSWS TSFWVALVFS YLIILFLISN WASSQLPNPS VCSVAAPAVD PSPKSHHKWL KAWKASPVVK GWKGARSFTR FVLRRKSKTT PLSALNSRYN SIAIPQASTS PPKISSLEQQ LLTELATRVH QKCPDVLDRA KKVPWGGHGD ASWWFPVGNH TKPKSLTPLG QKDGGFLLYG HYRILSKSLR NPRDLSFTHF PFRLCKTGCP AEQGVLHTLQ WRETYRPWMM SPSGISENRI GWVYTRGFAK ASPQNSRYGR HAMIWVRPGM HQTVDGMAYF RVILNTVDAA IAAALRDSHG RVGKFNAVID ATNYEWSKMP NIAHIKQHVT MLQDHYPDRL GVLLLINLSR SAEFFVNIVK NLLTKEVREK IMVLPHNKEK ALAQLGAVVE NEYIPDWLGG PDRFRFDGLH YYAKQQRMSE VDSRSFLVAM PYHAN
|
| |