Gene Information |
Locus tag | PHATRDRAFT_54101 |
Symbol | |
ID | 7197201 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 680008 |
End bp | 681616 |
Gene Length | 1609 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177983 |
Protein GI | 219112463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
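The coordinate and composition fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length of 1609 bp. The helper names `gene_length` and `gc_percent` are hypothetical and are not part of any database API. The record does not state how its GC content figure was computed, so `gc_percent` is only one plausible definition: G plus C over all A/C/G/T characters, ignoring the spaces that separate the 10-base blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percent G+C among A/C/G/T characters; spaces and other symbols are skipped."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(680008, 681616))  # prints 1609
```

Passing the full Gene sequence string from the Sequence section to `gc_percent` gives a value that can be compared with the listed GC content, bearing in mind that the record's figure is rounded to a whole percent.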
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.644826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAAA CTGACAAGCC AACTATCGTA GAAGCCGGTG AACCTGTAAG TATTGAAGAG GCCTTTGCCG CACCAGAACT GTGCGTTTGG CAACAATATC TGACTCGCTT GTTTGGACAG GTTGAATGGA AGCAGTACAG TACGTACAGT ATTAAGACCG ACCCCGACCA AGATGATAAG GCTACAGAAA TCAAACTGTG CAGCTTTGCC CGGCCCCACA TGCGAGCTTT CCACTGTTCT TGGTGGTGTT TTTTCATTGC CTTCTTCATT TGGTTTGCCA TCGCCCCCCT TCTCTCCGAA ATCAGAGACG ATATTGGCAT CACCAAACAG GATGTTTGGA CTTCGTCGAT TGTCGGAGTC GGCGGAACAA TTTTGATGCG CTTTATTATG GGACCCATGT GTGATAAATA CGGTGCTCGT ATTTCTCTTG ATTCTGTCGT TCGCTTCTAT TCCTACGGCA TGTACTGGAT GCATGTACTG GATTCGTGAA CAGCGCCACC GGACTCGCGG TCTTGCGTCT GTTCATTGGT GTTGCTGGTT CTACCTTTGT TCCTTGCCAG TATTGGTCGA GCCGTATGTT TTCGAAAGAA GTCGTTGGAA CAGCTAATGC TTTGTGCGGT GGCTGGGGAA ATCTGGGTGG TGGAGTCACA CAGCTTGTCA TGGGATCTGC CCTCTTCCCG CTGTTCAAAA TTTTCTTTGA CGGCGACTCA GAAATGGCCT GGCGAACAGT TTGTGTTATC CCAGCCATTA TTGCCATGGC ATCTGGTATT ATTGTGTATC GTATCAGTGA CGATGCTCCG AAGGGAAACT ACGTTGATAT GAAGAAGCAT GGTACCATGC CTGAAGTCTC AGCTGCTGCC TCATTCCGTT CAGGAGCATT GAACCTCAAT ACATGGGTCT TGTTTGTACA GTATGCGTGC TGTTTTGGAG TGGAGCTGAC TATGAACAAT GCCGCGGCTC TGTATTTTAA GGACGAGTTT GGTCAATCGA CAGAATCTGC TGCTGCAATT GCTTCCATTT TTGGATGGAT GAATCTTTTT GCTCGCGGTC TCGGAGGCTT TACAAGTGAT AAGGCCAACG CCAAGATGGG AATGCGCGGA CGTCTTTGGG TACAAACTAT TTTTCTTGCG CTCGAAGGTG CCCTTGTTCT GGTATTTGCT CAGACTGGAT CGCTGGTTGG AGCCATTGTT GTCATGATTT TCTTCTCCTT GAACGTCCAA GCCGCTGAAG GCGCTACTTA TGGAATAGTT CCCTATGTCG ACCCCGCCTC TACTGGATCC ATTTCCGGTA TCGTGGGAGC TGGAGGTAAC ACTGGTGCCG TCTGCTTCGG ACTCGGATTC CGTCAGCTCA GCTACGAAAA AGCATTTAAC ATTATGGGGT ATTCCATCCT TGCGTCAGCC TTCATGTCAG CTTTAATCAA CATAAAGGGG CATGCAAGTA TGTTCTGGGG TAAGGATGAA ATTATCGAAA AGGGAATACT TGCTGTTCCT ATGCCAGAGG CTGAAGAAGA GATCGAAGCC TAGAGTCTCC TGGTTGATTT TGTCCATTTC CCCCGACTAT TTCCTTCAAC ATCTTATTCT TAAAGTTACT TATTTTCTTT TCTACACTA
|
Protein sequence | MSETDKPTIV EAGEPVEWKQ YSTYSIKTDP DQDDKATEIK LCSFARPHMR AFHCSWWCFF IAFFIWFAIA PLLSEIRDDI GITKQDVWTS SIVGVGGTIL MRFIMGPMCD KYGARISLDS VVRFYSYGIA TGLAVLRLFI GVAGSTFVPC QYWSSRMFSK EVVGTANALC GGWGNLGGGV TQLVMGSALF PLFKIFFDGD SEMAWRTVCV IPAIIAMASG IIVYRISDDA PKGNYVDMKK HGTMPEVSAA ASFRSGALNL NTWVLFVQYA CCFGVELTMN NAAALYFKDE FGQSTESAAA IASIFGWMNL FARGLGGFTS DKANAKMGMR GRLWVQTIFL ALEGALVLVF AQTGSLVGAI VVMIFFSLNV QAAEGATYGI VPYVDPASTG SISGIVGAGG NTGAVCFGLG FRQLSYEKAF NIMGYSILAS AFMSALINIK GHASMFWGKD EIIEKGILAV PMPEAEEEIE A
|
| |