Gene Information |
Locus tag | PHATRDRAFT_45241 |
Symbol | |
ID | 7200257 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 582026 |
End bp | 583104 |
Gene Length | 1079 bp |
Protein Length | 262 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179462 |
Protein GI | 219117335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
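The Gene Length field above matches the inclusive coordinate span, End bp - Start bp + 1. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive; the function name `span_length` is hypothetical and not part of any database tool. It also shows that the 262 aa protein needs fewer bases than the full span, which fits a spliced gene whose span includes non-coding sequence.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Gene Information table.
gene_length = span_length(582026, 583104)
print(gene_length)  # 1079

# Bases needed to encode 262 amino acids plus one stop codon.
coding_bp = (262 + 1) * 3
print(coding_bp)  # 789
print(gene_length - coding_bp)  # 290 bases of the span are not coding
```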
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00235229 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTTGTTGAT CCTTGTCAAG CTGCCTATTG AACGCATCAA TAAAACTTTA GCTATGGTGA CTTTTGGAGA ACGGTTGCTT TCATCGGAAG ATGAGCCATA CTTGAAGGAC CCAATTAATC AAATGAAGTC ACAGTAAGTG CCTTTTATAA ATGTTGTGGA GAAGTGACGA TCTCTCAAAA GCTGAAAGGA GTATGTTCAT CGGAGTTGTT GTCTCGTTGT ACTTCTCCAA AGTTACTGCC ATCATGGTGT CCGTCAGATC TGCGTCGGCC GTCCTTACAA AATTACGATG GCTCCGGCGG AGCAAATTGA ATTGGATTTT CGTACTCGTA ATTTCATTCT CACCGAGATC GTTTCCTTTC GTCTCAAAAG ATGGACAGTG CTATTGCTCA CTTGGATAAG TTGGATTTTT GCTTTGGTTA CCGTGACACG TTGTACGTTC ATCAAGATTG TATCGGACGA TGAGAGCTCT TCAGACATAA AAAGTCTCGG ACTTTTTTCT GTGCCAATCT ACACTGCTGA TAAAGACATC CGTGGATGTG TCTCATATGA AAGTAATGAC GGTCGAACCG GTGGTTTTAA AGCTGGACGC GCTTTCTCAA TTTTCCTCGT GGTGACCATT TCAATCGCTT TTGTCATTGT GAACGGGATG ATGCTGTTCA TTCAAACACA CATGACTCGG CGACTGATGT ATTTGCTTGT TCGTGTATGT GTCCCTGTGG CCTTCATCGC AAATACTCTA ACATTTGTGA CGTTCTCGAT GGAAGAATGC AGTGAAAAGG GAACAAAGTG TTCTCCCGGA GGAGCTGCCA TTGTGGCAAT TTTGAATGTT CTTGTTTTGT TCATCTTGGC TGTTCTTGTA ATTGTCACGC CAGCGCCCAG TCAATGCGCT TTTCAGTTTG TGTACGATTT TACCAAGGTC GCCCCGAGGG ATCATAGCGG GCAATCTTCA TCCTCGGAGC AGAAACACGC GCAGGGAGAT ATATATCGTT CACCTCCCAA AAAGAAGAAC AAAAAGAATA TCTTCCGAAA GCGCCGAGAA GAGAACCCGG AGAATCAAGA TGACTTGGAA CAAATCTAA
|
Protein sequence | MVTFGERLLS SEDEPYLKDP INQMKSQWTV LLLTWISWIF ALVTVTRCTF IKIVSDDESS SDIKSLGLFS VPIYTADKDI RGCVSYESND GRTGGFKAGR AFSIFLVVTI SIAFVIVNGM MLFIQTHMTR RLMYLLVRVC VPVAFIANTL TFVTFSMEEC SEKGTKCSPG GAAIVAILNV LVLFILAVLV IVTPAPSQCA FQFVYDFTKV APRDHSGQSS SSEQKHAQGD IYRSPPKKKN KKNIFRKRRE ENPENQDDLE QI
|
| |
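The sequences above are displayed in blocks of 10 characters separated by spaces. The following is a minimal Python sketch, assuming that layout, of how the spacing could be stripped and a GC fraction computed. The function names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the 44% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence as displayed in the record.
example = "ATTTGTTGAT CCTTGTCAAG CTGCCTATTG"
seq = clean_sequence(example)
print(len(seq))                   # 30
print(f"{gc_fraction(seq):.2f}")  # 0.40
```

Pasting the full Gene sequence field into `example` would give a value to compare against the GC content field.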