Gene Information |
Locus tag | PHATRDRAFT_45464 |
Symbol | |
ID | 7200567 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 243988 |
End bp | 245736 |
Gene Length | 1749 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179611 |
Protein GI | 219117639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000560804 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTATTCCG GATTGTGTCG ATCTTTCTGT AGAAGCTGGT ATCTTCATTG TTCGGTGACT TACAACCATG GTACGATGTA GAAGCGTAAA TCAAGAGATG GTCGTTCTGT GTAGTCGTCC CAATGCTTGG CGGCTGCGTT CTTGCTTTTT TGACCGGAAA AGAGGTCATC AGCAAAGCAC TCTCGCATTC GTTTAAAATC GGTGGTCGCC CCCGCATCTT TCTGATCACT CTCTTTCCTA CCTTATTTCC CAGGCCTCCC TTGTTACATG CGTAATTACC TCCGCCGGAT GGTGCTTTTG CACCGCCTGT GCTTCTCTAC TGGGAGCCTG CTGCGGCAAC GACAAGGCCT CGACGATTCC ACCAAGCGTC ACTTCCGGAC GGCGACGTTC GGTGCTACTT CTGTTTTTTT CGATTGCGAT GGCGCTCGTT TTCCAGTACG GAATCGCCCC CGCTATAGTT GACGCCAACG ATTTGACTAG CTTTGTTAGG GACAAGTGGA CGGATGGATG TTCCCAGTAT GTCGACATCG ACGTGGTCAA GTCTTGTGCT GGCAACAATG GTGTTTATCG CGTATCGGCT GCTTCGACGC TATTCTTTGC ATTCGCGGCG CTGGGAGCTC TACTTAAACC CACGGCCAAT CGAGAAGCAT GGCCGGCCAA ATACACTCTC TACTTCTTTC TCTGTATCGT CACTATTTTC ATTCCCAATG ATCCGCTTTT TTCTGACGCC TACTTGAACA TTGCACGTAT CGGCGCAGTC CTTTTCATTG TTGTTCAGCA GCTTGTCATT GTTGACATGG CTCACGAATG GAATGACAGC TGGGTCGCTA AGGCGGATGC CGCAGAAGCA CAGGAGGCTG GGTCCGGGAA AAGGTGGCTC GGTGCTATTG TGACTGCTTG CATAATGCTC TTTGGAATAT CCATCATTGC AATAGGCGTC ATTTTTTCTC GCTTCACGGG ATGTGGCACA AACAATGGAT TTATTACTGT CACGCTTGTG CTCGGCGTCT CAATTGTCGG TGCGCAGATG TCTGGCGAAG AAGGTTCGTT GCTAGCCAGT GCCTGCGTCT TTGCGTGGTC TGTGTTTTTG TGCTACACAG CCGTTTCCAA GAACCCTGAT GCATCCTGTA ATCCTATGCT GGGCGAAATG GATACTGTGA GTATTGTGCT GGGCTTGACC GTGACGGCAA TTAGCCTTGG ATGGACGGGA TGGTCGTACA CGGCCGAAGA CAAGCTGCGG TCGTCTTCTG AAGAGGAAAG CGCTGCTGCA GCCACGGCCA GGGCCAGTGA CGACTCCGAG AAAGATGTCA GGCGGGACGT CACAGGTGTG GTCACAGGCA ACGACTATGG AACGCAAGAC GACGAAGAGC AAGCTAACAG TGCGGGTCAT GCCGAAGTGG ATGAATCAGT CTTGAACAAT CCTAGCCGTC TATCCAATTC ATGGAAGCTG AACGCCATTC TAATGAGTGT ATCATGTTGG AAGGCTATGG CTCTAACCAA TTGGGGCGCG ATTGTGGCCA ATGGCAATGC TGCTAATCCT CAAGTCGGCC GTGTTGGGAT GTGGATGGTT ATTGCCTCGC AATGGCTTGT TCTGACGCTG TACTTGTGGA CATTGCTGGC ACCGAGACTC TTTCCCAATC GCGAATTTGG CTGACTTTGT CTGATTTGCA GGATTGTTGG AGATAACAGC AATGTTTCAC AATATTGTAT AAGTTGACGG TCTAATATAG AGAGGCGCTG GATAAACGG
|
Protein sequence | MASLVTCVIT SAGWCFCTAC ASLLGACCGN DKASTIPPSV TSGRRRSVLL LFFSIAMALV FQYGIAPAIV DANDLTSFVR DKWTDGCSQY VDIDVVKSCA GNNGVYRVSA ASTLFFAFAA LGALLKPTAN REAWPAKYTL YFFLCIVTIF IPNDPLFSDA YLNIARIGAV LFIVVQQLVI VDMAHEWNDS WVAKADAAEA QEAGSGKRWL GAIVTACIML FGISIIAIGV IFSRFTGCGT NNGFITVTLV LGVSIVGAQM SGEEGSLLAS ACVFAWSVFL CYTAVSKNPD ASCNPMLGEM DTVSIVLGLT VTAISLGWTG WSYTAEDKLR SSSEEESAAA ATARASDDSE KDVRRDVTGV VTGNDYGTQD DEEQANSAGH AEVDESVLNN PSRLSNSWKL NAILMSVSCW KAMALTNWGA IVANGNAANP QVGRVGMWMV IASQWLVLTL YLWTLLAPRL FPNREFG
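The derived fields in this record (Gene Length, GC content) can be rechecked from the coordinates and the sequence text. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based and inclusive (which is consistent with the listed 1749 bp), and that the sequence is stored in whitespace-separated blocks as shown above. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Drop the whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table.
    print(span_length(243988, 245736))
    # First two blocks of the gene sequence, used here only as a short demo input.
    print(gc_percent("TGCTATTCCG GATTGTGTCG"))
```

Passing the full Gene sequence field to `gc_percent` and comparing the rounded result with the GC content row is a quick consistency check on a downloaded record.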