Gene Information |
Locus tag | PHATRDRAFT_44417 |
Symbol | |
ID | 7197663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 488854 |
End bp | 490755 |
Gene Length | 1902 bp |
Protein Length | 584 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178522 |
Protein GI | 219115453 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.101127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACCATGTCG GAACCAACAA AGGAACGATA CTTGGAGAGT TGAGCGTGCA TGCTGTCTTT CTCTCGGCTT GGTAATTCCC GCAATTGCAC GTGTTGACGG TAAATAGCAA TCTAGCAAAC TCTCGCCGCA AGGCATATGC TAAGCGAATG CTCGTGAACC TTACAAACCG CGCAATGGCG CTGCTACTGG CTGGAATGCT CCTGATGAGC AATGGCGACA TTCCCGGATG CTCCAGTGTT ACCGCGTTGG CTTTGTCGCA GTCATCCGTC TCCGCGGTAC ATCGCCGTAA CGTGGTGGTC GTCGGAGGCG GTCCCGTCGG ACTCGCAACC GCACTGACTT TATCCCGTCC CCCTCATTCC TGTAACGTCA CTGTACTAGA ACGGACCGAC GGTGACACGT CCGTGGCCAC CTACAATCCG GCACGGTCCT ACTTGTACAA CATCAATCCC CGAGGGTTAC GCTGGGTCGA TTCCGTACCC GAGGTGGCCA CAAAATTGGA CGACCGCGGT GTGGTTGTGC GGGGAGGATT CGGTCGGTTC TGCATCGTGC CTGCCGATCC CTCCGTTGCT ATTCCGGAAC CCACAGGCGT GACCGTTGCC GGCTCCCCAC CCAAGTCCCA GAAGCGCAAC ACCAGGGCCA CGCCATCAAT AAGCAAGTCA GTTTGGATAC CTCGCCACCA AATGGTGGAG CTCCTCCAAG AATGCTGTGA GGAACAACTA TACAATGATT CCAAAGGAGT CGGTTCTATT CAAGTGTGCA TGGGCAAAGA GGTTGCGTCT ATGGTGGACA CCGCGGAGAC TGCACTACCG GAATCCCAAA AAGGACCGTC CATTACGGTG CATTGTCGGG ACGGAAGTAT GTATGGAGCC GATCTCGTCG TGGCGGCGGA CGGCATTGAC AGCGCGGTAC GTGCCCAATT GCGCGACCCG GCCCAGTCCG AGTCCTGGCT CGCATCCAAG GCCCGTTCCT TTGTCGTCCG GCGCTACCGC TCGCCCGCCA CGGGTCTAAA ACTCAAAGCC TTGCAACTTC CGCCCAATTT TACCATGATC GATCACGACG GGACCTTGGT GACGACACAG GCACAGAACT TGTACAGTAT TCGCGGTACC AACCAAGGGA CCCGCGACTT TGCCTCGCTC GGACTACTCC CCATGAAAGA TCCCGATATG GTGCGGCCGG CCAACATTAT TACTCGGCCC GATCACGAAC TCTGGAAGCT GGCCGATGGT TCGGCGGTCA AGGCGTGGTT CCAATCGAAC TGGCCGCGTT TGAATTGGGA CGAAATGGTG GACGACAAGG AATGGGAGCG ATTCGTTCAA GCCAAAACGA CGACCTTTCC GTATTGCCAA TACGTGCCGG GATCGGTCGT TGTCGCTCCC AACGAATCCG CGGGTGTCGT CCTGGTTGGG GATGCCTGTC ACGCGTTTCC ACCCGATATT GGTCAAGGAA TTAACGCGGG TTTACAGGAT GTCGTCGCCT TGGACCTCGC CTTGCAAGAT CGGGAAATCG ATTTGTGTCA AGATAGCGTA CCCCCGTCTT GGTCACCGTC GGCATCCGCT CCGACGCTGG GACAAGGTCT TTTGCGCTAC CAACGCAATC GTCGAGCGGA ACACGGTGCG TTGATTCGAC TGGCTCGATT CGGATCGCCC TACCAGTATC GACAGCCTTG GATTCGCGAT CGGATTGGAC GTGTTTGTTG GAGTGCGAAC GTGGCCTTTC GTTTGCTCCT GAATAAGGCT ACTTTTGGTT GGGTACCCCC GGCAGCCATT CTGCTGGCAC AAAATGTCAA TCTATCGTAC 
CGGCAGGTAA TGCGGAGAGC GGATACCACC GCTCGTGTAT TGCAATCGAC CGTGCTTGCC GCGGTGGCTT GGCTGCTGGT TCGAAAATTC TCGTTGCTGT AG
|
Protein sequence | MLVNLTNRAM ALLLAGMLLM SNGDIPGCSS VTALALSQSS VSAVHRRNVV VVGGGPVGLA TALTLSRPPH SCNVTVLERT DGDTSVATYN PARSYLYNIN PRGLRWVDSV PEVATKLDDR GVVVRGGFGR FCIVPADPSV AIPEPTGVTV AGSPPKSQKR NTRATPSISK SVWIPRHQMV ELLQECCEEQ LYNDSKGVGS IQVCMGKEVA SMVDTAETAL PESQKGPSIT VHCRDGSMYG ADLVVAADGI DSAVRAQLRD PAQSESWLAS KARSFVVRRY RSPATGLKLK ALQLPPNFTM IDHDGTLVTT QAQNLYSIRG TNQGTRDFAS LGLLPMKDPD MVRPANIITR PDHELWKLAD GSAVKAWFQS NWPRLNWDEM VDDKEWERFV QAKTTTFPYC QYVPGSVVVA PNESAGVVLV GDACHAFPPD IGQGINAGLQ DVVALDLALQ DREIDLCQDS VPPSWSPSAS APTLGQGLLR YQRNRRAEHG ALIRLARFGS PYQYRQPWIR DRIGRVCWSA NVAFRLLLNK ATFGWVPPAA ILLAQNVNLS YRQVMRRADT TARVLQSTVL AAVAWLLVRK FSLL
|
| |
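The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 488854
end_bp = 490755
gene_length_bp = 1902
protein_length_aa = 584

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon,
# so the coding region is (residues + 1 stop codon) * 3 bases.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the listed gene sequence not accounted for by the coding region.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the listed Gene Length, and the coding region is shorter than the listed gene sequence, which would be consistent with the sequence including some bases outside the translated region. That reading is an inference from the numbers, not something the record states.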