Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42815 |
Symbol | |
ID | 7196174 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1212024 |
End bp | 1214006 |
Gene Length | 1983 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177241 |
Protein GI | 219110979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.952072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATT TCCAAATGAT CACCAAAACT TGTCTTTCTG TCCATGAAAC GGGATGTTTT CATGTTCTGG GGTTGCAGGC CGAGCTACCC CAAGAGCAAA TAGCGGCTGC GGTGTTGTCG CCGAAGGCTG TCAAGAAAAG TATTTTTGTC GACATTGATG AGCCCTACAC ACGTAAACAG GTCCAAAAAG CCTTTCTGAA GCGATCGGAT ATTTTTCTCG TCACCATGGG TCCGGGTGAA GGAATTGATG CGGTCAAGGT TCCAGGTCAT TGTGAATTCC AGTGGGCCGA GTACGAACGC ATTGACTGGG ACGCAGTCTT AGCGGGTAAG CATGGCGCCT CGTCGTACTG TATCCGGAAA GGATTAAGCA GGAAAGCCCA GCTTGCTTAT TATACCCATC GGCATGTCTG TAAGCACCCC AACAGTATCC TTAGTTATTG CATACCAAAA ACAGTAATCC TGGATACCTG GTCCGTCTGG GACGATAATG CTGGGACGAC AAACCATGAA GGATTTGCAG ATCTCGTTGT TTCTATGGGA TCTCCAGCGA TAACCTCGGG TACATTGAAT CGTCGCTGGA GGCTAGATCA AAGCCTAATT GAAGCCAAGC GGTTGATGCA ATGCGTCAAT TATAACGGTA GTCAATGTGA TGGCGATCCT TTTCCCATCT GGATCCTTAA AGGGAGTACC GTGAACAAAG GTGCTGGGAT CTTTCTTATC CATTTTTATG AGGAGTTAGT GGATATTTGC TGGAGTGAGC CACAAATCCG GGAGTGGTGC GTAGCTGTGA GCCATGTACC TGGAGTAAAT TGAGCAAGGG GCCCTCACAA TTTATGTTTC TATTCATGTA GGGTGTTACA ACAATACATT ACAGTGCCGC TACTGCTGCG GAAACGCAAG TTTCACATTC GGGCTTACGT CGTGGCTGTT TCTGCAATTA AAGTCTATTT TTGCCAAGAA TGTCTAGCTC TCTGCTCAGG AACGAAGTAC AAGAATTGCG ACACGAAAGA TCTGTTCTCT CACATAACAA ACACTGCGTA TCAGGATTTG GACCCTGGCT TTAGCGAAGA GAACTGTATT TTTGTTTGGA ATGAACAGTC GATTGTTCCT ATCCTGCTTC GAGATAACAC CTGTGAAAAT GAGGAGGATG CTTGTGCGAA GGTGAGGAAC GTAGTTCGCG ATATGGAGTC TATTGTTGCC GAACTATTCA ACGCCTACAA ATCAGAATTC GGGGTTTTTG CACCAATTGA AGGATGCTTC GAGCATTATG GTCTTGATTT TTTGGTGGAT GACAACTGGA ATGTCTATTT GCTAGAAGTA AATCCTGGGC CGGACTTCAA GCAAACCGGC ACACGGCTTC AGGATGTTGT CGGGAATTTA ATGCTAGCAA CAATTGACGC CGTATTTGAA ATATCAAATC GGAAAGAGAT CGGCACTCTC AAGTTAGTGT ACGAAAACGA GACTCGAGGA GCCAAAGTGA AAAGTGGTAT TAATATAATG CTGACATGAC GCAGGATCGG GAAGCTGAAG CGCAGTATAT TAGTAGTGGT ATGCCGTCGT CTAGCCACGA AGGTGCCGGT TGAGATATTC GATAAGTTTG AGATCATACA GACTTTCCCA TATCCCACAT ACAAGGAAAT AGAGACCGAC AAGTCACGCA AAAGACATCT TAATCCGTAC GAAAACAAAT CCTGCTTCGT GTGTTAGACA ACTCCATTAG CTCGCAGATT TCCTTTTCGC ATTGACGATG TGTACGAATC GTCGCTTCGC TGCGGAAGCT TAATGCTCGG AATAAGATTT TTTTCCTGTT GCGCGTATTG GAGGTATTCC GTAAAGGGCA AGCTTTCGAG GAATACTGGG CATCCCCAGA TGTAGGGGTA CATTTCGTCA ACAAGGGTAC AAGACTGACC ACTACCTGTG GATTCGGCCA CGCCAAACAA CCTTTCCTCA AAACCATACG TCAGCGTCAG CGCTTTCCAA CCA
|
Protein sequence | MSDFQMITKT CLSVHETGCF HVLGLQAELP QEQIAAAVLS PKAVKKSIFV DIDEPYTRKQ VQKAFLKRSD IFLVTMGPGE GIDAVKVPGH CEFQWAEYER IDWDAVLAGK HGASSYCIRK GLSRKAQLAY YTHRHVLILD TWSVWDDNAG TTNHEGFADL VVSMGSPAIT SGTLNRRWRL DQSLIEAKRL MQCVNYNGSQ CDGDPFPIWI LKGSTVNKGA GIFLIHFYEE LVDICWSEPQ IREWVLQQYI TVPLLLRKRK FHIRAYVVAV SAIKVYFCQE CLALCSGTKY KNCDTKDLFS HITNTAYQDL DPGFSEENCI FVWNEQSIVP ILLRDNTCEN EEDACAKVRN VVRDMESIVA ELFNAYKSEF GVFAPIEGCF EHYGLDFLVD DNWNVYLLEV NPGPDFKQTG TRLQDVVGNL MLATIDAVFE ISNRKEIGTL KLVYENETRG AKDREAEAQY ISSGMPSSSH EGAG
|
| |