Gene Information |
Locus tag | PHATRDRAFT_2936 |
Symbol | |
ID | 7195696 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 300093 |
End bp | 303036 |
Gene Length | 2944 bp |
Protein Length | 344 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183965 |
Protein GI | 219127485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGCTTGCG GTCTTTCCAA CTTGGGGAAT ACGTGCTACG CCAACTCTGC CATACAATGT ATGAGCTACC TCCCATTGCT TCGGGCATAT CTTCTCAGCT CGCAGTACAA GACGGCCGGT GATTTGAATA GGGACAACCC TTTGGGGACC GGCGGAAAGC TTTTGGAAGA ATTCGCCGAA TTGCTTCGAA GTATGTGGAG CGCAAAACTT GGAGAAAAGT CTCCCGTCCG CTTTCGGTCA CAACTTGGCA AGGTCAATGA CCAGTTTTCC GGGGCAGACC AGCAAGATGC GCAAGAGTTT TTGAACTACA TGTTGGACGT ACTTCACGAG GATAGCAACA AAGTGCGACA AAAGCCATAT GTTGAAGCTC TTGAGGACGA ATGGGTTATG CGAAACCACT TGCCCCGAGT TGGGGAAGAA GCCTGGAGAA GGTTTGTACA TGCAATCATT TTGTGTGTGG TTTAGAGGGA TGCTAACTCT TCGTTGCTTG TCTTGCAGGT TTCTCCGGCG GAATCGCTCA ATCATGGCTG ATGTTGCCAT GGGACAGGTT TTGAATACGG TTACATGCCC CCAATGCAAT TTTTCAAGCC GTAACTTTGA TCCTTTCAAC CTGCTTTCCA TTCCGATTCC CACTGTGGCG GATGTCACCT TTCAGTGCAC AGTCTACCGA CGAGCTACTG CTGTGAACTG TCCGTGGGTC CTTAACAGGA CACGGAAGGG TGACAAACGC CCCGCACGCT ATCCCAGAAA GCGCTCTGGC TCGTCCACAG GTCCTCCATC TGTAACGTTC GTTGCTGAGC GGTACATGAT TGCCATGTCA CGACTGGCAG ATAGCGGTGA TCTTCGTCTC CAGATTCAAA ATATGTGCGG TATCCCAGCG AGTCAGTTAA AATTATGTCG TTCTGAGGAA GTGCTTACTG ACGATAAGAA ATACCATAGC ATTCTTCAAA GCCAGACGAA GGTAATTCCT TTGACCGACA AAGAAGGCCC TTGTAGTCAG CTCGCTAAGC CACGCACAAG CGAGGATTCA GCAGACAAGC CCACCCACAT TGTAGCGTTC GAGACAACTC TGCAGCTTTC TAACGTTACT GCCACTGTAA ATCTGGAGTC CCTTGAAGAC ACGGCTGACG AAGACGAGGA TGAATATAGT GATGATGATA TTGCGCCAAG CCCAAAGGAG CAGAAGCTCA TTGAAAAGCA TCTAAGTGTG TACGGGGACG CTAAGGAGTG TCGTGTAGTG GACAGCGACC CGTGGTATTT GTCGAGGGCC GTATCTCGAA GCCTCTGGCC GCGTTCAGAA AAGGAACTCA AGCTCGGTTT AAGGGTAGAC GCGAAAGATC AACGCGGAAG CTGGTTTGCT GGAACTGTTG TCGAGGTGTT GAAAGGTGAC GTCGATGCCG ATATTGGTGA AGATGTGGAG CTTCCTGAAA AGAAGGTCCT AGTGCACTTC GACCATTTTT CATCGAAATG GGATGAGGTG TATTCCATTA GCAATTTCAA GGACCGTCTC GTTCGACCAC TCTTTTCGCA CGCCACTCCA CGCGGTAAAC CCACAGAATT CTTGGTTCAA AATCGGTTCA CTGACCACGA CACTGGCCGA TTCGTGGCGT TTGGTCAATC CTTTTATGTG CAATGCTGTT CCGAGTGGAG TAACGCTCGT GCTGGAGCGC AAATAATGGC ACAAGCTTCT CGATTTCTGC ATCAAATGCC AGGATGGGGT GACGCATCGG ATGTGGCCGA TTCGGGAACA ATTGACCGCG AGGCCAAGGC CCGTAAGCTT TACGAGAGAA CACACGGGGC 
CATTTCAGAT TTAATCGATC TCTTGGTGGA TTGCGATCGA GAGTACCTGC GACTGGCTTT AGGTCTATCG GATCACAAAA GCAAAGACGA TGACGCGAAA CCGTACCGGA ACCCGACGTT TGATCCAACC AGTTTGTCGA CGTTGCTGGT AAAAAAGGTG TCAGCTCTTT TGCATCGCAT TCCGTTTGAA GTTCGCGTCT GTACCGTCGA CAATTCTCAA CCCGACAAAC CCAATACGAT TTCTGAAGAA GAATCCTTCC CGTTTTCTTT GATACGAACG GTTGGCAATT ACATGAATGC TCGTCACGTT ATCATTCTCC AATGGCGCGA TCCTCCGACC GACAAGAAAA CCCCGAACTC GAGTAGTACG AGTACCGGCG CCAGTTACGT CAACTGTCCC GTGCTGTACG TCCCACCGCC TATCGTCGAA GACCAAGCGA GTGCCGATCT TGTTCGAAGC AAAGTCAAAG CCGACGAACA AAAGGCGCAA GTAGCCGGTG ACGGTATGGA TCTTGGAGTT TGTCTGACCG AGTTTTGCAA GACGCAGAAT CTTTCCTTGT CGGACAATTG GAAATGCCCT CGGTGCAAAA AGTTCCGCCA GGGCCAGCAG AATATGAATC TGTGGCGCCT GCCGGATTTG CTCACCTTTC ATCTCAAGCG CTTCAACATG TCGGCGCGGT GGCGGGAAAA GCTGACGACG AAGATAAACT TTCCGTTGAC GGGTTTGGAT TTGCGCCACT GGTGTCACAA GGAATCTCCC GCCGTCCAGG TGGATCCGAT GGAGAGTTCG GTGTACGATT TGATCGGTGT GGTGAATCAT TACGGCAGCA TGACTGGTGG GCATTACGTG GCGACGTGCA AGGCTACGGC GTGTACAAAG GACGGCAAGG AAGAAACCGC GTACGGATTC AACGGGGTAA ACACGAGTGT GTTGGACGGG GAAGGGTCGG AGCCGAATTC GGGCTGGCGT CTGGGACGGC AAAAGAACGA ATCGAATGTG AGCAAGATGG CGGCGCTGGA AGCTTCCAAG GCCGTGTCGG AGTCGGCCGA ACCGCTGTGG TTGCAGTTTG ACGACGAACT GGTGGAACCG CTGGCACCGG AACACGTAGT TTCGGAAATG GCGTACGTAT TATTTTATCG TAGA
|
Protein sequence | GACGLSNLGN TCYANSAIQC MSYLPLLRAY LLSSQYKTAG DLNRDNPLGT GGKLLEEFAE LLRSMWSAKL GEKSPVRFRS QLGKVNDQFS GADQQDAQEF LNYMLDVLHE DSNKPGEGLY MQSFCVWFRG MLTLRCLSCR FLRRNRSIMA DVAMGQVLNT VTCPQCNFSS RNFDPFNLLS IPIPTAQVAG DGMDLGVCLT EFCKTQNLSL SDNWKCPRCK KFRQGQQNMN LWRLPDLLTF HLKRFNMSAR WREKLTTKIN FPLTGLDLRH WCHKESPAVQ VDPMESSVYD LIGVVNHYGS MTGGHYAVSE SAEPLWLQFD DELVEPLAPE HVVSEMAYVL FYRR
|
| |
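The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based inclusive coordinates (which is consistent with 303036 - 300093 + 1 = 2944) and that the spaces in the sequence fields only group residues into blocks of 10. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the record above.
print(gene_length(300093, 303036))  # 2944, matching the Gene Length field

# First two blocks of the protein and gene sequences, for illustration only.
print(len(clean_sequence("GACGLSNLGN TCYANSAIQC")))
print(gc_percent("GGTGCTTGCG GTCTTTCCAA"))
```

Passing the full Gene sequence and Protein sequence fields to `gc_percent` and `clean_sequence` would allow the GC content and Protein Length values to be compared with the record; the rounding rule behind the reported 51% is not stated in the record.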