Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28684 |
Symbol | |
ID | 7202525 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 455523 |
End bp | 457613 |
Gene Length | 2091 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181559 |
Protein GI | 219122453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGATCTAC CAACAATGGA CGAAAATCGC AAGCGCAAGG CTTCGGAAGA TATTGAGGAA GGGATTGTTT GTCCATATTT GGACACGATT CAGCGATCGT TGTTAGATTT CGATTTCGAG CCGGCATGCA GTATTTCAAT GCAAACAGGC CCACACATTT ATGGTTGTCT CGTTTGCGGC AAGTACTTTC GCGGACGTGG AATCCAGACA CCAGCTTACA CACATTCAGT AGAAGAATCG CACTTTGTGT TTGTGCATTT GACTAACGGG ACATTTCACT GTCTGCCGGA TGATTACGAA ATTAAAGATA CGTCACTAGT CGATATTCAA GATGCATTGC ATCCGAAGTT TTCCCCGAGC GAAATTCGAA CAATCGACAC CCACACAGAG CTCAGTCGGG ACCTCTTTGG ACGACGGTAC CTGCCAGGCT TTGTTGGCTT GAATAACCTT CACAAAACTG ACTGCGTGAA TGCCGCCGTA CAGGCACTGG CACATGTCCA GCCATTGCGT GATTTCTTTC TATCGAAAAG CCATAACGAG TCACTCCTGT CGTCGAAAAA ATCCCAAGCG TCGAACCGAC TTGCTCATCA CGTGGCACAA TGTTTCGGAG AGCTTGTGCG TAAAATTTGG AGTTCTAAGC GTTTTAAATC GACGGTCGAC CCCCACATGC TGATCCAAGC AATTGCCACT GCCTCGAAAA AACGCTTCAA AGTCGGTGTA CAGGCCGAAG CGGGGGAACT TGTGGCGTGG TTGCTGCATC GGTTGCATGT CGGGACAGGT GGAGGTCGTA AGGCTGGTAG TAGTATTGTG CACAAAACAT TTCAAGGGAA AGTACGAGTC ACGACAAGAG AAGCAAAGCG GAAAAGGTTG GAAGCGAAAG CTGAAGAAGA CGACCGATGG GGAAGCGAGG ATGAAGGCGC GACTGAGCAG GAAGGTCTCA AAATGAATGA TCAAGAAGTG TTAGTAGAAA TTGAAGAAAC CGCCACCGAT ACACACTTTC TACAGCTCAC TTTAGACATA CCGGAAAAGC CACTATTTCG CGACGAAGAC GGTGGTTTGG TCATTCCACA AGAACCGCTG GTGTCTGTTC TGAAAAAATT TGATGGTGTT ACTTTTTCAG ATGCCCTCAA CCGCAGCGGC GTGGCCCAAC GGAAGCGCTA CCAACTCCTA AAACTACCGG ACTACTTAAT CTTACACTTG GCTCGCTTCA AAGACAATCG GTATACAAAA GAAAAAAACC CTTCAATTGT CATGTTTCCG GTAAAGAACC TTGATCTTGG CGAGTACGTG CACAAGGAAA AACAAAGTCT ACCAACTGAG GAGCAAATTC GAGGAATGAC TGTACGTCAG GTGTGCGTTT TCGTATGATT TCGCTGCTAT TTTTCAAATC GTTGTCTGAA TAGTGTTTTC GTGATGTCAA CAGGTAAAGG AGCTAATGGC ACTGCTTGCG AAACACGACC GCACCGCTTT AGGAGTATCT ATGCTAGAGA AGAAGGAGCT CGTTGACGCA ACCGTGGATT TTTTTTTGAA GAGTTTGCCC GACTTGCTCT CTGAGAAGTA CGATTTGGTT GCGAATATAA CGCACGAGAG TCCTGCTGAC GTTGGTCGCG AAGGTCAACA CGACCCATTG CAGGACGGCC ACTACAAGTG CCATGTGCAG CATCAAGCCA CGCGACAGTG GTATGAAATT CAAGACTTGC ACGTTCAAGA GATTATGCCG CAGCAAATTG GACTTTCCGA ATGCTATCTT CTGATCTTTC GAAAGTCAGG ATTGTAAGTA AATACTAGAT TTCAAAACTA CCAACCTTCA ATGAACTTTA GAGACGTAGT CTTTGTCGGG TCCTGAAAGA GCCGGTGGAA CAGCCAAGAA CAGTCCATAA ATATCCATGT TCTTACGAAC AAATGCTGGA TCCAACTCCT GCGCTCGACA CCATTTCAAT GCGGCTTGTT CCATACGCTC TCGTCTCTGC AATGCCGTGT CCGATTGAGA AGGCAAGTCG CTGTCTTCGT TTTCGTCTTC TTCGTCCGTC GACCGTCCCC ACGGGAGTGA CGACGGAAAG AATGCACGAT CTCGTTGCGT ATTGGGATCC T
|
Protein sequence | MDENRKRKAS EDIEEGIVCP YLDTIQRSLL DFDFEPACSI SMQTGPHIYG CLVCGKYFRG RGIQTPAYTH SVEESHFVFV HLTNGTFHCL PDDYEIKDTS LVDIQDALHP KFSPSEIRTI DTHTELSRDL FGRRYLPGFV GLNNLHKTDC VNAAVQALAH VQPLRDFFLS KSHNESLLSS KKSQASNRLA HHVAQCFGEL VRKIWSSKRF KSTVDPHMLI QAIATASKKR FKVGVQAEAG ELVAWLLHRL HVGTGGGRKA GSSIVHKTFQ GKEGLKMNDQ EVLVEIEETA TDTHFLQLTL DIPEKPLFRD EDGGLVIPQE PLVSVLKKFD GVTFSDALNR SGVAQRKRYQ LLKLPDYLIL HLARFKDNRY TKEKNPSIVM FPVKNLDLGE YVHKEKQSLP TEEQIRGMTV RQKYDLVANI THESPADDGH YKCHVQHQAT RQWYEIQDLH VQEIMPQQIG LSECYLLIFR KSGL
|
| |