Gene Information |
Locus tag | PHATRDRAFT_48951 |
Symbol | |
ID | 7195236 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 126063 |
End bp | 127547 |
Gene Length | 1485 bp |
Protein Length | 445 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183555 |
Protein GI | 219126630 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATTACTTCA TTTCGCAAAA AAAAGTATTG ACAAAATCAT GAGACCTAGG AAGGGATCCA GAACGCGGAA GTCGACTAGG GGAAGGACAT GGCAAAAGGA GATTGAAGAC GATGACATCG GATATGCTGC AAAGCTCGGG ACGAAGCTCT CAGATCTCTC GAACTCTTCA GAAGAGGAGG TCGGTGCGAG CACCGCTGAC GAAAGTACTG ATAGGTTTAT CTCTTTCGAA AGAGACCTTC GTCAGAAGCA TTCAAAAGCT TGCAAAGCTA TGTCGGTAAG AAGCGGGGAA TGCTCTCGAA AAATTCGGCG AACGAATATG ACTAATCCAT TTCCCTTCTC CTTTGCAGAA AGCTGTCTAT CACGTAGCTC TTGAAGAGTT CGAATCAATT CTTGCTGACC TTTTGTCTCG CTACGGTGAA CGGCATGAAC GTGTCGGTGC CGCCTTGCAC AACGTAGCAA TCGCCAATCT CAGAGCCGGG AGTTTAGACG ACGCGATGGA TGCTATTGAA GAGGCCATTA AAATTCGGTC TCGGGCTGTT GGTCGATCAC ATCCGAAAGT AGCAGATTCC CTGGTGGAGC TGGGCATTAT CTTGCTTTCC ATGGAAGAAC ACGACGACTC ACTCAAAGTC TTCCAACGGG CATTGAAGCT TCGGAAGGAA GAGCAGAATG ATGTTCTGTC GGACGACGAT TTAGACGAAA GCAATTTAAA GATCGCCAAG GTTTTGAACA ACATTGGCTG TGTTAGTTTC GAAAAAGGAG AGCTGGTAGA AGCCAAGCAA TCATTTGAAC AAGCAATAGT CCTCCAGAAA GGGGTCTTTC ACAGCTGGTT CAACATGCTT TGTGGAGCAG ACTCCAACAG TCCTGGTATT TTGACAATGG CGTCAACAAT GTGCAACAAG GGCTACGTCG AAATCGAGCA AGAGAACTAT CTTGAAGCAG TAAAAGTTTT CACGGAGTCA TTACAAATTC AAAAATCAGT TTTAGGATCG GGCAATAAGC TTGTACAAAG CTCACTCGAC AACCTTGGCT ACGCTTATGT AATGTTGAAT CAAAATGAAA AGGCTTTGAA GGCCTACGGA GAAATATGGA ACGCTCAGAG ATATTCAAAC GATCCAAAAG AAGAGAAAGT TGAGACTCTA CGAAAAGTCA TTGCGTCTTA TGGACAGCTG AAGGATTGGG CAAACGTGTT TCCAGCTCTC GAAGCTCTTG AGGATCTGTT GCTCGATATG GATGACGATA AAAAAGAGAT GACAAAGACA AGGAAACTGC TAGGCGAAGT CAACTATCAG CTGTTGAAGC TGCCGTCGCT TTCGGGTGCG ACAACGCGTG CCTTCGGTTG TGGTGTGTGT ACTGGTCCTA CTGAGGAGGA AGTGAACTTG GATGATTGGT TGATAAGAAA GCCCGACAAC ACAAGCAAGA TGTCAGGACA CCGAGTTACA CATGCCTGAA TCTTGAAAAG GTCCGCATGC AGGCCTTTTA GACAC
|
Protein sequence | MRPRKGSRTR KSTRGRTWQK EIEDDDIGYA AKLGTKLSDL SNSSEEEVGA STADESTDRF ISFERDLRQK HSKACKAMSK AVYHVALEEF ESILADLLSR YGERHERVGA ALHNVAIANL RAGSLDDAMD AIEEAIKIRS RAVGRSHPKV ADSLVELGII LLSMEEHDDS LKVFQRALKL RKEEQNDVLS DDDLDESNLK IAKVLNNIGC VSFEKGELVE AKQSFEQAIV LQKGVFHSWF NMLCGADSNS PGILTMASTM CNKGYVEIEQ ENYLEAVKVF TESLQIQKSV LGSGNKLVQS SLDNLGYAYV MLNQNEKALK AYGEIWNAQR YSNDPKEEKV ETLRKVIASY GQLKDWANVF PALEALEDLL LDMDDDKKEM TKTRKLLGEV NYQLLKLPSL SGATTRAFGC GVCTGPTEEE VNLDDWLIRK PDNTSKMSGH RVTHA