Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39714 |
Symbol | |
ID | 7195318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 526687 |
End bp | 528282 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183622 |
Protein GI | 219126769 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAAT CACGAGCACC CTTCCGCATG CCTCGCGATT CCCCGGACGA TTCTGTTCCA CAGGCAGCTG CCAAAACCAA TCTCCTGGCG TGGAATCAAC TCGGTCTCTG GACGGAACTC GTGGAGGCCA TGGCACGCCT GGAGCTACAG ACACCCACAC CCGTACAGCA GCTCGCAATT CCTGAACTTT TAAAAGAACC ACCCCAGCAT CTAGCATTCT TAGCGGCCAC CGGCAGTGGG AAAACGTTGG CGTACGCGTT GCCCTTGTTG CAAATGCTCA AGCAGGGGGA AGTCTTTGCA GACTACGAAC GTCGGCCCAA AAGACCGCGT CTGCTGATTC TGGTTCCAAC GCGCGAACTC GTCGTGCAGA TTACATCCGT CATCAAGTCG GTAAGCCACT CAATCAAACT GAGTTCGTGC TCCATTACGG GAGGAGAAGA CTACGGGGTG CAGCGTCGAC AACTGAACCG ACCGATTGAC GTCGTGGTCG CGACACCAGG GCGCTTGACC AAACACTGGA AAGATTCGAA CCTCTTTCTA GGTAGTTTGG AGCATATAGT TGTGGACGAA ATGGACACCA TGTTGGAGCA GGGGTTTTAC CGGGAGTTGC GCCAATTGCT GTATCCCGTA TTGTACCACA AACAGGCCGA CCAGGAAATC AACGTTGAGC AAGACTTGGA TGCCAAAGCA CCGCGAATCG TGCTCACTTC TGCAACCATG ACGCAGCAAA TCCAAAAAAT TATTGGGGAT AGTGACAACA AAAAAAATCT CGTCAATGCC AAACGTCACC ATCGCAAGGT GGAAGACGCC GGCTTTCAAG TCAAAGTCCC AATGGTCCTG CCAAGAACGA AGGTACTCAA AGCAGCAGGA TTGCACAAGA CGGTCCCGCG ATTACAGCAG GTCTTTGTTG ACGTCGGAGC CACGGACAAG TTGAGTCTGT TGGTGGATAT AGTATCGAGT GGTGGTAGCG GCGCCGCCGT AGCGGCGTCC CTCACCGACC AACAGGCATT GACCATGATC TTTTGCAACA CCGCAGCTTC CTGTCGCGCA GCCCAATTCG CACTCTCTGA AGCACGGATA GAATCCTTGG CCTATCACGG GGACTTGAAC TCCGCTATGC GTTCCGAAAA CCTGAAACGA TTCCGTGCGG CGGGGAAAAA GAACTGCGAC AACGCCTTGG CCGAAGAACC CCGCGTGTTG GTGTGCACCG ACCTGGCGGC ACGGGGCCTA GACGTGCCCC AAGTGGATCA TATCGTCATG TTCGACTTTC CGCTCAACGC GTTGGATTAC TTGCATCGAA GTGGACGTAC GGCGCGAGGT GTCGGAGGTG ACCGCACCGG CAACGGTCGT GTTACGGCCT TGATATCGAA GCGCGACAAG GTCTTGGCGA ATGCTATTGA ACAAGCTGTA TTGCGAGGAG ATACTCTGGA TGGACTGAGT AGCCGAAAAT CGGATTATCT ACCCGGTGCT CGACTCGGAA ACCAAGGTAG GCCTGTAAAC AAGAAAAATA CAGCCCGGCG CGGCGGCGGA TCCTTTGCCC AAGGCAAAAA AGAGCAAAAG CGCAAGTCTA GTTCGTCGAG GGGACCCCGT CGTTAA
|
Protein sequence | MSKSRAPFRM PRDSPDDSVP QAAAKTNLLA WNQLGLWTEL VEAMARLELQ TPTPVQQLAI PELLKEPPQH LAFLAATGSG KTLAYALPLL QMLKQGEVFA DYERRPKRPR LLILVPTREL VVQITSVIKS VSHSIKLSSC SITGGEDYGV QRRQLNRPID VVVATPGRLT KHWKDSNLFL GSLEHIVVDE MDTMLEQGFY RELRQLLYPV LYHKQADQEI NVEQDLDAKA PRIVLTSATM TQQIQKIIGD SDNKKNLVNA KRHHRKVEDA GFQVKVPMVL PRTKVLKAAG LHKTVPRLQQ VFVDVGATDK LSLLVDIVSS GGSGAAVAAS LTDQQALTMI FCNTAASCRA AQFALSEARI ESLAYHGDLN SAMRSENLKR FRAAGKKNCD NALAEEPRVL VCTDLAARGL DVPQVDHIVM FDFPLNALDY LHRSGRTARG VGGDRTGNGR VTALISKRDK VLANAIEQAV LRGDTLDGLS SRKSDYLPGA RLGNQGRPVN KKNTARRGGG SFAQGKKEQK RKSSSSRGPR R
|
| |