Gene Information |
Locus tag | PHATRDRAFT_49149 |
Symbol | |
ID | 7195649 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 70398 |
End bp | 72420 |
Gene Length | 2023 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183804 |
Protein GI | 219127150 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.536762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA CCCTATTGAA TGGATCAGGT CCAATTCAAG GCTTGAAACA TCATGTACAA CAGCGTGGCG CTGAGGTGCT CTCCTCTTTT GACAAGAATA TCCCGCCTTG GCCGACGCAT TTTGTCGTGA GTGAACAAGT CCAGGAGCCA CTTTCGATTG CACAAGCACT GGGTTTCGAT TCCTTGGAAG AAATGTCATC TTTTCTGTAC GACCATAGCA TTGTTTGCGC CACGCGGAGG TGGGTTCATC GCGGGGATCG ACTGGACAAG CCGCCTCTGG AGGAACCTAC CATGATGGAA ACGTATCTTG GAATAGGTCC TAAGCGAAAA TGCAAACGAC ATAGAACGAA CAAAAATGAA GAGTCGAACG ATTCGCAAAG TTCACGGTCG AATGTTTACA AAACTAACCA ATCTCTATCG GAAGCCTTTC GGACCTTGTC TAAACGTCAC CAGGAAATGC CGCTGAACGG AGAACTGGAT GCTTGGAAGT CTTATTCGTT TCAAATTACG GCGGGTCGAT TGCTGCATCT AGGCTTCGAA ATCAGAGATA GCCCGGAGGT TCTTCGCCAT CTCGCGTCGA TCAACGGGTT CGGCTCGTCG ACGATGGATA TCATTACAGA TTATCTACGA ACGCAGCAGT GCAGCCGTTT GCGTAATCTG GAATCAGATC CGGATCGGGT TGCTATGAAG AATATGATGA ATATCTGGGG TGTAGGTCGG GTTCGAGCCA AGGAGTTGGT GGATGCTGGC TTTAAGAGGA TTAATGAGGT TCGGCAAGCT GTCGAATTAG GGAACCTACA ATTGGAAAGA AACCAATACA TTGGCGTATT GTGTTACGAC GATTTGCTGG AAAAGATGGA TCGAACGGAA GTCGAAAGTA TCGGTAAGAT TATATCTAAC ATTTTCAAAA TGTCCTATCC TGAGGCGGAA GTGTGTGTAA TGGGAAGCTA TCGACGAGGG AAGCACGCTT GCGGGGATGT TGACATTCTT ATTACACACG AAGATTATAA TCACACGGTC CCACCGAAAG CACTGGGACA ATTTATCGAC GAACTACGGC AACAGGGACA CATCGCATAT CACTTGACAT TTATCTCCGG CATGAAGCAT GAGCTATATG AAACAATCCC AGATGCACCA AGTCACTGGT CGCCGCAGCG CGACAAACGA GACAAATCTT CGAGCAGCTC CTATATGGGT GTTTTCAAAT CACCTTGTAT GACGGGTAAG AAGCGCCGAG TTGATATTAA ATTTTATCCT TGGCGAGAAA AGGCTTTCGC GAGTCTTTAC TTCACCGGAA ATGGCTACTT CAATCGATCG ATGCGCCTTT GGGCAACACG CAAATTCAAC TATACGTTAA ACGACCATGG TGTTTTCGAT CGAGGATCTC TTGTTCGCGT TTTAGACACG ACTTCCGAAA AAGAAATTTT TGAATTTCTT GATATAAGTT GGAGGGAACC CAAGGAAAGA GATTCCTTTG ACGCTGTGAA AGGCAAGAAA AATGGCGAAA GTGCAGCGCA ATTAGAAGGT TTTTCAAGGT CAGAGGTTTC ACGAGAGTCA AGAGATCACA GATGGATTGT GTAAACAGCT TTTGGCCGCT GTTGCCTGGC ACAGGCTGTG CATATTGAAC GCAACATTGT CAAATGATCA ATCATTTAAA CTTCGAGTCT CGATATATTT TACATTAGCT CGCATCGACA AGGCTTTTTA CGATTGATAT TCCGCACTTG TCCGTGAAAA CGAACTTCTC GTATACCTTC AACGAGTGAA AGCAAAATAT CCTCTTATGC ACGAAAGAGA 
CGCCGTCTTC TAAACAGAGT CCGGTATTGC TGCTACCGCT ACTATCAGTT TCACGTCGTT CCGTGAGCTC AGTGGGAAGG GACGAAACAC GACATCTGGT GGAGCTCCAC TTGACAGGCC AATCCCGGCA TCGACACGAT TCCTTGTCCC ACACAACAAG ATTCCTCGCG ATGACGTTAT GCACAACCAA CTCTACGACT CGAAAGCACT AACGAAAAAC TGA
|
Protein sequence | MNKTLLNGSG PIQGLKHHVQ QRGAEVLSSF DKNIPPWPTH FVVSEQVQEP LSIAQALGFD SLEEMSSFLY DHSIVCATRR WVHRGDRLDK PPLEEPTMME TYLGIGPKRK CKRHRTNKNE ESNDSQSSRS NVYKTNQSLS EAFRTLSKRH QEMPLNGELD AWKSYSFQIT AGRLLHLGFE IRDSPEVLRH LASINGFGSS TMDIITDYLR TQQCSRLRNL ESDPDRVAMK NMMNIWGVGR VRAKELVDAG FKRINEVRQA VELGNLQLER NQYIGVLCYD DLLEKMDRTE VESIGKIISN IFKMSYPEAE VCVMGSYRRG KHACGDVDIL ITHEDYNHTV PPKALGQFID ELRQQGHIAY HLTFISGMKH ELYETIPDAP SHWSPQRDKR DKSSSSSYMG VFKSPCMTGK KRRVDIKFYP WREKAFASLY FTGNGYFNRS MRLWATRKFN YTLNDHGVFD RGSLVRVLDT TSEKEIFEFL DISWREPKER DSFDAVKGKK NGESAAQLEG FSSFTSFREL SGKGRNTTSG GAPLDRPIPA STRFLVPHNK IPRDDVMHNQ LYDSKALTKN
|
| |
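The Start bp, End bp, Gene Length and Protein Length fields in the Gene Information block can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information block of this record.
start_bp = 70398
end_bp = 72420
protein_length_aa = 570

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not needed to encode the protein.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2023
print(coding_bp)       # 1713
print(non_coding_bp)   # 310
```

Under these assumptions the computed span matches the listed Gene Length of 2023 bp, and the 310 bp not accounted for by the 570 aa protein would be consistent with the record marking the gene as spliced.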