Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33896 |
Symbol | |
ID | 7197692 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 612404 |
End bp | 615204 |
Gene Length | 2801 bp |
Protein Length | 866 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178543 |
Protein GI | 219115495 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCAGC CGTCGTCGTC CCACAATTTT GATTTGTCCG ATACGTCCGC GTATCCAGTG AGGTCGGCGT TCCCCGAGAG GTTTTCCAAG AACCACGTCG GGCCTTTCAT TGTTTCGGAG CAAGGGTCGA CCTCTAACTA TCGCAAGTAC CAGCACTTCT AACACTTTTC AATTCACACA CATCCCTTGC GTAACCGGAG GATACGGAAT CGTACAGGAC TGCTGATACA GGGCACCAGC TTTTTACATT TGTGAGAAAA TGCCCAAAAT AGCTCCGAAA CGCCAATCCA AAGGTGAGCA GTCGGGCAGT ACGGTGCAGC CCGTTGCCGT TGCAGACAGT GCAGAGGGTT TCCGGTATAT CACAACGCAA GAGCGCGTCT TCTCACCAAA TCTTCCTTTG CTGTGTAGAT TTAAATTGCG CGGACGACGC GTCGTCGGCA GGATCCTTGG ACGTTGGCGC TTGTTGTCTT TGTCATTGTG CACTGGACTA CAGCGATCGC GCCGCTTTCT TTCGAGATGA TCGGCACGAA GATTATCACG AAGACAGCGA TCAGGAGGAA TATTTCTTTC GAAAGAGCGA TCCCTATCTA CCGAACACCC TCTACGATCC TTCTAATGCA CTCGTGTACT GTGATTCGTG CGATCGTATG TACCACCAGA AGTGTCACTT TGTGCCCTTG ACCGTTGTCC CGCGAGGAAA GTGGGACTGT TTGGTGTGTC AAACACAAAG GGAAAAAGCG AAAATCTCGC TCAAGAAAAA GAAACCGGCT ACCGGCAATG ACCATTCAAA ACGAAAAACT CCTGCGACGA GCCAAGACCA CGCTATTCCT CGTCTATCGT CGTCCTGTTT TCAGTCTCCT CCCAACCCGG GCGTGCGTGA GGAGGAAGTC GCTTGGGAAA GCGCCTGTCG AGACGTCAAA GCGGCCCTAT GGAAAGCCGA GCTACACAAC CGTGTTCCAC TCCAAGTCAA CTCGCAACTG GCCAACGTTC GTTTGGCCGA GACGGCGTTG GAAACCTTGA CGAGCACGGC CAAGAATCGA TCGCATTTCG CACAAAATTC CCAAGAACTG GCACAGTGTA TGGTGCGATT GTACGGGAGC CGGCGCAAGC TACGACATGT GCTGCTGAAC CTACAAGATC TCATCCGCGG GGACCAGGAA ATGCGATGGA ACATGCTACT CCGCTTTTGT CGGGATGACG CGTCCCCGGA ATTCGTAGCA CGAGTCGCCT TTCCGTTCGG AATGTCGCAC TCGCGACGAG TTGATCCTCG TACTCCGGAA ATGACACTGA ACGCAGAAGA ACACGCATCC AATACAGTTC CTGCCGAAAT TTCTCTTACC ATTAACAGCG AAACGCAGGC AAATTCTGAA CCGGCACAAT CCATAACCGA CCAGTCGAAA CCCATAGCAA AGCATGATGG TGATAGTGGT ATTTCGTTGG ACAATTTGCG CTGCTGCGTC TGTCACCAAA GTGAGGCTAC GGATGAAAAC GATATGATCA TGTGCGATGG CTGTGGGTGC TATCGCGCGT ATCACATGCG GTGTCTTCAA CCACACGTCA AGCCGGAAGA AGTCGAAAAC GAAGAAGACG ATTGGTTCTG TCCGCTTTGC AGCACCCTCG CCGATATGAT GCTCTTAATC CAAACAAACC ATATGGGAGA CGAATGGGAG CAGCGACGCT ACGCGGCGGA ACTGGATGGC GTAAAGAATG GGGATGACGA TTCATTGAAG TCGTGGAATG CTGTTGAGGA AGTTTTTCCT GAAGTAGAAG TCAACTATGC AGCAGCATGC GATCTTAAAG ATGGCAAACG AACAACTGCG GCATCCAAAC TTTTGGGGAG AATTCTTGGG TTGGAAGAAC AGGAGATCGA CACAGATATT GACGATGACG AGGAGGATGA GGACGACGGT CACTTCGACC TGGAATCTTT TCAAGAAAAG CGACATCAAG CTCGTGTAGA GTTATCTGGT GATAAGGAGG ATGCTAGTGA AGGAAGCAGT CAAGCTACGC TAGAAGAGAT GTCCAGTGTG GAGCTGAATA TCGGTAAAGA TGAGCTGGCG GCGCTTTCAG AGGTTTCGGA GGACGAGGAG GAAAGTGAAC AAGAAGTTCA ACGACGAAGC GTCCGATTGC GAAAAAGTGG AAACACAAGC TCTGCAAACG CAAGCGTTTC CGACCCGGGT AAGCTGGACG AGTCCAACAT TTTAAACGGG AAACGAGGGC GGAAATCTGT GGACTACCAG AAACTGAACG ACGCTATATT TGGAGAATTG TCGGACGGTG AAATTGCCAA GATTGATGAT ACTGACGATT TTCTAGTGGC AACATCCCGC AACCAAAACG ATTCTAGCGA CTCTGGCGAT GCCAATGATG ATTCAACTGG AAGCAGCCAT GTAGGCGAAG CAGATGAATT GAGCTTAGAC GACGAAAATG AAGATAGCGA ATCAGAAAAC ACAGAAAATG GTATAAAAGA TAGGAGAGAT GATAGACGGG AGGACGATCA TCTTACCGTT AGATCTTCAA GAGTAAATCG TCCATCAACA GTGCGACAAC TTCAATCAGC TTTGGCCGGC GAAGATGGTT GTACATCCGT GATATTGAGC CGGAGCGAAA ACGTAAAGAA GCGTAAATTG AAGGAAACAG CCATTCCTGA ATCTCAAACG AATGCACAAG GAGCGGGTGC AGAGACAGCG TCATCCGATG AGGCCTCTAG AAGTCGGAAA AATAGGAGAG GCAAGTTTGC CAAGGCAGCA TTGACAATGG TTTCGAAGCT TGTTCAAGTT GCACCAAGTT TAGAGAAGTA A
|
Protein sequence | MRQPSSSHNF DLSDTSAYPV RSAFPERFSK NHVGPFIVSE QGSTSNYRKA PAFYICEKMP KIAPKRQSKD LNCADDASSA GSLDVGACCL CHCALDYSDR AAFFRDDRHE DYHEDSDQEE YFFRKSDPYL PNTLYDPSNA LVYCDSCDRM YHQKCHFVPL TVVPRGKWDC LVCQTQREKA KISLKKKKPA TGNDHSKRKT PATSQDHAIP RLSSSCFQSP PNPGVREEEV AWESACRDVK AALWKAELHN RVPLQVNSQL ANVRLAETAL ETLTSTAKNR SHFAQNSQEL AQCMVRLYGS RRKLRHVLLN LQDLIRGDQE MRWNMLLRFC RDDASPEFVA RVAFPFGMSH SRRVDPRTPE MTLNAEEHAS NTVPAEISLT INSETQANSE PAQSITDQSK PIAKHDGDSG ISLDNLRCCV CHQSEATDEN DMIMCDGCGC YRAYHMRCLQ PHVKPEEVEN EEDDWFCPLC STLADMMLLI QTNHMGDEWE QRRYAAELDG VKNGDDDSLK SWNAVEEVFP EVEVNYAAAC DLKDGKRTTA ASKLLGRILG LEEQEIDTDI DDDEEDEDDG HFDLESFQEK RHQARVELSG DKEDASEGSS QATLEEMSSV ELNIGKDELA ALSEVSEDEE ESEQEVQRRS VRLRKSGNTS SANASVSDPG KLDESNILNG KRGRKSVDYQ KLNDAIFGEL SDGEIAKIDD TDDFLVATSR NQNDSSDSGD ANDDSTGSSH VGEADELSLD DENEDSESEN TENGIKDRRD DRREDDHLTV RSSRVNRPST VRQLQSALAG EDGCTSVILS RSENVKKRKL KETAIPESQT NAQGAGAETA SSDEASRSRK NRRGKFAKAA LTMVSKLVQV APSLEK
|
| |