Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49878 |
Symbol | |
ID | 7198594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 171242 |
End bp | 172996 |
Gene Length | 1755 bp |
Protein Length | 533 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184664 |
Protein GI | 219128952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0245361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCTTATC CCTTCTGCGA AAGACTTTCA CGTTCGTACC AGCGAACACA GCACGGAGAT TTCGTATTTG AAAAAAACGT ATGGACCAGA AGGCAACAAA CATAGTTTGA CATTGTTCGT GTTGCTCACT TGGAGAGATC ATACCGCACT GCAATGAAAG ACAGAGCCAT GCCTTCCTTC CGTTTGCCTC TACTTATGCT TTTGTCCCTT TACTTTCCTA TGGATATACT CTCAAAGACT GATCTATCAA GCTGTATGGA AGCTATGCAA CTAGCCGATC AAAACAGAGA CGGTCTTATT TCCAGATCCG AGTACGTCAA CCTCATGACG ACCCTGAGTC CGTACGAGTC ATGTCCAGAT TCTCGCCTCG GCGGCGACCT TTTAGGAAAC GGCTCCTTCC ATTTGGCGTT TTCGTCTTTG GCTTGCCTCT GTCTGCGCTA CACGAATGAT CGGAATTGTT GTACAGAAGC GGGGGAAGAA AACGGCACCG GTCCCGTTCT CGTGTTGGGG AACGTGTACC CCAAAGCCTA TACCGAGCGC GTTTGTGCGA CGCTCGCCGG TACGCTCGAC GATGAATGTG CCCCTCCACC AACTTTGTCG CCTGCAGTGT TACTTTTGGA AACACCCGTT ACCGCAGCGG CGGCACCTAC TGCGGGTCGA CCAGAAGCCT CTCAACCACC GACTGTAAGT CCCAACTTTC GTCCGTCATC CAACCCGACT GCACCACCAA GGGAATCGAT TTTACCTACA CCAACGAGTC TATTCAATAG TGGAGGGGTT GAGGACGGAG GCAACAATGG CTCCCTGGTT GATGCGAACA CAGACGACCC TAATCGAGGA CTGACCATAG CCTTGCCCAT TGTGCTCCTT TTGACACTCA TCGGCACGCT TGCCTTTTGG GCAAACCGGC GGCGTTCACG TGCGCGCGAT CGTGAGTTCA GCTCTCTGGC GTTGGGTTAT TGGAACAAGG GACGCGGTAC GGGTGGCGAA CAACCATCGC CGTCCGACAA TTTCGACAAT CGTACGACCT GTACAAAATC AATCGAAACG CCAGGATGCG TGTGGGTCTC TGAGTTAAAA CTCCCATTGT CCTTGGAAAT GCATACGCCA CAGCGTTCTG TTAACCTACA GAATAGTTTT GAAATGAATC ACAGGGGCAT CGATGCAAAA ACCCAGGTTT CTTTAGGTTC CCTTGGCGTA TACGAATCAG GAAGCGAAAC CAGTACCGGT GTGGTTGTGG TCGGGGAGTT GGCGCATCCC AAAATCCGGC GAGATCGGGA ATTTTCATCT CCCTTGGAAA GCGAGGGGTC GGAGATTGAC GAGGAATTTT TATCGGAACA GGACGTTGAC GAAATTTCCT CGCTGGCGAC AACGTCAGAC GATTACGATT GTGATTCAAA TTACACTCTG TATCGAGCAT GTGATGCCGT TCCCGAGAAG AACCAGAACG ATTCGTATAC GCCAGGGGAT AAGGCTAAGC AGACGCCGCC CCCTCATGGT GTGAGTAGGC AGCGGCGTCC TTCCGCAATC CCAACCGTTG GTATTGCGAC GAGCCCGATT GTACTAGTCA ACACAACGCT GCATGCGGCA GCAACGGAAT ACCCAGACCA AGGTCCAACA CCTGCGTTCC AATACGCGGA AGAGGGAAAT GACAGTGACT CGCAAGCAGA TTTTTCGTTC CTCGATTACA GTCTGAAGCA AGCCGCGTGG CTAGACTTTC AGCCCGGTGC GGCACCAGAA GAAGGTCTGC CGTAA
|
Protein sequence | MKDRAMPSFR LPLLMLLSLY FPMDILSKTD LSSCMEAMQL ADQNRDGLIS RSEYVNLMTT LSPYESCPDS RLGGDLLGNG SFHLAFSSLA CLCLRYTNDR NCCTEAGEEN GTGPVLVLGN VYPKAYTERV CATLAGTLDD ECAPPPTLSP AVLLLETPVT AAAAPTAGRP EASQPPTVSP NFRPSSNPTA PPRESILPTP TSLFNSGGVE DGGNNGSLVD ANTDDPNRGL TIALPIVLLL TLIGTLAFWA NRRRSRARDR EFSSLALGYW NKGRGTGGEQ PSPSDNFDNR TTCTKSIETP GCVWVSELKL PLSLEMHTPQ RSVNLQNSFE MNHRGIDAKT QVSLGSLGVY ESGSETSTGV VVVGELAHPK IRRDREFSSP LESEGSEIDE EFLSEQDVDE ISSLATTSDD YDCDSNYTLY RACDAVPEKN QNDSYTPGDK AKQTPPPHGV SRQRRPSAIP TVGIATSPIV LVNTTLHAAA TEYPDQGPTP AFQYAEEGND SDSQADFSFL DYSLKQAAWL DFQPGAAPEE GLP
|
| |