Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44894 |
Symbol | |
ID | 7199813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 568160 |
End bp | 573204 |
Gene Length | 5045 bp |
Protein Length | 1032 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178806 |
Protein GI | 219116022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00992786 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTGGTCAT CGATCCATCT CGTACTCTGT GTAGAGACGT TGTGTAGTGT CGCACGAGTG CGTTCCGCTC TATACCCGAT CGACGCAGAG AACCTATTTC CCGTACCACG AGATATCCGG GACGTGTCCA TCCGTCGTGT TCTTGGCTCC ATCAGTCGAC CGTTTGCGCC AACGATTGGC ATCGTCTACT CTGCACAGTC CGCTTTATCG CAACAGCCGT ATTGACCGTG AGCCTCCATT TTCTCGGATT GGAAGGTCGG ACGCTTTGCT TTTTTATATT TATTCATCTG TTTGCATACC GACGTGTTGT GTACATACTC GACCGTGTTT TCGTTTTTGT TGGTGAAAGC CACGGCATTC TTTCTTCTAG TTTTCCCTCG TTCGTGCCTG TACGCGTATT TAAAAAAGCA ATGAAGGCAA TACACGTCGG TGTGCCACGC TACTGCCGCG CACTGTCGTT CGATAACGAC ACATCCACCG TCTCATCGCA CAACAACAAT AGTCACAGAA TACCGATATC GACAACCACC ACCACCACCG TCCGCACAAC AACAACATCA ACTCGCCTGT TGCGCAAAAT CACTGTCCGA TCGGTAATCG TCGCTTTCTC TTGTGCGGCA CTGTGGTTGC AACTCGACCC GCCGCTGTTT GTGGCTGCCT TTGCCCCCGG GACGTGTACT ACGGTGACGA CTACTGCCAC CGTCGCACGC GTATTGCCGT ACGCGCTGCG GACCGTGACG ACGAAACCGT ATCGACTGGC GCACTCGCAA CCCGTACCGC TACATCCCCA CCACCGTATC GACGGCGTAC CGTTGGCCGC CACTGCACGC GAAACGGGAT CCACCGCCAA GGACGACGAC ACGGCCGAAT GGAGGGCCGT ACTGGCCGCG TTGACCCTCT ACCAAGCCGC CTTTGGTGAC GTCAAGGTGC CGCAAAAATT TATTGTACCC ACCGCCTCGC CTTGGCCCAA ACCCGCCTGG GGGCTCAAAC TCGGCAAGGT GGTCGCCAAC ATTCGACTCA CCGGAAAGTA TATACACGGA CGGGACAAGC GCAAACAAGC GCTCGAAAAA TTAGGTTTTC TATGGGCGGC ACGGAGTACG CCGCAAACGC GGGCCGAGGC GGAAGACGAC CGCTCCAACG TGACGGTGGA ACAAATTTTG GCAGCCTTGG TAGCCTACCG TGAGAACGTG GCTCCGTTGG GACCCGTGCC CGCCAAGTAC ATCGTGCCGG ATGCCGAGCC TTGGCCAGAA CGGGTACGGG GGCTCCCTTT AGGATCGCAA CTAGCGCGAT TGCCAATGGA CCAATTGCCG GAAACAATCC AGGCCAAGCT GCAAGACTTG GGCGTCATGG AACGATATAT GCCGTTGGCA TCCTCCGGCT CAATGGGTGC TCCCCCGACT GCGCCGGTTG TCTCGGGAGA AGCGTACCAG TCCGCAAGTA ATGTACCACC AACTGCTAAC GACATTCGCT TCCATAAGGT CTACCTCGCC TTGCGAACAT ACAAGGATGT GTACGGAGAT TTACTTGTTC CGCAACCATT TGCGGTTCCA TCCAAAGCCC CATGGCCGCA AGAAACCTGG GGACTCCGCT TAGGCGCCCG GGTGAACGCG GTACGATCCC AGGGAACCTT TGTTAATGCG AATCCCGATC GGCGTCAGGC ACTGGATGAT TTGGGTTTTG TGTGGTCCCC GCCCAAGGAA GGCTCGCGGC GCGTCAAAAG CCGCGAAGCG AACAGCTTAC CCGATACTGC CATTCCTGCC AGCGCAACGA CGAAGAATTC GTTGGACTCC TTGCTCGACG ATTCGACCTT TGATTTTGGA CAGGACTTTA TGGATCAAGG AGGTGACGGC CGTGACGGCT TGGGGGGAGG ATCCGCGACA GCTCCAACCT GGGGTTTAGA GGGTGGCCGC TTGCTGCCTG TCGAAGAAGC CGCGGCGGCG GCTCAGGCAG CGGCTGAAGA AGAGTACGCT CCCCCACGGA CACTTCAGGA AAGTCTAGAC GAGGCGACCG TTCGAGCGAT GGAATGCGGC GTCATCGAAG GGTTGACGGA CCGAAAACGC GTCATGAAAG GCAAACGCGA AAAGGCGATT CCCTGGTTTA ATGACGATTT TGGAGACGAC TTTGTCTTTG AAGATGTCGT TGAGGCCTTG TCGTTGTACA AGCGTATGTA TACGGATTTT GACAACTTGA CTGCGAGCGA AGAATTTGTC GTTCCGACAC ACAACGTTCG TACTGGGTTT TTAGACGACG TTGACGACGA TTTGAATACG TTTGATGTAG ACGCGTCGGC TCGGGCAGCG GCTGCGATAA AGCAGTACGA AGAAGAAGGT ATGAAAGATC GGAGCGACGA TCTAATTGCG GCCGAAATCC AGCGAATTCA ACGGGAAATT GAAGGAAGCA ACGTCGCAAC CAAAACCAAA CCGTCCAAGG TTGTGCAAAG CACTACCGAG TGGCCCGAAC ATTTGGCAGG AATGAAGCTG GGAAATATTG TAGCTCGTAT CCGAGATGGA AGTTTAGAAG TAAAGCACTT ATCAGAACGA AAGGCACAAC TCGACAGTAT TGGCTTTGAA TGGGGCGATC CCAAGAGGTT TCTTGACGTT CCGTTTGAAA AAGCTATGTG CGCCATGTAC GCATATTACT TGGTACGCGG TGACATGTTT GTGTACGAAG ATTTTGTTAT GCCGGAGGAG GATCCCTGGC CACAAGCCTT GGCTGGGTAC GAGATCGGCA AAACAGTCAT GCGATTGCGG GAGCTTCAGG ACTTTTTAGA AGCATACCAC CCCGAAAAGG TCAGTCTGTT ACGCATGATC GATTTTGTGT GGTTCCCCAC AATGGCATTG CCTCTCGATC CAGATGAACC AGAATTGACC AACGAGACTC TGCTTTTAAG CGCTCTTGGG CATCCAGACT ACGCGAAAAT GATTGACATT CCTATGGGTC TTCCAGACAA GATTGTAGCT GATGGACCGT TCGTGGAAAG CGACAACCCA AAGCACTGGT GGCGTAAGTG GCACAATTGG GATTATGTCC AAAACTATTG GTACCAACAA GGCCGCCGAG ATAATGCTTT TGTACTTAAA GGAATGGGTT ACCCTCAAAT GGCGAAGGAG CACGAAGCCA AGTACGGTCC AGGGCTGTTT ACACAAATGG AGGAAACGAT GGCAGTTCTG GAATCTGGTA TTGAAGAAAA GTCATTAGAC GAAAAGAAGG AATTGCTTCA AACCCTACAC TTTTACCGTC AGGAGATGCT CGGATGTACT GATATCGCCG CCTGGGATCG AAATCAATGG CTAGCCGATC TGGACACAGA AATGCTTAAA ATTATGAAAG ACTCAAATCT TGAAATTGAA TTGGATGTTG ACGAGGACGA GGGGTACGAC GACGAGGCAA TAGATGGTGA AGAATATATT GAAGACGAGG ACTACGAGGA AGAGGAGCCC GAAGAAGAAG AAGAAGACAA AGAAGAGTTT GATGTAGAAG ACGAGCTTGG CTTGTCGGGC AAGCAATAGC ATCTTGAAGC TACAGCCGAC GCGAAGCAAG TGTGACAATA AAACTGATTT ATCATGTGTT GGCACAATTC AAGTAGTTCA GCTTACAAAG GCCATATTTT GCGAGACGAA ACCAATAGCT CCTAAAAGCT AAGGTCATCC GTCTCCTGCT GTGTCTTTCT ACCGCCAACC CTTTTGACGC ATGTATGTTT ATCTCTAGTC AACTTAGCGC CGCGGGCTTT GAGCGCTGGT TGCCCTTTGC GATAGACTGC CACATAGCTA CACCCAACTA TGAGGATAAC TAGAGCCACA ATGAGCACCC AAAAGCCAGC CGATAAGCTG AAGTCGTCAT CGGAATTCAT TGCTTCGCCA AGACGAGACA AGGCAATGGG AACGAAAAAA TCCCCCGAGG TTTCCAAATC ACGACCTCCA CGAAGGTTGC GGCCATCGAA TGCAATCACA ATCGTCCCTG ATGCCAAAAG AGAGTCCGGC CGGCCTGACA TAAAGAAAGT GGAGTCCACC GGTGTGCGAA CTTGGCAGGA CATTCCTGAG CAAAAGGCGC TTGTACCTGT AACACGAGTA CCGGCATCGT CGATAACCAC TGTATTGCTG CTACCTTGTC CCAAATTGAG GTTGCTGACA CCAAGAAGGC GCGCCGTACT ACTGGTGATG CACAAGCCCA GAACATCGCC CATTTCCAAA GGATCATTGA TACACTTGCT ATCTACATCG CAAGGACAGC CCTGAAGAAA TTCAGTTTCG AAATTACCCC GTTCCACATT GGGAGAGCGA GAAGCATCAA CGTTGGCAGC ACTCGGTTGT ACAGCCTCGG TCGGGCTTAT CGTAGGAGAC TCGCTGGATT GTTCCAGGTC AACTACACGG TCTCGCCAAA CACAAGGAGA GTATTTTCCG CTGACGAAGC CCAGCACTGC GTTGCATTCC TCCTTGCTTA TATCGGACTC ATTGCTATGA CGACAAACCC TCCGATCAAG AAAAGGGTCC ACGCTGTCGG AAACTGGAAG CCCGCAAAAA GGGGTGGTGT CCTGAAATAC CCTGACACTT CCCACGATAT CCCGATCATC GTTGCGTTTC CGCATCGAAC CCGACACAAT CACGGATCCT CCAGCATTCA TCGAAACATC ATTCCCCCAC CAATCAATAG TAGCATTGCC CAGTATAGTC TCAATCAGCC TATACGTGTT GATTTTTGTA TCCCATTGAT ACACCTGAAC CAAACCAGTA AAACGTTCGT GGTGTAGACT GGTAATTGCA ATTCGATTTC CATCGGCAGA CAGCGCAAGA GCCCGACCGA ACCGGTCGTT GTCGTCTTTC CCAAGTATTT CGTCCAGTTG AATCCATTGA CTCGTTTCGT CAAGATCGTA AACGTGGACG GAGCCACGAT TGGCACCGGC TATAGCTGAA GTTGGAGCGC CCACAATCAA AGTTGAGGAC CGAATGGCAA TCGAATATCC GAATTGACCA TCAACAATAG CAGAATTGCC TACTATGTCT TTGCCCACTG TAGAAAAAAG GTTGGGTGTT GAGGTGGAGA TCCGC
|
Protein sequence | MKAIHVGVPR YCRALSFDND TSTVSSHNNN SHRIPISTTT TTTVRTTTTS TRLLRKITVR SVIVAFSCAA LWLQLDPPLF VAAFAPGTCT TVTTTATVAR VLPYALRTVT TKPYRLAHSQ PVPLHPHHRI DGVPLAATAR ETGSTAKDDD TAEWRAVLAA LTLYQAAFGD VKVPQKFIVP TASPWPKPAW GLKLGKVVAN IRLTGKYIHG RDKRKQALEK LGFLWAARST PQTRAEAEDD RSNVTVEQIL AALVAYRENV APLGPVPAKY IVPDAEPWPE RVRGLPLGSQ LARLPMDQLP ETIQAKLQDL GVMERYMPLA SSGSMGAPPT APVVSGEAYQ SASNVPPTAN DIRFHKVYLA LRTYKDVYGD LLVPQPFAVP SKAPWPQETW GLRLGARVNA VRSQGTFVNA NPDRRQALDD LGFVWSPPKE GSRRVKSREA NSLPDTAIPA SATTKNSLDS LLDDSTFDFG QDFMDQGGDG RDGLGGGSAT APTWGLEGGR LLPVEEAAAA AQAAAEEEYA PPRTLQESLD EATVRAMECG VIEGLTDRKR VMKGKREKAI PWFNDDFGDD FVFEDVVEAL SLYKRMYTDF DNLTASEEFV VPTHNVRTGF LDDVDDDLNT FDVDASARAA AAIKQYEEEG MKDRSDDLIA AEIQRIQREI EGSNVATKTK PSKVVQSTTE WPEHLAGMKL GNIVARIRDG SLEVKHLSER KAQLDSIGFE WGDPKRFLDV PFEKAMCAMY AYYLVRGDMF VYEDFVMPEE DPWPQALAGY EIGKTVMRLR ELQDFLEAYH PEKVSLLRMI DFVWFPTMAL PLDPDEPELT NETLLLSALG HPDYAKMIDI PMGLPDKIVA DGPFVESDNP KHWWRKWHNW DYVQNYWYQQ GRRDNAFVLK GMGYPQMAKE HEAKYGPGLF TQMEETMAVL ESGIEEKSLD EKKELLQTLH FYRQEMLGCT DIAAWDRNQW LADLDTEMLK IMKDSNLEIE LDVDEDEGYD DEAIDGEEYI EDEDYEEEEP EEEEEDKEEF DVEDELGLSG KQ
|
| |