Gene Information |
Locus tag | PHATRDRAFT_49986 |
Symbol | |
ID | 7198695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 40445 |
End bp | 42497 |
Gene Length | 2053 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184815 |
Protein GI | 219129269 |
COG category | |
COG ID | |
TIGRFAM ID | |
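
The Gene Length field above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive (42497 - 40445 + 1 = 2053). The sketch below shows that arithmetic, plus one plausible way to compute a GC percentage from a sequence written in space-separated 10-base blocks. The function names `gene_length` and `gc_percent` are hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp and End bp fields of this record.
print(gene_length(40445, 42497))  # 2053, matching the Gene Length field
```

Passing the Gene sequence field to `gc_percent` would give a value to compare against the reported GC content, although the rounding rule used by the source database is not known.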
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGACTCCAC GGTTCATTCT AATTCGTTTT TTAGGTCCTC GAAGAGTGTG GCTGCAAACT TTGATCATGA TCTTTCCCAA TACTCGGCAA CTCACGGAAC AGCAGAAAAA CATCCGGACG TACCAACCTC TTCATGTCGC GAAGGAGGAG CTTGCCTTAC TGAAGGACCA GGCCTTAATG GTGTCTCTTG CCAAAGAGCA TAGTTTTGCT GCTGCACGAT CTGTCTACGA AAAAGGAGCG CACTGTCAAC CTTTCGCTAC CTTGTCGCTG ACAAATTCGA CCAGAGTTCG CATTCAAGAA TTCACACACG TGTCCGGTAC GACCATGCAA GGTGATCAAG TCTCAGGATT TACCCGCAAC TCCTACAAAA TAGGGACAAC TAAGATACAA GTGCTGTATG ATGAAGACCA GTTGGTTCCT GCCACTTGCG AGTTTGGGGG AAATCCCGAG CCTGTTTTGG ACGGCTGTTT GGAAGGTACG GGTGAACTCG TTTTGGCTGA CGAGCAGGGA ACTACACTCC GTTACGTGTA CAATCCTAAC GCCGATAATG CCTACGGTCG GACTATTCAG GGCTTGAGTT TACACGCGGA CCGGCGCTTT CGAATGTCCC AACATGCCCC CGTTCCAGCT TCCCTGTCGC CGCCGATCGA ATACTACGGT CAAGCGTCGT ATGCGGATGA AATACTACAA ATTTTGCTCC TCTCATCCTC AACAGCTGGA GCTTCAAATT CGACCAATTT TCGCCGAGGT AACTTGGATT TTAGTTCGAT TGTCAGCAGT GACGCGCGAG CGGCCGCAAT TGCCTCAGCG ACTGGGCTGC TCACCTTGGG ACACTACCTT GTGCGTGATT TAGAAATGGC TGTATACCAA TGTCTGATAG AGGATTCCGA CCGCGCTCTG TTAGCATTGG ATCAGGCCGT AGTCCTCTAC GGAGCGAATC CTAATAGCCC TGAACACCTT ACGGCCCATT TGGCGGATCA GCAATGCCCA TCATTTCGTA CCTGTGCAGG AAAACGCGGG GTAGTCGGCC AATCCAGGGT CAATAGCGAT ATTCTTGAAA CGATTCTACC AGCGCTCCAA TCCGGTTTGG CTGCTCACCA GTGCAGCGTG GCGCGACGTG AAAAGGAACG TTTGACGGGA ATAATCAAGG TTCCGCTGGT CCAGGCTGTA CTGCAGTCCG CCTATCAAAT GAGCTACCAG CCCGACAGCG CACAGGTCCA CGCAGCACAA GCAGCAGTGT ATGTTGCTGC GCTGGTACCG TTTTTAGATG AGTGTCAACC GCTAGATGCT ACGATATTGT ACGACAGCTT GAAGGTCGGA CACGACACGT TCTCGTTTAG CCAAGTCAAG CAAGCACTAG AGAGAAACTA CGGGTGTTTG AAAGTATCAT GTTCACATGT TGGTGGATTG TGGAATAGCG AGAGCAAAAC ATATTTTGCG GGAGCTGAAC CCTGCATTGA TCCTGTAAAT GGCGAAGCCG AGGAAGACGG ACGCGCCCAA AAGAGCTGGT CCATAATGCT GGGACTCGTG ACCTTGTTTA TCTTGGGTTT CATCTGGCAA ATACACCGTC GACGTCGGCG GCAAAGCGAG CGACATACCC GACAGAGGAA ATCGTCTAAT CCAGAGTACT ACGACGACAG CGATTCCGAT TTTAGTGATA GCGCCGATGG TCGATTTGCT TGACCAAGAA CATACCTGTA AATTTATTGG TATGCTCAAA ATTATACGGA AGAATTTCAA ATGGATTTGG AGAGACAGAT ATCTACACGT ACATGGCATC TTTTGCATTC GCATGCATTG 
GAACTTGAGC CGGTGAGTCT TTTGTAAATC GTATCATAGG GCTATCATGC ACGTCTTGAC ATCAATGACA TTGATTGTTG ACTGTTACAG TAAACGAATC CTCTCTAGAA AGAGACAGAC TGCGAAGCTC TACGAGGGTC GCAAGAAAAT CTTTTAGTTA ATGTATGTGC TCACGTGATC CTCGTGTTGG TCACGAGACA AATTGCTTCA CTGTCCATAA GTTTTGAATA GTGTAACATA AGTATGTTGG TGT
Protein sequence | MIFPNTRQLT EQQKNIRTYQ PLHVAKEELA LLKDQALMVS LAKEHSFAAA RSVYEKGAHC QPFATLSLTN STRVRIQEFT HVSGTTMQGD QVSGFTRNSY KIGTTKIQVL YDEDQLVPAT CEFGGNPEPV LDGCLEGTGE LVLADEQGTT LRYVYNPNAD NAYGRTIQGL SLHADRRFRM SQHAPVPASL SPPIEYYGQA SYADEILQIL LLSSSTAGAS NSTNFRRGNL DFSSIVSSDA RAAAIASATG LLTLGHYLVR DLEMAVYQCL IEDSDRALLA LDQAVVLYGA NPNSPEHLTA HLADQQCPSF RTCAGKRGVV GQSRVNSDIL ETILPALQSG LAAHQCSVAR REKERLTGII KVPLVQAVLQ SAYQMSYQPD SAQVHAAQAA VYVAALVPFL DECQPLDATI LYDSLKVGHD TFSFSQVKQA LERNYGCLKV SCSHVGGLWN SESKTYFAGA EPCIDPVNGE AEEDGRAQKS WSIMLGLVTL FILGFIWQIH RRRRRQSERH TRQRKSSNPE YYDDSDSDFS DSADGRFA