Gene Information |
Locus tag | PHATRDRAFT_43746 |
Symbol | |
ID | 7197031 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1358780 |
End bp | 1362168 |
Gene Length | 3389 bp |
Protein Length | 954 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178125 |
Protein GI | 219112747 |
COG category | |
COG ID | |
TIGRFAM ID | |
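The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state this, but it is consistent with the listed Gene Length). The function names `gene_length` and `min_coding_bp` are hypothetical helpers introduced here for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein: three per residue plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
length = gene_length(1358780, 1362168)
coding = min_coding_bp(954)

print(length)           # 3389, matching the Gene Length field
print(coding)           # 2865
print(length - coding)  # 524 bp not accounted for by codons
```

The remainder is non-coding sequence within the gene span, which is consistent with the record marking the gene as spliced; how it divides between introns and untranslated regions cannot be determined from these fields alone.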
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.213644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CTCTCTTACA TCGCTCTGGA AAAGCAGCTC TAGCGTCCTG AGCCGAAAGA ATTAGCGTAG CTCGGAATAC CTGTCCTTCT CTTTTGAACC GCTTCCATGC TTGCCCATCA AGAGGATCGA CCCCAACCGA CGCGGAATGC CTCGCAAAAC GTTTCGTCGG AGCGATTACG ACTGACGGAA GAGGAGGTCG ATGCCCTGAT TGAAAATTTC GCCAAGGAGG ACGACTTGAC ACGTAAGACA TGGACACGCC GCGTCGTGGA AAATTTCCTC ATGAAGGTAC GAGTAGAGCA TTCGAATAAA ATAAATCGGA TCGACAGACT TTGTTGTTCT TGGAAATGAT TTTCGGCTCA CTGCTTTGTG TTCTGTTTGA TTCTCACAGT ACAAGTGGTA CTTCCCCCGC CGCGATATCA AGGGTGCTCC TTCGTTGAGT ATGGCGTATG CGTACTACGA GCATATTACA CTCCCCCGGC ATTTTGCCGG CGGGGAACAA ACGGCAGAGC ATGTTCTGCG ACGGGCCGAA CCTGGGGAAT CGCAAAGTAC GGATCTATAC AATCCACTCA AGACACCGTC GTCTTCCTTT ATTGAGTACG TAGTGTTCGA CGCATGTATG GATCGGCGCC TGGCTTTATA CATCTGGCAT TCCCTGACGT GAGATCTTAC TCCTTGTAAT TTCTTTTAGA TGGGGCATTG GTGTGGATCT ATACTTTTCC AGTGTGCGAA TTATGTCGAT GATTCTCTTG CTGGCGGGTC TGCTCAATAT CTACAGTATT TACTACTACG GTTCTACGGA ATACTCGCCG AACGGAAAGA ATTCATTGTC AACCTTTTCG CTAGTCGGTA CCGCCATTTG CACCACCGGC GACTGGGTTG TCTGTGCAGA AGGATGCACT CAGGAAGGTT ACTCCTCCGA AGGAGAAGAC GACCGTTTTG GTATTGCAGA CGACGGAACA GTTTTGGTCG TTCGCAATGG CTGCGACGAC GGGAGCTTCC TGCAAAATGG AATGGTCAAT TGGATTACTC TTTTGTTTTT GGGTATTCTA ATGGCTTTGG TGTCGCTTTA TCTTAAAGCT CGTGAAGTTC GATTCGACGA GGACAAGTGA GTGAAATCTT CATTTTGTGA TTTGTTTGTC TGCATTTGAT TTTGGTTTTG CTAACTCTTT AATCTATAGA TTAACGTCGA CGGATTATTC CGTCATCGTC AAGAATCCTC CTCCTGATGC ATACGATCCG GATGAGTGGC GCGATTTCTT TGCACAATTT GCGGAAAAGC AGGTAACAGT GGTTACCGTA GGGTTAAACA ACGAGTCGCT TTTGAACCTA CTGCTGGCGC GTCGTGTTCA CCGGCATAAT TTGCGGCTTA TGCTGCCGAA AGGAACCGAC ATGGATGATG AGGACCAGGT CCGTTCAGCA GTAGCCCGAC TGATTCAAGA CCGAGAAGCC GAGCCTGATG GTTGCATCAT GCGCTTCTTG GGATGTGCGG TATTTCCTTT TCTGCGAATC TTCAACATGT TCTTGCCGCC CGAAGTTTTG GTGGATCGAG TCTTCCGTTT GACCGACCAA ATCAAGGAGC TTCAGGGGGA GAAATACACG GTGTCGAACG TCTTTGTCAC GTTCGAAACC GAGGAAGGAC AGCGCGCGGC TCTTGCAGCT CTCTCTGTGG GAAAGCTTGA CGCAATCCGG AACAACACGG CCAACTCTGC CCCTAGTGCC ATTTTCCGTG ACCGTGTTTT GAAGGTTGAA GAGCCAACCG AACCAAGTGC TGTTCGCTGG ATGGATCTCA GCGCCTCGAC GTTGCGAAAG 
ATCATTCTCC GCATATTGAA TTTGCTCATT ACACTGGGTG TCGTTTCTTT TTCTGGATAC TTGGTAGCAA AAGTTCGTGA AAACTTGGGG CCTGGATACT CTGGCCCTTT GGTTTCCGTA TTCAATTCGA TTATTCCACA GATCGTCAAG CTCTTAATGA TCTTCGAACC CCATACCACC GAAGGGAGCT TCCAAACCTC TCTGTACTTG AAGATTACTC TCTTCCGTTG GGTGAATACG GCCGTCCTTA CGAAATTAAT CACTCCTTTT ACAAGCACCG TTAGTCCTGA AAGGACCAGC GTCTTGCCGA CAATCAATTC CATTCTGTGG TCCGAGCTAT GGCTAGTGCC TGGTCTACGT TTATTGGACC TTTGGGGTAA CATCCAGAAG CATGTGCTTG CTCCACGTGC TCGAAACCAA GAACTTATGA ATTTGAATTT CCAAGGCACG TTCTATAACC TCGGTGAGCG GTACACAGAT TTGACGAAAG TGCTCTTTCT TTGCTTTTTC TATTCAGCGT TATTTCCGTC GACCTTCTTT TTCGGTGCGG CAATTCTCTT TGTTCAATAC TATGTAAGTT GTGTACTGCT CAAATTACAG CCATTCTTGC CTTCGACTTC TCACGGATAC TTCAACCATT GCTATAGACC GACAAATACT GCTTGATGCG CATCTGGGCA TGGCGACCAT TCATAGGGCC AGAGCTGGCA CGCTTCAGTC GAAGGTATTT TTTTTCTGGG AGCGTCTTGG CCTTTGCTTT AGTAAGCGCG TACACCTGGG CTCAGTTCCC ATACGACAAT GTTTGCGATC CAGATACACC CATCTTTACC AACGCAGCAA GAGAATACTT CAATGTACAA TTCGCAAACT CCTCTACTGC CGACGTTGTC ACCGTTTCAC AGGACACTCC AGTTGTAGCG TGCAGCCAGA GTTGGCGAGA AGTCAGTGGT TTTTCCTTTC CTCCAACAAA GCGCATCCAG CCGGTTGGAT TAAGTTGGAT GAGCGACTCA CAAGAGACGC TGACGAGCGT CTACGGCTGG ACAGCCGTTG CACTTCTTGT CGGCTTTCTT GTTTTCTTTT TTGGTTCTTC GACCATTAAC TTCTTGTTGT CCTGGTTTCG TGGCATATAT CATACCAGTG GTCAGAATCA GCGAATCGAT TTCAGCACTA ACCTGGAAAT TTTTGCTTAT GTCCCGCAAG TCAAGCTGAA ATCTTTACCT TTTCCTCTCT TAGCTTGTAA CGTCGACAAT ATTGACAAGG GTCTAATTGG TTGGAACGAC CCCGCACATT CATATGATGT TCACAACATG ATTTTCGACG TTCCCTGGCA AGGAATGCCA AGGCAAAAAG CTGTTGAGAA TGAAGCGAGT ACAAGGGGGA GCGTTCTTGG AGCCGAACAA GAAGAGGTTG GAGGAAATCA AAATTCACCG CAAGCCTTGC CCAATGCGCT CGAAGTGAGA ACAGGGCAGC CTCCAATCTT TGCGGTTGTG AAACACTATC CGCCCGAATG GAGACAGCGC GAGCTAAAGC TGTCCTAGCG CTTCGCTTTT GCTGTAATCG CTTCTAGTTT AAACACACTT GTACTCACC
Protein sequence | MLAHQEDRPQ PTRNASQNVS SERLRLTEEE VDALIENFAK EDDLTRKTWT RRVVENFLMK YKWYFPRRDI KGAPSLSMAY AYYEHITLPR HFAGGEQTAE HVLRRAEPGE SQSTDLYNPL KTPSSSFIEW GIGVDLYFSS VRIMSMILLL AGLLNIYSIY YYGSTEYSPN GKNSLSTFSL VGTAICTTGD WVVCAEGCTQ EGYSSEGEDD RFGIADDGTV LVVRNGCDDG SFLQNGMVNW ITLLFLGILM ALVSLYLKAR EVRFDEDKLT STDYSVIVKN PPPDAYDPDE WRDFFAQFAE KQVTVVTVGL NNESLLNLLL ARRVHRHNLR LMLPKGTDMD DEDQVRSAVA RLIQDREAEP DGCIMRFLGC AVFPFLRIFN MFLPPEVLVD RVFRLTDQIK ELQGEKYTVS NVFVTFETEE GQRAALAALS VGKLDAIRNN TANSAPSAIF RDRVLKVEEP TEPSAVRWMD LSASTLRKII LRILNLLITL GVVSFSGYLV AKVRENLGPG YSGPLVSVFN SIIPQIVKLL MIFEPHTTEG SFQTSLYLKI TLFRWVNTAV LTKLITPFTS TVSPERTSVL PTINSILWSE LWLVPGLRLL DLWGNIQKHV LAPRARNQEL MNLNFQGTFY NLGERYTDLT KRYFRRPSFS VRQFSLFNTI HSCLRLLTDT STIAIDRQIL LDAHLGMATI HRARAGTLQS KVFFFWERLG LCFNTPIFTN AAREYFNVQF ANSSTADVVT VSQDTPVVAC SQSWREVSGF SFPPTKRIQP VGLSWMSDSQ ETLTSVYGWT AVALLVGFLV FFFGSSTINF LLSWFRGIYH TSGQNQRIDF STNLEIFAYV PQVKLKSLPF PLLACNVDNI DKGLIGWNDP AHSYDVHNMI FDVPWQGMPR QKAVENEAST RGSVLGAEQE EVGGNQNSPQ ALPNALEVRT GQPPIFAVVK HYPPEWRQRE LKLS