Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45828 |
Symbol | |
ID | 7200942 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 399883 |
End bp | 402741 |
Gene Length | 2859 bp |
Protein Length | 805 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180036 |
Protein GI | 219118531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00551122 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACGAACGA ACACTGGTCT TCCCGCGTCA CAGTCCCCTC GGTTACCTTC CGTGTGTATT ATATTCCACA GACGACAGTA CTACCATGGG AGGATCCGGA TTGGCAACTG GCGCTGGCAC ACGCGTCGAT GTGACGCTCA AGCTTCCGCA GCTGCAGAAC TTGTGCAAGC GCGATCCGGA AGGATACCGG GAGGACTACG ACGCCCAAAC GCGACGTTTG GAATCCGAGG TCGGCATTCT CGCGTTGCAG CCCAATCACG AACCATCTCC TCGGCTCGTG GAACTCATAC AGTTCGCCGC CGCCGTATCC AGTAGCTCGT ACAAAGGCCA TGAGTCGGAT CGGATCGCAA ACATGCTCAT GAACCTGTTG GTTGGCGGTG GAGAAGGAAA CACCCAGACT TCCTCCAACA CCGTTACGGT TGAGGAACGA ACCATCGTGC AGCAACAATC CGTCCAAAAC GTCAGTACCA TGCCGGCGGC GGCACTTCAA TTGCACCGGG AAGTCCGCAA GACCTGCGTA TCCGCCTTGA TTCTTATGCG CAACAAGGGC GTCGTGGAAC CTTTGCGTCT CCTCGAACTC TTTTTCCGTC TCATGGCCGT TCTTCCCGAC AAGGCCTTGC GCGAATTGCT CTATAAGCAC ATGGTCAATG ATATTCGCAA TCTCAACAAA AAGGGCAAAC GAGACGACAA GGTCAATCGC GCTGTCCAGT CCTTTCTGCA CCGAATCGTG GCCTTTCACG GTTCCGACGC GGGGAATGGC GGTGCCGAGG CCGCGACGGA CGTGGCAGCC AAACGCGCGA CCGACCTGAC CTGCGAACTC TACCGACGAC GCGTATGGAC AGATGACCGG GCCGTGGCCA TCCTGGCATC AGCCACACAG TCTGCCAATT CTACCGTCAT GTGCCGGGCC ATGCGATTCT TTCTGGGCAT TGAAGAGAAA ATGGCTCTAG ACGACAGACG ACAAGAAGAA GAGGATTGGG AATCTAAGAA TCATATCGAT TATCACTCCC ACTCGAAAAA GACCAAGGCC AATGCGCGCC ACGTGGCCAG ACAGGTTAAA AATCGCAAGA AGGTGCAGGT TAAGAAAGAA AATAGTGATT GGATGGAAGC AGATCCCAAA AAGGACAAGG GCGTGGAAGC TTCCAAAAAG CTCTATCCCG CCATTGAAAT GCTGCGCGAT CCACAGGGAC TTGTTGAAGC TACTTTCAAA CGGCTCCGCT CTGCCAGTTC CTGCAAGTAT GAAGTCAAAC TTTTGATGAT GAATTTTGTC ACTCGAATAG TCGGCAATCA TGAGCTGATG CTCCTTCCCT TGTATCCTTT CTTGCAAAAA TACATGGGAG GTAGTCAACG AGATGTAACG GCCGTATTAG CCTACTGTGT CCAAGCTTGT CACGAACAGG TACCTCCAGA TGAGGTCTAC GGTATCCTGA AAACCATTGC ACATAACTTC ATCACGGAAC GCTGCTCGGA AGAACAAATG GCCGTCGGAA TCAACGCTGC TCGAGCAATT TGTGCCCGCG TTCCTTCAGC CTTGTCGGTG GAGGAGGGAA ACGAAATTTT GACGGGCGGA ACTTCTATGG ACGTCGAAGC CTTTGCTAGA GACTTAGCGG CGTACTCGAA TCATCGTGAT CGCAGCGTTT CGATTGCTGG AAAGGCATGG ACCAACTTTA TTCGTGAGGT CCATCCTGGT CTATTGCAAG GCAAAGATCG TGGTCTTCTG GGTTCTGCTT TGCACAAAAA CGGTGCCAAG CCTCTTCGTT ATGGTGAGAA AGAAGTTTCG GCGGGTGTGG AAGGAGCCGA CCTGTTGCTC GAGTACGAAT CGAAAAAGGC AGCGTATCTT CGCAAGAAAC GAGATCAAGG CGATGAGGAC TATATATCGG ACGAGGATAG TGTGTCTGGA GAGTGGGAGG AGGTCGGAGA AAATTCGGAA GACGAAGAAC CTCCAACATT GCTCAACATG GATGGCGAAA GCCAGCATGA TTCTGAGCCG AGGAGGACGA CCTCAACGAT GATGGTGTTG TCGATGTTTC TAAATTGTCC AAGGAAGAGC GTAACAAGCT GAAGCAACAA GCTTCATCAA CCCGCATTTT CTCTGCTGCA GAGTTTGACA AAATGCGGAA GCTTGTGGAA AGAGAAAATC GTGCCAAACG CGATCCACGG GAGGCTGCTC GTCGCAAACG GGCTATCGCT CAGGGACGAG AGTTCGAAGA AATTTCCGAC GATTCCGATT CCGACGAAGA GATTCATGTG GCCGGTGCGG TCAACCCAAG AGATCTAATG GCGGCAGCAA AGCGAAAACG ACAGAGCAAG GCTGAGAAAT TGCAAAAAAT TCTCGCTGGC CGAACAAAGT TCGAATCGAA GCAGCGAGAG GGAGGTGCAA CCAACGAAGA AAAGAAGCGC AAAAAGAACT TTCTCATGTC CAAAAGCAGT CGCGACGCTC GTGGGAAGAC TCGAGTGAAG GGTGGGCTTG GGAAGAAGCA AGTGGAAAAG CAGACCGGGC ATGTTGCCAA GAAGAGAAGA AGACGGCTTT GAGCTTACGA ATAGCGTGCT TGTCACTGTT AGTCTTTGCG CTTTTGTGTT CCGACAATAG TTTGCTGCGA CCAGTAGGCA TCGCCATACG CAACGAAAAG TTCTTCCCCA GGAAATATTT CTCGAGTTGT AACAATAGCA GCGCGGAATG CCTCGGGAAC ATATTCACAG TTAATAATAA ATCGTCATTC AGCGGATCGT TGATATAACG TGCCTTTATG TGCATTAAGG TTGCAGGATC GACAAGAACA TCTCCATTGA GGCTCATCAA ATAGCTTTTG TCCCGTAATA TCTTTGAGGA TCGAAAGGTA TGTATGTGCC CAGAGTAAT
|
Protein sequence | MGGSGLATGA GTRVDVTLKL PQLQNLCKRD PEGYREDYDA QTRRLESEVG ILALQPNHEP SPRLVELIQF AAAVSSSSYK GHESDRIANM LMNLLVGGGE GNTQTSSNTV TVEERTIVQQ QSVQNVSTMP AAALQLHREV RKTCVSALIL MRNKGVVEPL RLLELFFRLM AVLPDKALRE LLYKHMVNDI RNLNKKGKRD DKVNRAVQSF LHRIVAFHGS DAGNGGAEAA TDVAAKRATD LTCELYRRRV WTDDRAVAIL ASATQSANST VMCRAMRFFL GIEEKMALDD RRQEEEDWES KNHIDYHSHS KKTKANARHV ARQVKNRKKV QVKKENSDWM EADPKKDKGV EASKKLYPAI EMLRDPQGLV EATFKRLRSA SSCKYEVKLL MMNFVTRIVG NHELMLLPLY PFLQKYMGGS QRDVTAVLAY CVQACHEQVP PDEVYGILKT IAHNFITERC SEEQMAVGIN AARAICARVP SALSVEEGNE ILTGGTSMDV EAFARDLAAY SNHRDRSVSI AGKAWTNFIR EVHPGLLQGK DRGLLGSALH KNGAKPLRYG EKEVSAGVEG ADLLLEYESK KAAYLRKKRD QGDEDYISDE DSVSGEWEEV GENSEDEEPP TLLNMDGESQ HDSEPRRTTS TMMEERNKLK QQASSTRIFS AAEFDKMRKL VERENRAKRD PREAARRKRA IAQGREFEEI SDDSDSDEEI HVAGAVNPRD LMAAAKRKRQ SKAEKLQKIL AGRTKFESKQ REGGATNEEK KRKKNFLMSK SSRDARGKTR VKGGLGKKQV EKQTGHVAKK RRRRL
|
| |