Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45989 |
Symbol | |
ID | 7201054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 878783 |
End bp | 881883 |
Gene Length | 3101 bp |
Protein Length | 1008 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180133 |
Protein GI | 219118732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.400632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTATT TGAACAATCG TGTCGGCAAC GGCGGTCCCA GCGGCAGCAA AGCGGTTCGC CCCCAGAACT ACGTACCCGG ACTCGGGCGC GGGGCTGCCG GATTTACAAC GCGCTCCGAT GTCGGGCCCG CAGCGAACGT TGCCCTTACG GCCGAATCCA CGGGAGGCAG TCGGGCAGCG GATGCACGCG CTGCCAAACT GCAAGCCCAA AAGCAGAAGG GTCTCTTTGG AGATGCGCCT CAGAATTACG TACCGGGAGC GGGACGGGGT GCGGGCAGCA TGGGTGCAGC GGGGACCGGG GGGCCTGCCA CGGCGACAGT GGGTTCCTAC GACGCCTTTG GCGGCTATCA GGAACGCCCA GTGAACGAAG TTCCGGGACA GTACGACGAA GATGATGACG AAGCGGATCG TATTTGGGCT GCCATTGACG AACGTGTGCA GAGCAGGAAA CGAAAGTCCC AACAGCGAAA AGAGGAAACC GTAGCAACGT CCGCCGAGAC GGACAATGCT CGTGTACGGA TAGGATCGCA ATTCAGAGAG CTCAAAGAGA AACTGAAAGA TGTTTCGGAA GACCAGTGGG CAGCTATTCC CGACGTGGGG GATCACAGCA TCAAGTACAA GAGACAAAGG CAGCAACAAA ACGAAATGTT TACTCCGCTC TCCGATACCC TACTAGAGCA GAGGAATCAA GCCAATCTTG ACGCAACTGC GGGCAACACA GCCTTGGCCG GGACTACTAC AGCCGCTGAT GGCATCCACA CCACGGGAGT AACTACGACC ATGGCCAACA TGTCGGGATT GAGTGCCGCG CGGGGTACCG TCTTGGGCAT GAGTCTCGAT AAAATGTCTG ACTCCGTATC CGGTCAAACC AACGTGGATC CTCAAGGCTA CCTGACGTCG CTCGGCAGTA GCACTACAGC CCTCAACAAC GCTTCACAAG TGGCTGATAT TCATAAAGCT CGACTGCTGC TGAAATCGGT TCGTGATACC AATCCACAGC ATGGACCTGG CTGGATTGCC TCTGCCCGCG TAGAAGAAAC GGCAGGAAAG CTTCTGCAAG CCCGTAAGAT TATTCAGGAA GGAACCCGTG TTTGCCCCGA CAATGAGGAC GTTTGGCTTG AAGCCGCCCG TCTGCATCCA ATTCCCGTAG CCAAATCCAT TTTGGCGACA GCCGTCCGGA GAATACCTAC ATCGATACAA ATCTTTTTGA AAGCTGCTTC CTTGGAAACG GCCGACTCCG CCAAGAAAGC TGTCCTGCGC AAGGCTCTCG AAGCCAACCC AACCTCCACA CTGCTCTGGA AGGCTGCGAT TGACTTGGAG GAGGCCGACG ATGCCCGAGT ACTATTAGCG GTCGCTGTCG AAAAGGTTCC GCAAGACGTC GATTTATGGC TTGCCTTGGC ACGCCTCGAA ACATATCAGA GCGCTCAAAA GGTGCTGAAC AAAGCCCGCA AAGCCTTACC ATCCGATCGC TCGGTCTGGC TGGCAGCGGC CAAGCTGGAA GAATCGCAAG ATCACGTTGA TACGGTATCC AAGATTGTCG ATCGAGCCGT ACGGTCGCTT CGGAAACAAG ACGCCGTTAT ATCGCGAGAA CAATGGTTAG AGGAAGCCGA GAAGGCGGAA TCGGCCGACG CACCGATCAC AAGTGCGGCA ATCATACACC ATACGATTGG TCAAGACGTG GAAGAAGAGG ATTGTCTGCG TACCTGGTCG GAAGACGCCA AAGCTTGTGT TGCCCGGGGT TCCGTCGTGA CGGCACGGTC TATTCTCGCG CACGCGTTGC GAGTGTTTCC GAGCAAGCGT GTTTTGTGGA TGCAAGCGGT AGAATTGGAA CGCCAGCATG GGACAGCGGT AACACTAGAA GAGCGTTTAC GGGATGCCAC ACATGCTTTA CCGCGGGTGG AGATTTTCTG GTTATTGCGC GCGAAGGAGC AATGGATGGC GGGCAAGGTC GACGAGGCAC GTCAGATCCT GACGGACGCC TTTGCGGCCA ACCCGGATTC TGAGTCGGTC TGGTTGGCGG CGGCCAAGCT GGAGTGGGAA AACGACGAAC TGGAGCGAGC GCGAGTCCTG TTCGCTCGGG CGCGTGAACG AGCACCGACG GCCCGCGTAT ACATGAAATC GGCGATTCTG GAACGGGAGC AAAAGTGTTT CGGGGATGCG CTGAAGCTGG TAGAAGAAGG AATCGAAAAG TATCCCAAAT TTGCCAAGCT GTACATGATC GGGGGACAGA TTTACGCGGA CGACATGCCG AAGCACAAGG GAAGCTTGGA TCGAGCGCGC AAGTTTTATC AGCGAGGACT GGAAGCTTGT TTGGAGAACG TGACGCTCTG GAAGTTGGCG AGTCGGTTAG AAGAATCGGC GTGGCGGTTC GACGCAAAGG ATGCGGCTGG GGAATCCGAC AAGGCTGTGA GCAACGGGAA CGTTGTAGCC AAACCTGGAG CTGCGGGTGC TACCAAAGCG CGCAGTCTTT TGGAATTGGC CCGTCTGAAG AATCCCAAAA ACGCGGAATT GTGGTTAGAA GCCGTCCGGT TAGAGCGTCG GAACGGGAGT CTCCGCATTT CCGAAAGTTT GCTGGCGAAA GCGTTGCAGG AATGTCCGAC TTCGGGAATG CTGTTGGCCG AAACGATTTG GACAGCGCCG CGCGCGACTC AAAAGTCGAA ATCGGCAGAC GCCATTCAAC TGTGTCCGGA CGACCCGCAG GTAATTGTGG CCGTGGCGAG CCTATTTGCG TCGGAACGCA AGCACGAAAA GGCGCGGAAG TGGTTCGATC GCGCTGTAAC ACTGAATCCG GATCTCGGTG ACTCGTGGGT CCGTTACTAC GTGTTTGAAC TGCAATGGGG GACTGTGGAG CAGCAAGGGG CCGTGAAAGA ACGATGTATT GCGGCGGAAC CCAAACACGG CGAGTTGTGG GCATCTACGA GAAAGGAGGT AACCCGACGA CACGAGTCGA TCGGAGAAGG TCTCGAGGTG GCCGCCCAGA AGCTTCGCAA CGCGCAGGAA AGCGAGAATC CCTCCGTTAT GCTCTAATGG AGTAGCAAAG CGTTTCCCTT AGTGTATGTC CCTGTGGCTA TTTTCCAGTT ACTAGTGTAT GAGGTACAGT T
|
Protein sequence | MSYLNNRVGN GGPSGSKAVR PQNYVPGLGR GAAGFTTRSD VGPAANVALT AESTGGSRAA DARAAKLQAQ KQKGLFGDAP QNYVPGAGRG AGSMGAAGTG GPATATVGSY DAFGGYQERP VNEVPGQYDE DDDEADRIWA AIDERVQSRK RKSQQRKEET VATSAETDNA RVRIGSQFRE LKEKLKDVSE DQWAAIPDVG DHSIKYKRQR QQQNEMFTPL SDTLLEQRNQ ANLDATAGNT ALAGTTTAAD GIHTTGVTTT MANMSGLSAA RGTVLGMSLD KMSDSVSGQT NVDPQGYLTS LGSSTTALNN ASQVADIHKA RLLLKSVRDT NPQHGPGWIA SARVEETAGK LLQARKIIQE GTRVCPDNED VWLEAARLHP IPVAKSILAT AVRRIPTSIQ IFLKAASLET ADSAKKAVLR KALEANPTST LLWKAAIDLE EADDARVLLA VAVEKVPQDV DLWLALARLE TYQSAQKVLN KARKALPSDR SVWLAAAKLE ESQDHVDTVS KIVDRAVRSL RKQDAVISRE QWLEEAEKAE SADAPITSAA IIHHTIGQDV EEEDCLRTWS EDAKACVARG SVVTARSILA HALRVFPSKR VLWMQAVELE RQHGTAVTLE ERLRDATHAL PRVEIFWLLR AKEQWMAGKV DEARQILTDA FAANPDSESV WLAAAKLEWE NDELERARVL FARARERAPT ARVYMKSAIL EREQKCFGDA LKLVEEGIEK YPKFAKLYMI GGQIYADDMP KHKGSLDRAR KFYQRGLEAC LENVTLWKLA SRLEESAWRF DAKDAAGESD KAVSNGNVVA KPGAAGATKA RSLLELARLK NPKNAELWLE AVRLERRNGS LRISESLLAK ALQECPTSGM LLAETIWTAP RATQKSKSAD AIQLCPDDPQ VIVAVASLFA SERKHEKARK WFDRAVTLNP DLGDSWVRYY VFELQWGTVE QQGAVKERCI AAEPKHGELW ASTRKEVTRR HESIGEGLEV AAQKLRNAQE SENPSVML
|
| |