Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45342 |
Symbol | |
ID | 7200033 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 897792 |
End bp | 902600 |
Gene Length | 4809 bp |
Protein Length | 1366 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179532 |
Protein GI | 219117475 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTTC CCTTAATTTC ATGGGCGAAG TACAACGCGT GGACAAAACC ACAATCAAAC GCCAGTTAAA TGTTTGTAAA CACTGAGTTC GCAGTCAGCC CTGGTGTTTT GCATCCGAAC TTATCGAGAC GAAGAAACAG TCTCACGACT TGCAAACAAC TAGAGTTCTG GTTGCCAAAT ATAGCGCTCT TGGAAGGACT CGAAATCCTA CCGCTTTCCG GCATGCCGTT CTTACTGGTG CTAAGAAAGA CTTGGATCGC ATTTCGCCAT CTCGTACAGT CCAAACGTCG TCGACGCCAG CGCTCGTTTT TCTTTTCGGT GGCAATCCTG GGATCACTGG TCCTGTTGCA GACGAAACAT ATGCACACAG TTTCGACAAC ACCGGAAACA GTTTATTCCC TCAATCATGG ACAGGCGATT GAAGATTCCG CGCCGTCCTT GACGTGGGAA TTTCACGGCT ATTCACAGTC AAACATACCA GCACGATCCA CCAGCCCCTC AACTCCACGT CTATTGATTG CCCAGTACGC CTCCGCTTTC TACACGGTAG TATTGAACGA GACACAACGA GTCAACCAGG CATATGCGGA GCGATTCAAT CACGATTTTA TCGTTTGTCG AGGCATTTAC CTGACCGACA GTCCTTGGTG GAGACTCGTC TCACCACCCC TGCACACTAT TGCCGGTTCG CGCTCGACCT ACAACAAAAT TGCCGTCTTG GCCTACGCGA TGCAACATGA TTACGATCGT GTGCTGATAC TGGATTCCGA CGCCATGGTT CGCGATTTTA GCATCAATTT GGCGACGTAC TCCCTGACAG ACGACAAGGG AAAGGACGTT GTGGTGGTAG CCCAGCAAGC CAAAATGGGT GAAGTTCACC CTCCAAATAC TTGGAATGTC AATATTGGTG TCACACTCTG GAATCTCCGT CACGAACAAG TCGTTACAGT GTGGCAGCAA TGGCACGATC GCTCCATTGC CCGAATACGA AGTGGCCAAG CGGACGACGA TCAACAACCG TTGCAGCGCT GTTTTCGCGC TTTTCCCGAC ACTACACGTC CCGTCCTTGC CGTGAAGGAG TTCGGCTATG GTGGCGGATC CATCGTACAA CACTTCATTC GCGAAAGTTC GTCGTCCTGG AGTGAACCGA CGGAGGCTCG AACGGAAGGA ATTCGGACCG CAGCCCGAAA AATAGTACCA AAGTAAAAGC TGACAATTGA TATTGAACAA AATGAAATGC CATGTAGTGC TAGGTAGACT AAATCAGCGA CTGTGTAGCT TTTCCCAACA CTGTGTTTGA TCTAATATCC AAAGGGGACT GACAAGAATG ACAGAAACTT CACTGTGAGT GCCATACATT TCGAACGCCG CCTTTCACAT CAATTCGTAA AAATTGTGAA GTTGTGTATG TGGTGGTTGT GCTTTTCTAT AGGGATTTCC GGCGATTGGG TCTTCCTTGA AATCCGAACC CGCAAAGGTT GAAAGTTCAA ACAAAACCAT CTACCGTTCG GATTGTCGTG AACCTCGGCC GTTGGAACGG ATTCGCTCTC GTCTGTGAGA TTCTTTTGCC TTGCGGCAAA GAACAAATCC CTGACTATTT TCTAAAACAG TCATTTGCTC CACCAAATGA TGCCTATTGC CCAAGCGATA CTGTAACACC GGAGATAGTC CCTGTTGCAG ATGAGCCTCG GGACGAGTCC TCGTCCAGTG GGAAACGCAG CTGATCTATG CGTACGTGCG CATGTTGCTC ATACCGTCCC AGAAACGCAT CCATATTGTC AGAACGCCCA GAAACAGGAT GCGCTAGGCA GTCATGCCAT TTGGAAAGCC AATGAGACAA AGAAAGATTC TTACACTGTG GGAAGCCGGC GTACTGCTCC AATGATTCCT ACGACAACCT CTCCCGTATT GATCTATACG GGCGTTGTGC TTCGCACCCT CTTGCTGGAT GTTCCCCTAC TATTAGCCTT GATTCTATAT TCTACGACAT CTTGGTTAGA ATATGTAAAG ACCAACTACA TGCTACCGCA GCTCGAACTC CAGCGCTGGA CCCCGGAACG GGCCGAACAA GAAGTGACCT ATTTTCATCG TCGGTGTGAC GAATCCGATC AGTCCGCGCA CGATACGGAA CCTCTCGTCA TAGACTACAG CAGCATGTCC AAGCGAGACA AAATGGAACA CATGCTGACG CACGGTGTTT CAGTCTATCC TAACCTTTTG TCGGCGGAAA CGGCCAACGA AGTGCGCGAT TTCATTCTCG CGCAGAATCT TAAAAACGAA GACATGATTG ACGTGATTGA AAACACCAAT CGCTGGAGCT TTGGTGTAAG AGTCGACCAA CATCCCAGTG TATCCAAGGC TCTGAAGGAA GTTCTCAATA AGCCCGAGCT GGTTGAAGGT CTCGAAGCCA TTCTTGGTAA GAATCCCGCC ATTATTGAGT TTACCGGGAT TACCAGTGCC TATGGCGCTA CCGCACAACG CTGGCACCAG GACGTGGTAC CCGAAGGAAG CGCCGCCAAA TACAGTCGTA GTTTTGTCCC GTCCTACAGT CTCTTTATCC CACTGCAAAA CACTACCAAG GCGATTGGTG CCACCGATAT TTGCCCGGGT ACACACATGT GCGCTGCCGG ACCGATCCAC TTTTGTGAGT ACTCGGGATT TCCCGTATCC GGGGCGGCCG ATAACTGGCC ATTGGGATGG GGAGCACTGG TCAATCAACA AACGACCCAC CGTGGAGCCC CCCATGTCGA CCCGCACGGG CCCAGCCGCG TTTTGTTTAT TCTCACCTTT GCACCACGGC CCCAGTTCAC ACCGTCCAAG CTGGAAACGC GAATGATTTC GACCGGCGGT TCGTACTCCT TACACTGGTC ACAATGGGGA CACACGTTGA GGGATTTCCA GGACCCGGAC ACTCGCATGA AACATCCCTG GCGAGCTCTC CGTGCGCTGG GTTTGTACAA ACCGCGAGAT GCACAGTGGG GATGGGACTA TTTATCCCAA GCTTCGGGAC GCGTGGCCAA CGATGAAGAG GGTTTTCATC GTGAGAGTCT GGACGGGCAG CTATCCAAGG GCGGACTCAC GTTTCTGCCG GACTGGCTGC AGGGGCACGC GTCAACTGAC GAGGAAACCA GTTATGCCTG GGTGGAATTC ATGGAAGATA CGCTACGCTT ATGCAGTCAC ACCACTCAGA AATTCTATCT TGCCGTTGTA TTCGGGTACG GTTCATTTGT AGTTATCTGG AACGGATTTT TGTTCGCCGC GGGACGTAGA CATTTTCGAG TGAAGGCTAT TGGTCGAAGC ATGCTGCGGG TCCTATTGCT ACACGCTGTA ATATTGTCGA TCGAGGAGTT TGCGCGACGG CGACTTGCTG TTACGGATTG GGCCAAAAGC ATTCGTGGCA GTCGTCTCTA TCGGTTGCCC AGTCCAGATC AAAATCTTCC TCTTCCAGGG ACGCTTCTTC TCTTGGAAGA TGTTTTGATC CTTGACCAAT TTCAGTCGGA ACACTTGGGG TCGTATGATC GAATACTGGA CTTCGCGCAT CCTGGAAATC GACGGTTCAA CACCATGATC TTGCAGCACT CCAAAGGATA TACCGCTCTA CCATTTTCGC TTAAACGGAG TTTGCGTGCG GATGTTCTGC TATGGAACAA GCAAGACGGA AGCCGTATTC TAGCAAAGAA CGTAGATGGA GCTTGGGCCG AGGTTGCCCA AGAAACTGCC GAAAAAGCAT GTCATAAGAA ATTGACAAGA GCATCAAATT CCGGTGTTGA ACATGTATCC CGACAGCTGG ACTACTTGAA GGCGGAAAAT ATATATGGCT TTTGGCGGCA CACATCTATG TACCTACGTC ACAATCCTGT TCTGCTTGAC CGCCTCGAAC GAAAGCTGCT AGGATGGAAC GAGACTAGCA AGAATTCGTC AGCCGGTCTA TCGTCCTCAA ACGGCTCTCT CCTCGTACGA CCATTCTTTC GCGGACATTC GATTCCATTA CTCACTCAAA AATCTCTTCG ATCAGTTCGG CAAGTTCTTC CTCCCAGGCC ATCTACTTCT GAGCCGTACG CTGGAGCGTG GATGCAAGAG GGGGATGTCG TGGAAGGTCG CTATCATGGT AATTTTCCAG GTACGTCGTA ATTGTGAGAA GGAAGCACAC GTCCAGACTG GTAAGTAAAA CTCACAGCCA GCATTTTCAT TGTTCTCCTA GAATGGTATC GTGGACGCAT AGTGTCCACG AGTGCTGACA AAGATGTATG GGACGTTGAA TACGACGATG GCGACGAAGA TGTTGGACTC TGTCGTAACT GCGTACGACC GTTTGTTCCG TACGCCCTGA ACGACGACGT AGAGTGGAGA GACGAAGAAG ACATATTTCA TCGTGCTCGT GTAGTCAAAA TTCAGTCGGG TGATGTGTAC GATCTCAAGT TTGAAGACGG CAGCATACGT AGTAACGCGT CAGCTACCGA TCTACGCCGC GTCCCGTTGT TGGGGGAAAT AGAGGTAGGA TCTCGCGTTG AATTTCTGGT TGATGAAGGC TACAATACGG GCACAATATT GCATGTGAAT GTGGATGGGT CCTACAACAT TGAGTTTGAC GACGGCGACT TCGCCACCAA CGTTGCACCA AAACACGTAA TTCCCGAATA GGAGAGGAGA GATACCCGAG CACAAAAGGT AATGAACTAC GAAAGCTGGA AAGCAGTAAA GAATTGAAAG CAAAGTTAAC ACGAAACGAC GTGAGACGCG TTGCGATAGT CTATCAAGTT TTCAAGGCTA AATACTGTTA ATTTCAAAAA AGTATCGTTC CGACACTTC
|
Protein sequence | MSFPLISWAK YNAWTKPQSN AISPGVLHPN LSRRRNSLTT CKQLEFWLPN IALLEGLEIL PLSGMPFLLV LRKTWIAFRH LVQSKRRRRQ RSFFFSVAIL GSLVLLQTKH MHTVSTTPET VYSLNHGQAI EDSAPSLTWE FHGYSQSNIP ARSTSPSTPR LLIAQYASAF YTVVLNETQR VNQAYAERFN HDFIVCRGIY LTDSPWWRLV SPPLHTIAGS RSTYNKIAVL AYAMQHDYDR VLILDSDAMV RDFSINLATY SLTDDKGKDV VVVAQQAKMG EVHPPNTWNV NIGVTLWNLR HEQVVTVWQQ WHDRSIARIR SGQADDDQQP LQRCFRAFPD TTRPVLAVKE FGYGGGSIVQ HFIRESSSSW SEPTEARTEG IRTAARKIGF PAIGSSLKSE PAKSLLQMSL GTSPRPVGNA ADLCVRAHVA HTVPETHPYC QNAQKQDALG SHAIWKANET KKDSYTVGSR RTAPMIPTTT SPVLIYTGVV LRTLLLDVPL LLALILYSTT SWLEYVKTNY MLPQLELQRW TPERAEQEVT YFHRRCDESD QSAHDTEPLV IDYSSMSKRD KMEHMLTHGV SVYPNLLSAE TANEVRDFIL AQNLKNEDMI DVIENTNRWS FGVRVDQHPS VSKALKEVLN KPELVEGLEA ILGKNPAIIE FTGITSAYGA TAQRWHQDVV PEGSAAKYSR SFVPSYSLFI PLQNTTKAIG ATDICPGTHM CAAGPIHFCE YSGFPVSGAA DNWPLGWGAL VNQQTTHRGA PHVDPHGPSR VLFILTFAPR PQFTPSKLET RMISTGGSYS LHWSQWGHTL RDFQDPDTRM KHPWRALRAL GLYKPRDAQW GWDYLSQASG RVANDEEGFH RESLDGQLSK GGLTFLPDWL QGHASTDEET SYAWVEFMED TLRLCSHTTQ KFYLAVVFGY GSFVVIWNGF LFAAGRRHFR VKAIGRSMLR VLLLHAVILS IEEFARRRLA VTDWAKSIRG SRLYRLPSPD QNLPLPGTLL LLEDVLILDQ FQSEHLGSYD RILDFAHPGN RRFNTMILQH SKGYTALPFS LKRSLRADVL LWNKQDGSRI LAKNVDGAWA EVAQETAEKA CHKKLTRASN SGVEHVSRQL DYLKAENIYG FWRHTSMYLR HNPVLLDRLE RKLLGWNETS KNSSAGLSSS NGSLLVRPFF RGHSIPLLTQ KSLRSVRQVL PPRPSTSEPY AGAWMQEGDV VEGRYHGNFP EWYRGRIVST SADKDVWDVE YDDGDEDVGL CRNCVRPFVP YALNDDVEWR DEEDIFHRAR VVKIQSGDVY DLKFEDGSIR SNASATDLRR VPLLGEIEVG SRVEFLVDEG YNTGTILHVN VDGSYNIEFD DGDFATNVAP KHVIPE
|
| |