Gene Information |
Locus tag | PHATRDRAFT_30585 |
Symbol | |
ID | 7198343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 175129 |
End bp | 178302 |
Gene Length | 3174 bp |
Protein Length | 942 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184578 |
Protein GI | 219128770 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTCGTCCGG GGTCGCCTGG GGCGAAGAAG ATGATCACAA CGAGGAGCTT GGACTACACG ATCAGGTGCA GACTAGCCAC CAAGCCAAGG TACAGGAGCT GCTGAAGGAA TCAGATGACG AGTTTAAACA ACAACGCAAA ATACGAAAAT GGGGAAAATT TGCCAACGTA ACTAAACGCG AAGACCTGCA GGATGTCTTA CAGGAGGAGC GCAACGCAAT TGATCGAGAA AATGCGCTGA AGGCATTCAT GGCCCGTTCG AGCGGGATTG AGCTGCAAGT CTTGGATCCA CGGGATGAAA TGGCAAGCGG AGTGCCGTCC GTCTGGGATG AAACTGGAAA CGTGCAGATC ACCGGTGGCT CGGTGAAATC ATGGTTTGCC GAAGTGGACG AGGATCTCGA GTCTGAGTGG CAAGCCCTCA TGGGCGCTGG CGGTGCTGCT GGTGACGTGG CGGTAGAAAA GACATCTGGT GAATTGGTCG CGAGGGACAA ACTAGCGGGT ATACGAGTAG GAAGTGCGGG CGGCTGGACG TTGGAAGTCT TTCCGGGCGA TTTTGTTGTT CACCGCAAAT ACGGTATCGG TCGATTCGAG ACGACGTGCT TGCGGCCAAA AACGAAGCTC AACGAAGAAG AACGACTAGC GCAAGAAGAA AGAAGGGCTG AAATTCTCAC TACCGAATTA CGCAAGCGCA AGCGGGTAAC ACCCGACGAA ATTCAAGAAA TACGTGCAAG ATTTGGCACG GAAGAAGATA CGGACCCACT ATCCAATCCA CAAACTACTG TCTTGGAGAT TACGTATGCA GACGCCGTCG TGCATGTACC TGTCGATCGC GCATACCGTC TCAGTCGGTA TCGCGCTGGG GATGCCGTGG TCAAACCCAA ACTTTCCCGT GTCAAAGGTG AAGCATGGAG CAAGGCGAAA CAAAAGGTGG AGGAAAATAC CTTACAGCTG GCACAGGATG TGCTGGCACT CTACGCAACC CGTGAAACAC TCCAAAGACA ACCCTTTGAT CCATCAGTGG AAGACGTTGT CCAAGAATTT AGCAAGTCGT TCCTGTATGA ACCGACGACG GACCAAAAGA AGTGTTTCGA AGAAATTGAA AACGACATGG TTTGGCGAAG TCGTCCAATG GATCGTTTAA TTTGTGGTGA CGTTGGCTTC GGAAAGACGG AAGTGGCTAT TCGTGCTTTA TTCCGGTCCA TTATCAACGG TCGCCAAGCG GCCTTGCTAG CACCTACTGG AGTCTTGGCT GCCCAGCACT ACAAAAATAT TGTCAAGCGC ATGGGACCCG GCACAGAGTA CAATATAAAC ATTGCCTTGT TGCGAGGGGG GATGGGTAAA CAGACCAAGG CTGGAAGAGA ATTGCGTGGA GAGATTGAAG GAGGCAAGAC ACAGCTTATC GTGGGAACCC ATGCACTCTT GTCCAACGAA ATGAAGTTTA AGAACTTGGG TTTGCTGGTA GTCGACGAAG AGCAACGGTT TGGTGTCAAG CAAAAAGAAC GCCTCAAGTT AATCTGTGAT GGAATCGATG TTTTGACGTT GTCTGCTACC CCAATTCCTC GTACTTTGCA AATGAGTTTG AGTGGAATTC GCGATACATC GACAATTCGG TCGCCACCGC CGATGCGAAA ACCCACAGTC ACGCACGTGC AGGATTTTAG TGAAGATATT GTAAAGACTG CCATCTCGAC AGAACTGGCG CGTGGAGGAC AATGCTATTA CGTAGTTCCT CGTATTTCTA TGCTTGATGA AGCCGAGCAA ACGATCCAAA GCCTGTTCCC AGGAATACGC ATCATTCAAG 
CGCACGGCCG AATGCAACGC AACGGCGCGG AGGAAAACGT CGCCGAATTC GCCGAAGGCA ACTACGATGT TTTGCTCGCT ACGACGGTCA TTGAAAACGG TGTTGACATT CCTTCCGTCA ACACAATTGT CGTGCAAAAC AGTCAAGCTT TTGGAATGAG CACCCTGTAT CAGTTACGTG GTCGTGTTGG TCGTTCTGAC AAGCAAGCCT TCGCGTACTT TTTGTACCGC GAAGAATCTA TCACGGAACA AGCAGCTATG CGTTTGCAGG CAATAGGGGA ACTTTCAGAA CTTGGCTCCG GATTCGACGT GGCGAATCGA GATTTGGAAA TTCGTGGAGC CGGAAGTTTA CTGGGAACGG AACAGAGTGG TATGGCGGCC AAAGTCGGTT TTGATTTGTA CATGCGCATG TTGAAAAAGA GCATACGCAA GCTCAGGGGT CTCGACTTGC CTCTAGTACC ACGTACTAAC ATTCTATTTC CGACAGATGG ATCGCCCAGT ACCTTTAGCT TGCCAATGTC TTTCATAGAG CGTCAAAGCG AACGTCGCAG TGAAGAAACC AAGGCTCGTC TGGCCGAAAG CACTTCAGCG TTGGTCACCT TGACCAATGA GTGGAAATCT AAATACGGGT CGCTCCCCTC CACCCTGCAA AACCAGCTCA AGACTTTACA TCTGCACGCT TGTACTCGTA GGTTGGGAAT TGATCTCGTC GGTCTGGTGG ATGTTTTTGG CAATGGGAAG CGCATCGATT GTATTCTGCG TTCACCGGGT CTTCGCCCGC GGCACTGGGC CACGATTGTC CCAATGCTGG CCAAGGGTAT TGCCCCCAAG GGTTTAGACG TTGTATTTCC TGCTCGTTTC ACGGTCACAG GTGAAGAAGT AGAAGTGAGA GGTGGCCGAA AGATGAATCT ATTAGAACTC GTCAAGGAAG AGACTTTCAA CGAAGAGTTG GAAGAGGAGG ATTGGGACGC CATGGACGAA GAAGAGGTCG AGGCAATGAA GGACATTAGT TCGGCCGTAA ACGTTTTGGA TATGGACGAG GTTGATCTGG AGCAGTATCC ACGTTTTGTG GTGAGGGATT TTCAGGATGC CGACAAGGCC GTTGACCGCC TTTTGAAATT GCTACTGCCG GTTGCCAAGA TCGTATATGA GAAACAAGAA GACCAAGCGG AAGCCGCTCG CATGGCCGCA GAGCTTCGTG ACAAACAAGA GCTTTTACGC CAACGAAAGA AAACGAACGA AAAGCGAGAA GCCCAGCGTC TGGGTTACCA GTATTAATTG CCCGGCAGTC GAGTTTCCAC TCGTTACCAC TCGTAAGAGT TAAACGCTCT GTACAACTGT AGCTAACATA ACACGTAAAT TTAGTTGTTG GTCT
|
Protein sequence | MARSSGIELQ VLDPRDEMAS GVPSVWDETG NVQITGGSVK SWFAEVDEDL ESEWQALMGA GGAAGDVAVE KTSGELVARD KLAGIRVGSA GGWTLEVFPG DFVVHRKYGI GRFETTCLRP KTKLNEEERL AQEERRAEIL TTELRKRKRV TPDEIQEIRA RFGTEEDTDP LSNPQTTVLE ITYADAVVHV PVDRAYRLSR YRAGDAVVKP KLSRVKGEAW SKAKQKVEEN TLQLAQDVLA LYATRETLQR QPFDPSVEDV VQEFSKSFLY EPTTDQKKCF EEIENDMVWR SRPMDRLICG DVGFGKTEVA IRALFRSIIN GRQAALLAPT GVLAAQHYKN IVKRMGPGTE YNINIALLRG GMGKQTKAGR ELRGEIEGGK TQLIVGTHAL LSNEMKFKNL GLLVVDEEQR FGVKQKERLK LICDGIDVLT LSATPIPRTL QMSLSGIRDT STIRSPPPMR KPTVTHVQDF SEDIVKTAIS TELARGGQCY YVVPRISMLD EAEQTIQSLF PGIRIIQAHG RMQRNGAEEN VAEFAEGNYD VLLATTVIEN GVDIPSVNTI VVQNSQAFGM STLYQLRGRV GRSDKQAFAY FLYREESITE QAAMRLQAIG ELSELGSGFD VANRDLEIRG AGSLLGTEQS GMAAKVGFDL YMRMLKKSIR KLRGLDLPLV PRTNILFPTD GSPSTFSLPM SFIERQSERR SEETKARLAE STSALVTLTN EWKSKYGSLP STLQNQLKTL HLHACTRRLG IDLVGLVDVF GNGKRIDCIL RSPGLRPRHW ATIVPMLAKG IAPKGLDVVF PARFTVTGEE VEVRGGRKMN LLELVKEETF NEELEEEDWD AMDEEEVEAM KDISSAVNVL DMDEVDLEQY PRFVVRDFQD ADKAVDRLLK LLLPVAKIVY EKQEDQAEAA RMAAELRDKQ ELLRQRKKTN EKREAQRLGY QY
|
| |
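The coordinate and length fields in the record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly) and that the coding region ends with a single stop codon. The function names `span_length` and `min_cds_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the record above.
START_BP = 175129
END_BP = 178302
GENE_LENGTH_BP = 3174
PROTEIN_LENGTH_AA = 942


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


computed_length = span_length(START_BP, END_BP)
cds_length = min_cds_length(PROTEIN_LENGTH_AA)
# Bases in the listed gene sequence that the protein-coding part does not need.
non_coding = GENE_LENGTH_BP - cds_length

print(f"span from coordinates: {computed_length} bp")
print(f"coding bases for {PROTEIN_LENGTH_AA} aa + stop: {cds_length} bp")
print(f"remaining bases: {non_coding} bp")
```

Under these assumptions the coordinate span matches the stated Gene Length, and the stated Protein Length needs fewer bases than the gene sequence contains. Since the record lists the gene as unspliced, the remainder would presumably be flanking sequence around the coding region rather than introns.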