Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45879 |
Symbol | |
ID | 7201117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 559373 |
End bp | 562495 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180260 |
Protein GI | 219118987 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAATC TACAGTTGCA CCATCGTCCT TCTGCTAGAA TTGCGATGCA GAAGCTTTTG TTCAAATCGT CGCATCCCCG CGTTCCTTCT CGTTCAGTGG TACAAATACT ACGTCACCGC ACGTTTTCAA CCAAAGCTGT CTTTCCAAAT ACTTTGACCA CTTCGCGGAT ACAAGGAGAT CCAGACAATG GTCGGCAAGA TTCTGTGATG GGAGAGAGGG TTGCCTCCCA AAAATTTGGC GGCATGGTGC ACTTCGTTCA AAAGCTCCCG ACAATTTCGT CACCGACACA AACTGGGCTT CGCTCGGCAT TTCTGTATTG GCTCAGTAAA CCAACTCGTC GAGTAGCGTC GAAGCGAAAC TTGCCGGAAC TCCCCTTAAA TTTTGCTCAG AAAATTCTGG ACTACTGTGT TGACCAAAAC GACCCAGCCC TTATCGAGTA CGTCGTCGAC AATCCGGGCG TGTCTTGCAA CGGCGGCTTT GACCGACTGA TATATCTCTA TCTCGAGCCA TTGCAAGGGA CAGGAGCTCA ATTGCGCGGT CACAAATATC GAAAGAGCTT GGAGCGGTCA CCCGAACAAA CAATTGATAT GCTTGCCAAG GCGTCGGCTG TGCGGGCTCT CGCGGATAGA CTGCACCGCG ACCCACGGTA TCCCAACATA GTTCCAGATT TGACTACGTG CGAATCGACG CTGTACCTGT GGAGCAAACG TTCCCAGTTT CTTGCAAACA ACAATCCATC GTGGCGTGAT AGTGTGGCCC AAAGTAAGGC GGATTTGGCT TTTGGCGGAA AGTCGAATTC ACAGGAATGT ATCGATGCCA TGAAGGAGTT TGTTTTCCAA AGAAAACAAA GTATTCACGG ACCGCAGCCA GACACTGTTA TGTATTCTAT CTTGCTAACG GCAATTTCTC AAGGAAAGCA CTTTGATGCA GCGGATGAAG CCTTCTCGCT ACTGAAAGAA TTGGAACTCG ACGATTCTGT CAAAAAGACC ATTCATTTGT ACACGGCAGT CCTTTTGGCT TACCGCTCTG AAGTCACACA TTCGACACGG GCACAAGAAA AGGCTATCGA ATTGTGGAAT CATATGATAA GTATGGACGA CCCAACAATA TCGCGGAATC CAATAGCCGC CGGCATTATG ATGTCCATGT TTGCGAAAGT TGGAAAAGCT GAGGAAGCTC AAAAGCTCCT GGACGAAATG GAAGCATCTG CTAAGGAAAA GAGCGAATAT CCGACGCGCA TTCACTATAA TACGCTTCTT CATGCTCTTA CGAAGGCACA ACTAGACGAC GCTACAATAA GGGCCGAAAA AATACTTCAA CGTATGGAAT CTCTGGCCAT TAACAACCTT CGCGATACAT TCCCGGACCG CATCAGTTAT ACATCGGCTC TAAATGTTTT TCTCCAAAAC GGAGGAACCG ACTGCATTGA AAAGGCTGAA GCAGTTCTTG ATAATTTGGA AGAAAGCAGC GGCAGAAATC TTCGTCCTGA TAAAATGACG TACAGTAGAT TTATGCAATC TTTGTCAATG CGGCGTGGGC GAGAAGTTGA TACTCAAATT AAGGAAAGCT TCTGCATCAA GATCGAGGAA GTCCTCCAAC GCTGGCGTCG TCGCTCAGAG TCTGATGTGA CGGTCAAGCC TCCAGATCTG GAGGCGTACA AACTCTGCCT TCACGCGTGG GCAACTTCTT ACAGCACCCT CTCGCCGGAA CGGGGAATGC TTCTCTTTAA TGAGATTGAA TCTCGTTACC AAGCCGGTCA AAGAAATTTA CGCCCGGATG TTTACACTTT CGAATCTGTG CTGCACTGTC TGTCTATAAA GGTAGATGAA GCGTCCGTAC GTCTTTCCGA ATCGCTCCTT CGGAAATTAG ATGAGTACAA TATTTCCCAA ACCGGTTGCA TGGTTAAGTA CTACATCGGT TTGGTTGCTA GGCAAAATGT CCAAAAAGCG GCTACGATCC TGAATGAATT GGAAGACAAC TTTGCCTCTG GCGCAAGTTC ACTTCGACCT AACGAGCAAA TATACAATGC TGTAATTCGA GGATTTTGCG TGCGTGAAGA CGGAGCACTT GAAGCCCAAC GCCTACTGGA CAGGATGAAG CGACTGGCTC TTCTACCTGG TAGGACTGAC TTGACTCCAT CAGCAGTTGT CTACTCTTCA CTCATTGAAG CATGGGCAAA ATCGGGACGA AAAGATGCAG TTGACCATAT TGAAGCTTTA TTTGCCGAAG TGTGCGATCG AATGATTCCA AACCATTTTG TTTATGCGAC ATACCAGAAC GCCATCAGCC GCTCAAATCT CCCCGATGCT CCTGAGCGCG TTGAAGCGAT ACTCACAAAA ATGCAGGAGG ACTACGAACA AGGGCGCAAC AAGCTAGGCC GTCCAGACGC AAACAACTTT GCAGCTGTGA TTTCCTGCTG GAGTTTTAGT CGACATAAAG AGGCTGCCGA ACGTGCTGAG GCAATCTTAA ACAGAATGGA AAGCCTCTAC CTGCACAGCT TAAAATATGC TCACCTAAGA CCAACAGCGC GATGCTTCAA AGGCGCAATT GCGGCGTGGG CGATGAGTGG TCATCCAGAT GGAGGAAAAC GGGCTCTGGT ATTACTGGAT CGGATGAGTA TCGCTAGTCG AGGCCAAAAC ATTGTCCACT TACGGCCATC TCGGGCCTGT TACGATTATT GTATTGTAGC TATCGGTCGA TCGAAAGATT CTAATAGGGC AAGGAAGTCT TTGGATCTTT TGAAACGCAT GCAGCGAGAC GTACGGGAAG GATATCGACA CTCACAGCCG GGGATTTCGA CGATGGAAAA CATACTAGAA GTATGTAATA CGTACGCGCA TGCTCTGGCG AACGAACGAG AGGAAGCCCT CGAAGTGGCT GAGAAAGCAA TAGATCTATT CGCAGAAGCA GACGGGGAAG TCCGAGATAT GGTCAATGTT TATACACGAT ATGTTTGGGT GTTGAGGCAA CTCGTGCAGA CGTGTGAAAA ACGCGACGAA GTTGTCCATA ACGTAAGAAA GAAATGGCCC GAACACATTC TGAGCGCGTC GGATGTTAAT AAGGCTCTCC ACAACTTTGA AACTTCGGAA TTACCGACAG AGCTCAAATC CGTAGAAGAC TGA
|
Protein sequence | MRNLQLHHRP SARIAMQKLL FKSSHPRVPS RSVVQILRHR TFSTKAVFPN TLTTSRIQGD PDNGRQDSVM GERVASQKFG GMVHFVQKLP TISSPTQTGL RSAFLYWLSK PTRRVASKRN LPELPLNFAQ KILDYCVDQN DPALIEYVVD NPGVSCNGGF DRLIYLYLEP LQGTGAQLRG HKYRKSLERS PEQTIDMLAK ASAVRALADR LHRDPRYPNI VPDLTTCEST LYLWSKRSQF LANNNPSWRD SVAQSKADLA FGGKSNSQEC IDAMKEFVFQ RKQSIHGPQP DTVMYSILLT AISQGKHFDA ADEAFSLLKE LELDDSVKKT IHLYTAVLLA YRSEVTHSTR AQEKAIELWN HMISMDDPTI SRNPIAAGIM MSMFAKVGKA EEAQKLLDEM EASAKEKSEY PTRIHYNTLL HALTKAQLDD ATIRAEKILQ RMESLAINNL RDTFPDRISY TSALNVFLQN GGTDCIEKAE AVLDNLEESS GRNLRPDKMT YSRFMQSLSM RRGREVDTQI KESFCIKIEE VLQRWRRRSE SDVTVKPPDL EAYKLCLHAW ATSYSTLSPE RGMLLFNEIE SRYQAGQRNL RPDVYTFESV LHCLSIKVDE ASVRLSESLL RKLDEYNISQ TGCMVKYYIG LVARQNVQKA ATILNELEDN FASGASSLRP NEQIYNAVIR GFCVREDGAL EAQRLLDRMK RLALLPGRTD LTPSAVVYSS LIEAWAKSGR KDAVDHIEAL FAEVCDRMIP NHFVYATYQN AISRSNLPDA PERVEAILTK MQEDYEQGRN KLGRPDANNF AAVISCWSFS RHKEAAERAE AILNRMESLY LHSLKYAHLR PTARCFKGAI AAWAMSGHPD GGKRALVLLD RMSIASRGQN IVHLRPSRAC YDYCIVAIGR SKDSNRARKS LDLLKRMQRD VREGYRHSQP GISTMENILE VCNTYAHALA NEREEALEVA EKAIDLFAEA DGEVRDMVNV YTRYVWVLRQ LVQTCEKRDE VVHNVRKKWP EHILSASDVN KALHNFETSE LPTELKSVED
|
| |