Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47331 |
Symbol | |
ID | 7202393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 311802 |
End bp | 316510 |
Gene Length | 4709 bp |
Protein Length | 879 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181699 |
Protein GI | 219122742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.217105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGATTGAGTG ATAAGAATAA ATTCACTATT TGATTGCCAA TCATATAATC AGGCTACGGG AAAATTTTAA TGAAGCAAGA CGCTATGAGC CTGCCGCGCT TCGCTCCCCC TTTTCGGAGA TGTGTGACCT TGTCTCATTG GTGGGAGTCT TCCGGCCACG ACAGAATGTT ACACACCACC AGTACAGGAG CGAGGAAACT TTTTTTTGGC GTCTTTTACT TGATTCTCTT GCTCAGCTAC TCATTTTCTG CGCGTGCGTT TGCTTCGACG CCGAATGGGC GGCGTCGTTT GAAAATATCA GCAAAAAACG CTATTTCAAG CGAGGGCACG AACGAAATTC AGACTGTTTC AGCTTGTTTA CCGAGAAGTC CATTAACAAA TCGAAGTCGA GATGATTTGT CGTACGAATT TTGGGGTGAA AAGTTATCGT TGGCGCACTT TAATATGGAT TTACAAAATC TGGCGTTGGA CGACCCGGCC AAGGCGCAGG ATGCCTTAGA GATCATGCAG GAGCTTCACT GGAAAGACCC CGAGAATCCG TTTACGGTTG AACCAGATGC AACGTGCTAC GCCACAGTAA TAGAAGCTCA TATTGGAGTA GGGAACGTCG AATCTGCATG TTCGATACTC GACTACATGG AACGAATATC GGACGAGGCT ATCGAAGGTG GCCGCTGTTC ACCACTGGTG CCGACGGAAC GAATTTACAT GCTGGTTGCA CAAGGCTGGG CAAATTGTGC CCGAGACGAC ATAACCGGTC ATCCAGCATT TGAAGCTGAG GCATTAATGA GACGAATGCA AAAACGCAAG ATGGAACCGA GCGTCAAGGT CTGGAGCATC GTGGTGGAGG CTTGGTGCAG ACGAACTGGT ACCGTTCGTG CAGCGATGCA GCGAGCCGAG TCCTTACTCG AAGAAATGGA AGCTTCAGTT GATAAAAAGG TAGGAGAAAG GGAAGCAAAA AGGAGCGCAG TGCGTCCTAA TGTATTAACA TACACTAGTT TTATCGGAGG ATTGGCGAGG AGCAAAGATC GCGATTTGGC GACACGAGCA GAAGCTACTC TCGAGCGAAT GGAGCGATTT GGAGTACAGG TCGATATGGT AGCATATACA TCTGTACTCA ATTGCTGGTC AAAATCTGTC AGTAAGCGTG AAAGATTGCA GTCGGCGTCA CGTGCATTGC AAATTTTGGA CAAAATGGAA GGGATGTACG CCAGGGGAGT GTACCACGTC AAACCAAGCC TTATTACTTA CGCAACAGCG ATTCGTGCTG TCGCAAACAG TCTTGATCCA AGAGCTCCCA AATTGGCCGA AGCTATTTTA CGAAGAATGT ACAAACTTTA CGAGTCTGGA GCTATCGAGA ACCTGAAACC GGTCACAACC ACATACAACG CGGTTCTTAA CGCCTTGAGC CGTGCGAACG GAGTCTCTCG TGTTAAATTC GCAAGAAGAA CAGAAGTTTT GATGAAGGAA ATGTTCAAGC GAGCAGAAGA AGGCGAACGT GATGTTATGC CTGATGTACG AACGTGGGCA GCTGTGCTGA GGGCATGGTC GACTTGCGGC CAACACGACG CTGCTGAAAA CGCCGAAAGG GTTCTACAAA GGCTCGAGAC CCTACACCGG AACGGAAGTA CGACGGTTCG GCCAAATTAC GTCTGCTATA CAACTGTAAT GGGAGCATGG GGACATTCAA GAAGACAGGA TTCTCTTGAC AAGATGGAGT CTCTTTTGAA GCTGATGGAG GAAGGCTATG AGAAAACACA AGAAGCTGAT ATCCGCCCAA ATACTGTTTC CTATGTAACA GCAATTGATG CATTCGTAAG AAGAAATGAC AACAATGCAG CCACCCGCGC GCAGGAAACA GTCGACCGTA TGATGCGTCT GTATGCCTTA GGTCTCGGCC ATGTTCGACC GACGCGAATA ATTTTCAATA CTCTGATACA TGCCTGGTCA AAGTCGAAAG ATCGAGAAGC GGCCAAGAAG GCCGAACAAA TTTTCCAATG GATGGAAGCT CAGTACGATG CAGGAGACTA TCTCGTGCGA CCTGATGAAG TGAGCCTTTG TGCGGTTTTA AATGCATGGG CAAATAATGC AGAGAACGGA GGCGCAGACC GGGCTATTCA GATTTTCAAT CATATGGAGG CTGTTTCTTT AGAAAAAAGA GGATTTCACG TATCGATCAT GATGCCTAAT ATTGTGATCA AAGCTATAGC ACGAAGTAAA GACAAGGACA GTTTCCGTAA GGCGGAAGCA ATTCTTTTGA AACTTGAGGG CGACTACGAG AGAGGATTGA CAATCGTTCA ACCTGATGTG ACAACTTACT CTTCTGTAAT TAACTGCTGC GCGTATTATC GATACGCTGA AGGAAGAGCA GATGCGTTTG AAACTGCAAT GAGAACATTT AGAAAGGTTT CTGTCCTCCA CAATGCAAGT CCAAATAACA TCACTTTTGG AACTTTGTTC AAGGCAATCA CAAATCTACT GCCCGAGAGC GAAAAGCGAG AACAACTTGT GGAAGAGTTG TTTGATCAAT GCTGCACAAA TGGTGTAATG GATGGTTTCG TACTTTCTCA ACTTCGGAAC GCAAGCCCAA GACTTTACCG AACGCTTGTA TGCAAAACAT GTGATCAGGG AAACTCACGA AGTGGGGATA ACATTGACAG TATATTGAGA CGGCTACAGC CGGAATGGAG TCAGAATATT GTCGACTAGA CAATGATTAG AATTACAAAC ATGGATTGTA TACCGACTGG GCCTCTACCT TGTGTTGTAC TTAAAATTTC GCAGAAGTCA AACTCTTCTT GGAGTACATT TACTGAAGTT TCAGAAGAAA TGTCCGAGAC CAGTTGCTGA GGCTCTTGTC AACGTAGCTT CCATCGACGA CTGGAAGGTT TCTGCCCCTG GTAGTCTACC GATGTCGCTC TAATTTGCTA GCAGGGTGCT CACTTGTTAC AATTTCCATG CTCTTGTAGA GTTGAAGGTC TTCAGAGAGT GAATCTGGAG GCGTGCTCGC TTCGTTTTGT TCCAAAGGCG AATTGAAAGA GAGAACAGAA AGTATTCAGT TCCGGTTGAG CTAGCTCGAG GACCGGAGAC GAGGATTTAG ATATCTCCGG GATATTTTAC TCCTGCAGCT GTCTGATGAG AGGGTGTGGC AAGCAATTCA AATTGCGCTG TTTCGATATT GGCGTCAACG CCTTTGCCGT ATCGATCTTT TGAGCCATCT GCCAAAAAGC GAATGCCTGT AAGTATTTTC TGCTCTACTA TAAAATCAAA ATCTTTGATT TGAGTACCAG CGTCTCACCT TCAGCTTCAA GCCTGCTCCG AATTTCAATA TTGCATTTCG CACTGGCCAC TGGATTGATC GATACATCAG TCCAATGAGC GACGAAGTTC AGGTGCTCGC CTTTTTCGGT TTCAGCTTTC TTTGATGAAT CTGGCTGTGA TCCGACACTA GACGTATCGC CCTCCTCGGA GTCAACTTCT TCATCTATTC GATCCAATCT TAGACTGCTC TTTATAGTGT GAGAGTCAGA TGTCAGATCC GATGCGCTCC AGGTAATTGA TTGTCCATTC AACTGGTGCG GATCTGTTGC CATCCAGCCA TATGCGAGTT GAGCGTTGTC TTGAAGCTCA CTAAAATAAT GTCGCCTCTT TGCTTGGTGA TGTGCAATTC GATTTTTTAG GGATACGGCG CAAGCGTCGT TCATATTGGC AACCTTCCAT CGCCAAATCC CATAGAAAGA CACTCCTGTT ATGCAAAAGG CCATCATCAC CACAGCTACC GCTACCACAA TAACGGAACT TGAATTCATC GCGCTACTAT TGTCTGTGTT GTTATCCAGG AGAACGAGAT TTTCGCCAGT TGAATACTCA AAGGAGGTCT GGGCCACGTT TGGGCCAATA AGTTCGCGCA GAACATTCTC ATTTCGCAGA AGACTTTCGA TTGTAGAGTA AGCTTTGACC TCCACTTCGG GTGCTCCCGA CTCCAGCAAA ATGGAAGTAA TACCACGCAC AGTACGACAT TCATTGTACG CTGACATTGA GTTGCATTCG CCTACAAAAA AACATCAAGG AAATTGAAGT CAGGAGGCTG CCTGAATGAT GGCAAAAACA GTTTGCAAAA GCTATTCATA CCACCAGTCA AAAACCAATG ATTCCTAGAA AGTTGCACGG CAAACAACGG TTGTTGGAGG TCGTCACAAT CATCGAGAGC ATGGGCGACG GCTGTTGCGA GCAATTGCTC CACATCATCA AGATTCACAG CTGTATTACG TAAAAATTCA ATCGAGTACA GGTAGAACAG GTCAAGTCTT GCTTTGAAAT TTCCTACTCG AACCCTATCA CAGGTGCTTG AGATGATACT CTTATTCTCT GGTTCCCGAC TCAGCGAAAG AAAAGTCAGA TCAGCGGAGC TCAGCCCCTT TGACCATTTC CAGCAAAGAA GGGAGACCAG TAAAGTCAGC GTCCATGACA TTGTTCAATA GGCCTAGGTT GCAGGTCGAG TGATACCTGC CATTGAAGAC AGGTCCGTGT TAGTGCTGCT CCTTTGGGTT TTTTTTTTTT TCTGAGCCAA TTTTACAAAG TTCGATTGCT TGAGGTGCAG CGGGACGGAT TGGCTGCTTT GGTTTTCATG TGAAGCACAT GGTTGCGAGT AAATACAAAG ATCAACTTAA ATTGCAACGA GCATTTGTCT AGAATATTTA AATTCGATC
|
Protein sequence | MKQDAMSLPR FAPPFRRCVT LSHWWESSGH DRMLHTTSTG ARKLFFGVFY LILLLSYSFS ARAFASTPNG RRRLKISAKN AISSEGTNEI QTVSACLPRS PLTNRSRDDL SYEFWGEKLS LAHFNMDLQN LALDDPAKAQ DALEIMQELH WKDPENPFTV EPDATCYATV IEAHIGVGNV ESACSILDYM ERISDEAIEG GRCSPLVPTE RIYMLVAQGW ANCARDDITG HPAFEAEALM RRMQKRKMEP SVKVWSIVVE AWCRRTGTVR AAMQRAESLL EEMEASVDKK VGEREAKRSA VRPNVLTYTS FIGGLARSKD RDLATRAEAT LERMERFGVQ VDMVAYTSVL NCWSKSVSKR ERLQSASRAL QILDKMEGMY ARGVYHVKPS LITYATAIRA VANSLDPRAP KLAEAILRRM YKLYESGAIE NLKPVTTTYN AVLNALSRAN GVSRVKFARR TEVLMKEMFK RAEEGERDVM PDVRTWAAVL RAWSTCGQHD AAENAERVLQ RLETLHRNGS TTVRPNYVCY TTVMGAWGHS RRQDSLDKME SLLKLMEEGY EKTQEADIRP NTVSYVTAID AFVRRNDNNA ATRAQETVDR MMRLYALGLG HVRPTRIIFN TLIHAWSKSK DREAAKKAEQ IFQWMEAQYD AGDYLVRPDE VSLCAVLNAW ANNAENGGAD RAIQIFNHME AVSLEKRGFH VSIMMPNIVI KAIARSKDKD SFRKAEAILL KLEGDYERGL TIVQPDVTTY SSVINCCAYY RYAEGRADAF ETAMRTFRKV SVLHNASPNN ITFGTLFKAI TNLLPESEKR EQLVEELFDQ CCTNGVMDGF VLSQLRNASP RLYRTLVCKT CDQGNSRSGD NIDSILRRLQ PEWSQNIVD
|
| |