Gene Information |
Locus tag | PHATRDRAFT_43370 |
Symbol | |
ID | 7197407 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 264240 |
End bp | 268511 |
Gene Length | 4272 bp |
Protein Length | 1258 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177898 |
Protein GI | 219112293 |
COG category | |
COG ID | |
TIGRFAM ID | |
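
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. It also assumes that the coding sequence is the protein length plus one stop codon, times three. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values taken from the record above.
span = gene_length(264240, 268511)   # genomic span of the gene
cds = coding_length(1258)            # assumed: stop codon counted

print(span)        # 4272, matching the listed gene length
print(cds)         # 3777
print(span - cds)  # 495 nt of the span not accounted for by coding sequence
```

Under these assumptions the genomic span exceeds the coding length by 495 nt, which is what one would expect for a gene flagged as spliced; the record itself does not list intron coordinates, so this is only a consistency check.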
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACCATGGG AGGTGCTATG CCAATCTCGA CCATTAACCA ACATCGAAAG CTCTCAATTT GACACATCCC TACTGGAAGT TGTTTTCTCC ATGAGGACAA CTTCAAATAA TGATAGAGCC AGTAACGGGC AGAACAAAAG GCAGCCTTAC GGTGCAGTTC CTGCTTTCGA AGACGCAGAC TTGCTGGAAG CAGAAGGGAA TGGTTTTGCT TCAAATTGTG AACGACTCAA AACATTTCCA CCAGATACGC GTCGCACGTG GAAGGCGAAA TTAACGACAT ACGTGGCGAT TGCAATGGCG CTGATGTCCA TTGTGCTCTG CATGGGGATA ATTCTCAACG TTGTACTGGG AGGTACTTGG GGTATGGAAA CCATTGACAA CGAGAAGAGC CAAGCTACTC AAGCGAGACC TAGCAGTACC AACACGAAAA CTCTGCACTT GTACGACGAG CTTGGTCGGT ATATTTTGGA AGATTACGAC GCCCAGCCGC CCTTTTCAGA CTTTTTGCCA GCTTTGGCTG GTTATTTTGG CAAACCTTTG TACGCATTTT ATATCAATCG GGGTCAGGGT ATCGCTGCAT TTGGTGTGGA GTCGAAAGAA TATCCCATTA TGGAATTCCA TTCCGCCAAT GTTGCATACC AAAATACCGC TTTGTTGGGC TTCCGTACCT TCATCCAAGG ATCACGCCAG AAGAAAGACC AAGACTCCTT CTTCGTTGAG CCTTTCTCGC TTTTGCAAAC ACGATTTCCG AATATACCAC TGCACCACAA CGACATGGAT AAGTTATTGC CTTCTTCTAC CACACCGAAG CGGTATATGT ACGTGGGATC TAACGAAATG CAAATTCGCG AAGTGGATAC TGCTCACAGA CTAGAAACCA ACGTGACGTA TTTCGTACTT CCGGAGGAAG ATTTTGGATC GTTCGTCAAA CGGACTACAA TTACCAACAC TGACGCTAAG GAAAGTCTCA CATTGTCGTT TCTAGACGGG CTCGCGCAAA TTCAGCCCGC CGGTGGCGAA ATGGAAAAGC CGCTAAAGAC GATTGGACGA ACATTGCAAG GATGGATGGG AGTGTACTCG CCGTACGATG ATTCCGATGG AATGATGAGA ATGCCTTTTT TCCGGCTGTC GACGCAACCT AACGATTCCG CATCCGTTGT TGTACAAAAA GCTGGGCACT GGTGTCTCTC CGTCTTAGAA TCTGACAATG ATTCGACACT CTTACCTATC GTTTACGATA CGTCAAAAAT TTTCGGGCAA GACACAAGTT TATTGCATCC TGTTGGGTTG TCCAACAGAC CAATCAGCGA AATTCTGAAA GAAAAGCAAT ATGGGCGCGC CAAAACTTCC TCAGCTTTCG CTGCCGTGGA TCATATCAAG CTGGCTCCGG GCAAGTCCGT GACGATCTCT ACTTTCTTCG GTAAGGCAAA TAATGTGCTG GACATTCCCG TAATTTCCCG CAGAGTATTG CAACTTGGTT TTTCCCAGTA CAAGGTGATG AGGACCCGAG AAATTATTAA GCAAATTACG GCCGTAGTGG AAACCACTAC ATCTCATCCG CTGTGGGACG GACACGTTCA GCAAATGTTT CTTGACAATG CTCTCCGAGG TGGTGTTCCC TCTATTTTGG GCGAAATCGA TGACGATGCT AGACTACGCA ACGTAGACGA AGATCCACGT CTAAAAACAT ACCACTTGTT TTCCCGAATT CACGGGGATT TGGAACGTGA CTACAACGGT TTTGTGATCA AGCCAACGTT TTTTTCCGAG GTAAATGCTG CGATGACCTG TGGCGTGGCA 
GCGTTTTGCT GACTTTCTCG GCGAAATACC CGCACCTTTC TTCTCACGTT CTAACGGTAA CGCTTTTCGT ATAGGGACCA GGAAACTTTC GAGACATCGC ACAAAATCGA AGAAGCGATG TACTTTTTTA CCCTAGAGTC GGGTCAACCA ATTTGAAAAC GTTCTTGTCC ATGATCCAAG CTGACGGCTA TAATCCTCTG TCAATCGAGG CCAGTACATT TACCATAAGT GACGTTAAAG CCTGCTTACA GATTGCTACA GCAGCTGTAG GACCAGCTGA TGGGCATCGG GCACAGCGTG AAGCCTTAAC CGACATACTT CACAGTGGAC CATTTCGTCC CGGACAACTC TTTTTACTGA TGGAGGAGCA ACATATCGAT ATCATCATTG CACAACAAGA ATTTATCGAT ATTGTTGCCG CCGCGGCGGA CCAACATCCG TAAGTCTCTT AATTTAGATA TGCGCGCATA GTTGCGCACG TCATATCTGA CTATTCTCTA CCTTTGTGAA AACAGAATTG CGGTGTATAA AAGCGGATTC TGGGCCGATC ACTGGACGTA CTACATGGAC CAAATCCAGG CCTATCTAGC AATCTATCCA GACTGGGAAG AGCGTATTTT GTTTGATGAA CGGCTCCCGT ACTTTTTTTC ACCTGCGTTT GTGAAGCCTC GACACGAGAA GTATGTGTTG TCGGTTTCGT TTGATGGCAC CGGTAAGCAT GTCAGACAAC TGGATGCGAC AGAAGATAAC GACGAAGAAA AAAGAACTTA TATGGCTGAA TACATCGAAA ATAAGACGGG ATGGTACGAC TTGCAAGCCA ACTGGCAGCA TACAGCTTTA GAGGGCGGCA TTTTTCACAG CTCTCCTATG GCGAAACTGC TTTTGCTGGC CACTCTAAAA TTCGCTTCTC GAGATGCGTA CGGCATGGGC GTTGAATACG AAGGTGGGCG TCCCGGTTGG GATGATGCTA ACAATGGTCT TGTTGGAATG CTGGGTAGTG GCATGCCAGA GACTTACGAG CTTGTCGTAC TTTTGCGATA TATTCGCTCT GCGCTTCACC GTTTTGATCG CGATCTTTCG GTACCAGTCG AGCTGGTCGA CCTTATAAAT TCGATCAATA AGGCGCTAAA AACACTTTCC AGCGAACGCG GTGATGAAAG AAACAAAACA GAGTCGCTCG GAAGTGAAGT TCCTTCTTCT TTCTTCAAAT ACTGGGATGA GGTTGCGACA GCTCGAGAAT TGGTAAGTTG GAATATATCT GTATGCTGTA CCTGAAAGCC ATTCCGCTCA ACCCGGTTGT TTGTTGCGTC GGCAGTATCG CGATCAAACC AAGATCCTTT TCCAAGGCCT GACGGAGGTA CTTTCCTCCA AGACGATATG TTTAATGATT GACACTTGGC TAGGTGAAAT TGGACGAGGT ATTGAGCGCG CCATGTATTT CGGCTCCCGT GGTTTTAACG ATAACGGTAC ATCTGGCATC ACGCCCACAT ACTTTTCGTA TAACGTTACC AAATGGGAAA AGACTAAAGC GATCAATACC AACGGCCATC CGTTCGTTCG GGCAAAACAG TTTACACTAA ATCTGTTTCC ATTGTTTCTT GAAGGCCCAG TGCGAATGAT GAAAACATTA GATTCGGATG AAGCCAAACT CGTTTATGAC AGGGTAAGTT CGTCGCAACT GCGGGACAAC GACCTTAAGA TGTATTTGAT TTCAGCCAGT CTGAAAGGTC AAAGTTTGGA CACGGGTCGC GAAATGGCGT TTGCTCCCGG TTGGTTGGAA AATAATTCGG TATGGTTGCA CATGTCGTAC 
AAGTTTTATT TGGAAATGCT ACGGCACGAG ATGTTCGAAG CTTTCTTCGA GGAAATCACT TCGGGTGGTA TGCTGCCATT TATGGATCCG CAAGTATACG GTCGGTCGTT GATGGAGTGC TCGTCATTCA TTGCATCGTC GTCCTTTGAA GACCCGGCTA TCCGAGGACG TGGGTTTTTG GCCCGCTTGA GTGGGTCAAC TGCAGAGTTT TTGAGCATGT GGATCTTAAT AATGATTGGA CCGCATCCCT TTTACATTGA TAAGACAACG AATCTGGTAC AAATGCAGCT GGTACCTGCC TTGCCACGGT GGCTGTTTGC AGAGCAAGCG GATGACCAGA AACCAGTCAT TCGATTCAAG CTTTTTGGCT CCATTGAGGT AACGTACGTC CATGATAGGG GAGAGGAAGA TTTATACCGT ACGTTGCCTT CGAGATACGT CGTCGGATTG CGAGATGGGT CCACATTTAA AGTGGAAGGT CCATCAATTT CGAATGACTT GGCCGATAAA ATCCGTCGAG TAGCTTTTGT TGCGTCTATT GACGTTTACT TCGAGCATTG AAAAACTGAA ATATTGCTGC TGCTGTGAAA AATTGAAAAA AACCAATGCT TTGGTGTATG TGAATTTCTA CAATTTAATT AGACGCTCAT TTACAATTTC TA
|
Protein sequence | MRTTSNNDRA SNGQNKRQPY GAVPAFEDAD LLEAEGNGFA SNCERLKTFP PDTRRTWKAK LTTYVAIAMA LMSIVLCMGI ILNVVLGGTW GMETIDNEKS QATQARPSST NTKTLHLYDE LGRYILEDYD AQPPFSDFLP ALAGYFGKPL YAFYINRGQG IAAFGVESKE YPIMEFHSAN VAYQNTALLG FRTFIQGSRQ KKDQDSFFVE PFSLLQTRFP NIPLHHNDMD KLLPSSTTPK RYMYVGSNEM QIREVDTAHR LETNVTYFVL PEEDFGSFVK RTTITNTDAK ESLTLSFLDG LAQIQPAGGE MEKPLKTIGR TLQGWMGVYS PYDDSDGMMR MPFFRLSTQP NDSASVVVQK AGHWCLSVLE SDNDSTLLPI VYDTSKIFGQ DTSLLHPVGL SNRPISEILK EKQYGRAKTS SAFAAVDHIK LAPGKSVTIS TFFGKANNVL DIPVISRRVL QLGFSQYKVM RTREIIKQIT AVVETTTSHP LWDGHVQQMF LDNALRGGVP SILGEIDDDA RLRNVDEDPR LKTYHLFSRI HGDLERDYNG FVIKPTFFSE GPGNFRDIAQ NRRSDVLFYP RVGSTNLKTF LSMIQADGYN PLSIEASTFT ISDVKACLQI ATAAVGPADG HRAQREALTD ILHSGPFRPG QLFLLMEEQH IDIIIAQQEF IDIVAAAADQ HPIAVYKSGF WADHWTYYMD QIQAYLAIYP DWEERILFDE RLPYFFSPAF VKPRHEKYVL SVSFDGTGKH VRQLDATEDN DEEKRTYMAE YIENKTGWYD LQANWQHTAL EGGIFHSSPM AKLLLLATLK FASRDAYGMG VEYEGGRPGW DDANNGLVGM LGSGMPETYE LVVLLRYIRS ALHRFDRDLS VPVELVDLIN SINKALKTLS SERGDERNKT ESLGSEVPSS FFKYWDEVAT ARELYRDQTK ILFQGLTEVL SSKTICLMID TWLGEIGRGI ERAMYFGSRG FNDNGTSGIT PTYFSYNVTK WEKTKAINTN GHPFVRAKQF TLNLFPLFLE GPVRMMKTLD SDEAKLVYDR VSSSQLRDND LKMYLISASL KGQSLDTGRE MAFAPGWLEN NSVWLHMSYK FYLEMLRHEM FEAFFEEITS GGMLPFMDPQ VYGRSLMECS SFIASSSFED PAIRGRGFLA RLSGSTAEFL SMWILIMIGP HPFYIDKTTN LVQMQLVPAL PRWLFAEQAD DQKPVIRFKL FGSIEVTYVV GLRDGSTFKV EGPSISNDLA DKIRRVAFVA SIDVYFEH
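
The sequences above are printed in space-separated blocks of ten characters. Before measuring length or base composition, the blocks need to be joined. The following is a minimal sketch of that step, using only the first two blocks of the gene sequence as sample input. The helper names are hypothetical, and the GC fraction of this 20-base sample says nothing about the 46% reported for the whole gene.

```python
def normalize_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above, used only as a sample.
sample = "CGACCATGGG AGGTGCTATG"
seq = normalize_sequence(sample)

print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.6
```

The same `normalize_sequence` helper works for the protein sequence, where `len()` of the result gives the residue count.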
|
| |