Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49919 |
Symbol | |
ID | 7198619 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 290700 |
End bp | 294011 |
Gene Length | 3312 bp |
Protein Length | 1040 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184773 |
Protein GI | 219129179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.099413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTCCAA CTTGCATTAA AGAGGCTTTC AAGGATCCAC ACGGGGAAAA CTTCAGGTAC TGTCCTATTT GCCGGGAAGC ATGCGCTGTT GATATGATCG AATCTGTATG CGGACCGGGT GCAGTGAAGG AGATGGAACG ACAGTTGCGC AACAAAGTAG AAATAGAAGT CCAGAGAGGG ATGGACAAGA AGCAGGAGGA AAAGCTCAAA ATGAATGAAA GCAAGCAAGT GGCCTTGAAG CTCTACCAGG ATCTCTGCGA ACGATTGAAT ATGAAATGTC CGCGATGTGA ATTGGTCTTT GATGACTACA CAGGCTGTAA CGCACTTACA TGCCGAGGTC AGAGCTGCAG AGCCGCCTTT TGTGCCATCT GCTTGAAGGA CTGTGACACG AATGCCCATC ACCATGTTCG TACCTGCCAC GGGGACCTGT TTGACAAGAA GGCGTTCAAT ACGGCGAGAA AGAAGAGAGA AATCGACACC ATCAACGATT TTTCGGCCGA AATCAGCGGA GAGCCACACG AGGTGAAAGA ATTGGTTCGA ATCGAATTTG AAAAGTCCCA GTCAGACCAA ACACAATATT GTGAGGGAGG TTTCCCGTTT GCTCCATTCC TATCTAGAGC AAAGAGAGAT CTTCTTGCGG CAGTCAACTC TGGACGACTT TCAATTTTAA GCGACGCGGA AGCATATCCT GTCGAAACTG GTCTTACTCG ACGTGACATT TCTCCGCGAA ATGTCATCCC CGAGAACTAT AGGCTTCGCC TACTACCATC CATGGAAAAT ATTTATTCAA TTATTCTTGA AGAACAAGTG CACACCATCA ACGGTATTGC GTGGAAGAAG ATAGCGTTGC AGGACGAAAA AGAAATACGC TCCAAAGACT TGGGCAGGCC AACAGTAGAC GCTTTGAAAA ACATTGCGAT GGCATTGTCT TGCGGAGTTG TAGCCTTTGT AGGTACCAGC TCTCTGTACC AAAGTTGTAT TTCACGAAAA GAAGGCAAGT ACGACCGAGA CGAGGCGAAG GATCCTACCA TCTGTGTACA GTTCCACAAA ATTTGTCGAA ACGGTAACAT GAAAAAAAAC GGGCAATCAC TATCGGAGCT GGGATTGGAG GAACGTGATG TAATAGGCGT CGACCAAAAT GTCCGTATGC TTATATTAGC GGATCACGTC TTGAAATCTT CGGATGAGTC GATGAGCTTT GAACCGCTCC AACACTTTGT TACAGGTCGG CAGCCGTCTC GAGTTTTTAC ATCTATTTCA ATGCCACCAC CCCCCAGCTT CTTGACCTTG AACAACAAAC AGCAAAAGGT TGCGCACCCT CTTTCTCTCC TCACTGCAAT GGAAGTTGCC GGTCCTCCAG GTACAGGCAA AACAAAAACC ATCATGGAGC TTGTTAGGGG CATTCTTCAC TGCACAGACT ACGATGTCAT TCTCATGTCG GAGAGGAACG GCGCCATTGA TGCTATTGCC GAAAAGATGG CTGGCGATTG TTTAACCTTA AATCAATCTC AGTCGGTAAA GAGCGTTTCA AATGTGGAGC TCTGGTCCAA GGTCCTATCA TTTGGTTCGG TTGGAGGCAT GGGCCCATTC TCCGCTTTAT TTACCTCCAC CGCCAAAGAA TTGTATGCAA TATTTGTATA TTCTTGCTTA CGGAAGTGCT GGACTTGAAA CCTCTCACAA AGATTTTGAC TCTTCACTAT TGACAGGTAC CACCCAGAGG TGCTGGAAGC CGACCGAGTC TTGAAAAAGA AAATCAAGTT CATGGAAAAT TACTCGAGGC GGTTGAGAGA GGCACTCAGT AGGTCGATAT ATGATCTCGA GGGAGAATTA TTTGAGGAGT CGAAGCTCAG CATGAGGGGT CGACTTATTC AAAACAACAA GGAGAATAAT CTAGAGAACG CCCGTGATAT CATAAGCTCA ACCATCAATG CTCTGGGTGC AGTGAAACAG TTCCGAGCCG ATAGCCCAGG CAAGCAGAAT TCTGATCTTG TTATCCTTTC ATTGGAGCAC TCTGCTATCC TCCGAGATCT GATTCCCATC AATTGTGAGA AAGCGGATCA CCACTCTGTT CCAAAGTCAA TCACTTCCAA CCCGAAAGCA ATCAACCGGG CTCTGGAAGC CATAGAAAGA CGACTTCAAG AAGTCCTTCA GAACAAGTTA TTTACTGTTG CGGCATGCGA AGCCATGGAC TACTCGATCC GCCTATCGCA AAGCTCGCTT CAAGACGTTA GATTACGCAT CAAGATCGAT CTGCAAAAGG ACGCTCGCGT TTTTTTATCG ACCATTGGTT CCTCGCACAA AATAAACAAG AGTGTTCTAG AAGCCCACAG AGTAGAACCA GAGATAGTTT CCTTGGACGA AGATGCATTG AACGAAATGC AGCGAATGAA TGAACAGACA AAGCCCACCA TTGTCATTTT TGATGAGGCT GGTTGCATTC CTTCGTATGA GCTACTAGGA CTTTCTCGAT TGGGACGATC AATTAAATCT ATAATCTGCG TCGGGGACAA GCATCAGCTA CCACCCTACA ACCCTGGATC AACGAAAAAC GACTTTAAAA AAGGAGGTTC ATTTGGCAAC GTAAGGAGGG GAAAGCCAGT ACGGCAGCCA GAAAAAGTAC AAAGTTTACT GGATGCAAGT GGCTTGCGAT CAGAAAAAGT CAAAGTTGAA CTCACGGAGC AATACCGCGT TCCTCGGGAT ATTGCCGGTG TTTTGAACGC TCGTATCTAT CGTGGAAATT ACCAGACTTC TGTTCATTGC AACGCTCCGA TAAAAGGTTT TCGTCTCGTG AACGTTCCAA AAAGTGGCCG CGACCAACCT TACGTGAATC ATGACGAAAT TGACGCTTGT ATTCAGCTCG TGGAAAGCTC CCTGCAGGCA GGGTTGAAAC ACACAATGGT GCTGACACCG GTAAGACGCA TGGGCTTTTC TTTCCCTGGA CCACTACGCG CTATGCCTCA ACGGAACTAA CACTCGGTTT TTGTCCAAAC TTCCAGTACA AAAAACAGCA GCGGGAAATG GAGTTCAGAT TTAAAAAGAA AGGGTGGAAT GACATTCTTT CTGTACTGAC AATTGATCAG TGTCAAGGCC AACAGGCTGA TATTGTAATA CTCAGTCTGG TCCGCAAACC AACGCGATTT CTTGACAAGA ATCGTCTCAA TGTGGCGCTG TCGCGGGCCT GTCAAAAGAT GTACTTCCTT TGCGACAAAA ACCTATTTGT TGAAGCGAGC CAGAATCAAG CCTGGGAGTG TCACCTTTTG GCGAAGGATC TGCTTGATCT AGCCGGTAAT TGAGGCAAAG AAAACAAAGA CC
|
Protein sequence | MCPTCIKEAF KDPHGENFRY CPICREACAV DMIESVCGPG AVKEMERQLR NKVEIEVQRG MDKKQEEKLK MNESKQVALK LYQDLCERLN MKCPRCELVF DDYTGCNALT CRGQSCRAAF CAICLKDCDT NAHHHVRTCH GDLFDKKAFN TARKKREIDT INDFSAEISG EPHEVKELVR IEFEKSQSDQ TQYCEGGFPF APFLSRAKRD LLAAVNSGRL SILSDAEAYP VETGLTRRDI SPRNVIPENY RLRLLPSMEN IYSIILEEQV HTINGIAWKK IALQDEKEIR SKDLGRPTVD ALKNIAMALS CGVVAFVGTS SLYQSCISRK EGKYDRDEAK DPTICVQFHK ICRNGNMKKN GQSLSELGLE ERDVIGVDQN VRMLILADHV LKSSDESMSF EPLQHFVTGR QPSRVFTSIS MPPPPSFLTL NNKQQKVAHP LSLLTAMEVA GPPGTGKTKT IMELVRGILH CTDYDVILMS ERNGAIDAIA EKMAGDCLTL NQSQSVKSVS NVELWSKVLS FGSVGGMGPF SALFTSTAKE LYHPEVLEAD RVLKKKIKFM ENYSRRLREA LSRSIYDLEG ELFEESKLSM RGRLIQNNKE NNLENARDII SSTINALGAV KQFRADSPGK QNSDLVILSL EHSAILRDLI PINCEKADHH SVPKSITSNP KAINRALEAI ERRLQEVLQN KLFTVAACEA MDYSIRLSQS SLQDVRLRIK IDLQKDARVF LSTIGSSHKI NKSVLEAHRV EPEIVSLDED ALNEMQRMNE QTKPTIVIFD EAGCIPSYEL LGLSRLGRSI KSIICVGDKH QLPPYNPGST KNDFKKGGSF GNVRRGKPVR QPEKVQSLLD ASGLRSEKVK VELTEQYRVP RDIAGVLNAR IYRGNYQTSV HCNAPIKGFR LVNVPKSGRD QPYVNHDEID ACIQLVESSL QAGLKHTMVL TPYKKQQREM EFRFKKKGWN DILSVLTIDQ CQGQQADIVI LSLVRKPTRF LDKNRLNVAL SRACQKMYFL CDKNLFVEAS QNQAWECHLL AKDLLDLAGN
|
| |