Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49261 |
Symbol | |
ID | 7195711 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 397399 |
End bp | 400687 |
Gene Length | 3289 bp |
Protein Length | 524 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183986 |
Protein GI | 219127529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.439634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCATGTTCT TTTTCAAGCA ACACATCATT CTATCGAGGA AAATCGATCG TCAAACCCCA AAAGTAGGTA ACCATGTTGT CTGTGAACAC GAGAAGGTTC CAACGGAATC ACAGCAATCA TTGTTGGGAT GAAAGGGGGA AAGCTGTGCC GTCTATTTCA ATATGCCCAC GAAATCTGAC TTTTATCGGA AGTTTTTCCG AGACAAATTG CGTGATGTCG AATGCATATT TGTGCATCTA CTGTGGAAAT CGGCTGATCG CTGAAAAATG TGTTCATGCC ACTCGGCGGC CCGTTTTCAT AAGAATCCTT GTTGAAAATT TGCTAAGGTC ACGGAGTCAT GAAATAAGTC GTGGTACACG ATTTTGCTCC GCGAAAGTAC AGCTTTCATT GTGTGAAAAG GATAAAGAGA ACGATCGGCT TCGCGAAGCA CGCTTCCCAA ATTTCCACCT TCTAGTTTGC ATCGTTGATC ATCCTGGGTT TAATTGGCGA CCGATAGATG CAAAAGAATT TTCGGATTCT CTCTCGGGTT GTCAATCGGA AACCCTTGCC ATTGTTGAAA ACAGGAGGCG AAATACGCGT CTTTCTTCGA TCGGAAAGTT CGCTCTTCCT CCACGTGTTT CGAGCTCTGC TTTTTCTTGC TTCTCCGTGA CTGACTGTCC TTGGGATTTG AATTAGCCAC AGAATCCCAT ATGCAAATAT TTTATTCAGT CCCGAGCTCG TATGGTATCT ACCGTTTACT AGGAACACTA ACGAACTTTC TTCCTCTTTT CCTTACATTA GAAACAAATA AAACCCAACT CGACCTTTTG CAATGACCAA CGGAAATGGA TCCACTGTTT TGATGAGTCC CACGGCTTCA ACGTTGAAGC GCTCTTTGCG TTCGGCGACT CGTGTTCAGT CGGAGCAGCT TGAAGAGGCA GCCCTCATTA AAGAAGAGAA CCAGATGTTG GATGACGAAG AGGTCACTTC GGAAGAAGAG GGAGACGGCG ACGGCGACAC GGTGGTTGAG CGTACCATTT TCACGGTTGC TTTAGTGAAC AGTGTGGTTG ATTCCCATCA GTCGTCAAGC GATTTGTCGC AAAAGTCCGA TGGGTCGGCT GACAATCGCA AATACGGACT GCGCAAACGA AGGCGTCAAT CGGGGGAAGA TTTGAAACGT CTGGAGCACA ATCAGATGAC CAAAGATGGA GGGCTTGCTC GTCAGAAGGA TAGGGCAGCC ACGCAAGAGT GCGATATATT TGGCGAAACG ACAGCTGAGA TTGCTTCCGC TGCGGCAATC CCATTACCCC CCGCCACTGC ACCAATAGAA ACCCATAAAA AAGTACAGAC TCAGCCGGTG ATCACACAGT TGCGAGTAAA AGATCCACCA CACGTCGCTC GAAAAAACAA TCGACGTCCA CTGAAGCCTC CAAAGCCGAT GTCTACACCC ATTTTGCCTT CATCGAGTTC TGTACCTAAT CCTCTCTCTA AGCCTCTCCC TTCGACTCTA ACCCCTAGCT TTCCAACAAA GACAATCAAA TGTGAACCTG AACAAAGCTT TAAATCAGTT ACAGTGCCAT GTCCTTTACC AGCAGTAGCG TTTGACGAAA AAGTTGACTT TGACGGGGAT AAACGTAAGG TCAATTTCAA TGACATGGTA GGGACTACGC GCCTTCGAGG ATTCTCCATT GACATGGACT GTAAGTTTAT TTTTTTGAAA ATGTCATGCT TTTTCTTGAT CTGTGTTTAA CACTATCGAC CGACGAAATC CTCTTGTTTC AGCGGTTGGT CTAGATTTTC CTGATGATAA CTCAGTAGCC GAAACTGGGG ATCTTCCTCT TGTAGGAGGA CGTCGCGATC GTGCGTTTTC GTTTGAATGC TTTGCGTTTG GAATCAATGC CGACGAGCCG CTTCCACCTT TGGAACAGTC AGTAATGTCG CATCCCGGAG GAGGTATTGA CTCAATGAGT GGGCGTCTTC GTGGTGACTC GATTATTTTT GATCCCTCCA GTTTTCAGGA AGGTGGTATT CACGAGCAGA CTGCGCTGGA GCGAAATCGG GCAACTGCAG AGGCACTGAA GCCAACTCTA GTGATGCCAA AAACAGTTTC TGTGTCGCGA GAAACGCTGG TTCTTCCTCC TCCGTTACTT GCATCCAATA CCACAGTACC ACTTCCAGAA GTCCAGTCTT CGATGCAGTC CCTGGCGATG ACGAATTATG CAGGTACCAC CATGCAGAAT CCTGCAAGTA CTACCGTGAC GACTCCGGGC GGGTCGGCTA CTTTTTCGTT GGAACTGCTC AACAAAGATG GCCGGATTGG AATCTACTTG CCGGACGCAC GCCGCGCGCG TATCGCTCGC TTTCACGCGA AACGGATTAA ACGCATTTGG CGCAAGCGGA TCAAATACGA TTGTCGCAAA AAGCTGGCCG ATTCGCGGCC TCGCATCAAA GGTCGCTTCG TGAAGCGCTC GGATATGGAC GATGAATAAA ATACTCTTAT CTCATAAAAT ATGTAAAACT TTCTCTTGAT AGACAATGGC TCTGTACTTT AAACTTTGTT TCGAGTGAAG GCCATAAGGC TACAAAAACG AACCGCCTAC GAAAGATCGG ATACAACACG TGAAGTCTTG AGCCACGGGT GACTCATGAT ACACAGTGCG GCACACCTAA CACCAACATA CCTACAACCT CTTTTGTAAA CAACATGCGG TATACAGTTA ATAGCTACCA GTAAAGGGTC CAAATGCCCA GAAATCAGTT CCAGTGGAGA TAGCGTAGGC CACTCGACCG CCCACAACGG CAACTATACC TGACCAGACA AAGGTAAGAA CCAACGATTT CTTGTCCACC TTGGAGTAGT CAATCTTTTT CCCTGTTCGA TCCATTTCCA TCCGCGCAAA GGCCCAGTCT TGCGGATTCC GCAAGTACGC ATCCATATTG CCCTGAGCCT CATCCCGACT GCACTTGTCA CCGGCCATGA ATTTCAAGAC GGCTTGCTCG TAGTCTTCCG ATTGAAAAAC GCGTTGAAAC CACGGTTTGG GGCCGTGCAT AAGCAAGGTA CGTGTAGCAT CGTAGCCTGC TTCGGGACCT TCCTCTGCTT TGGTGGCTAC CCACTTTTCG CCGTCGAACT TTGGAGCCAT GAATCGTGCC GACTGGGATA ACGATCCCGT CGCCGTCGGG ACTGATGATC TCTGAGGCTG GGGAGTCGTA AAGGCTTGCG CCAGACCGAA AACAAAGAAC AATATAAGCA AGGTTTTCAT CATAATGAGG GCACGGTGTT GTACAACAGA TTGCCGGCGT GATCTCTAC
|
Protein sequence | MTNGNGSTVL MSPTASTLKR SLRSATRVQS EQLEEAALIK EENQMLDDEE VTSEEEGDGD GDTVVERTIF TVALVNSVVD SHQSSSDLSQ KSDGSADNRK YGLRKRRRQS GEDLKRLEHN QMTKDGGLAR QKDRAATQEC DIFGETTAEI ASAAAIPLPP ATAPIETHKK VQTQPVITQL RVKDPPHVAR KNNRRPLKPP KPMSTPILPS SSSVPNPLSK PLPSTLTPSF PTKTIKCEPE QSFKSVTVPC PLPAVAFDEK VDFDGDKRKV NFNDMVGTTR LRGFSIDMDS VGLDFPDDNS VAETGDLPLV GGRRDRAFSF ECFAFGINAD EPLPPLEQSV MSHPGGGIDS MSGRLRGDSI IFDPSSFQEG GIHEQTALER NRATAEALKP TLVMPKTVSV SRETLVLPPP LLASNTTVPL PEVQSSMQSL AMTNYAGTTM QNPASTTVTT PGGSATFSLE LLNKDGRIGI YLPDARRARI ARFHAKRIKR IWRKRIKYDC RKKLADSRPR IKGRFVKRSD MDDE
|
| |