Gene Information |
Locus tag | PHATRDRAFT_42980 |
Symbol | |
ID | 7196798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1684576 |
End bp | 1687527 |
Gene Length | 2952 bp |
Protein Length | 950 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176833 |
Protein GI | 219110163 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
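The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `coding_length` are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the coding region ends in a single stop codon. Under those assumptions the span matches the listed gene length, and the span is longer than the coding region needed for a 950 aa protein, which is consistent with the listed gene sequence beginning upstream of the ATG start codon and continuing past the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(1684576, 1687527)   # 2952, matches "Gene Length"
cds_len = coding_length(950)               # 2853 for 950 aa plus a stop codon
non_coding = gene_len - cds_len            # 99 bp outside the coding region

print(gene_len, cds_len, non_coding)
```

If the coordinates were instead 0-based or half-open, the `+ 1` in `span_length` would need to be dropped; the agreement with the listed 2952 bp suggests the inclusive convention applies here.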
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTGATCCGA AAAGATAGCT GACGACCGGA AACTCTCAGA TATATTTTCA CCATGCGCAT CGGAAAGCCC CTTCGCAAGT CCCGCGCTTG GAAGAAGGTT GGAGTCAATG GCACACCCAA ATATGTATCC CCTGAGAGCG TTGTTGGGAG AATCAATTCA GATATTATAT CAAGTGTTTT GGCACCGAAA ATTGCCGCAC TTGCTTCAGC GAGTCTCGAA CGATATGCTG GGGAGTTATT GGTATACGAA GACATTATGA AAAAGGTGAA TAGCAATGAA TCTATCGACG GTCTCTTGGA AAGCGATAGT TCCATTATTA TCCAAGACGG AATTCCACAG CAACCCCAAC GTCCTGTCTT TCATTCCCAG TTTTCTGTGG ACGAAGCGGC TTCCATATTT TCAGAAACAT CGGAACACTT TTTCGCAAAA GCGGGTAAAT GGAAAGCACA CGCCAATGCT GCCAAATTTG AGCGTATTCT GGATGAAAAG TACGGCATTT TGCGTCCATT TATTACGAAC CATCCCGAAA TTGAACATTT CATTCGGGGC GTTCAGCGGA AGTACGCCAT GGGGTATTTC AGTCCCTTCC GACAAGGCGA TCCGCCAATA CCCCGATCGA CTGCTGTCAT TATATTGTTT ATGATGCAAC GAGGTCAGAT GCGTTGGGAA ATAATGCTGT TGACTACCTT GTTCTTTCTT ATTGGCCTAC AACCTTGGGC TTTGGTGGCA GTTGTCGGAG TTTTACAAGG TCTTCTCATG CGACGAAAGG CAAAGCCTTT GGGGAAGATG AAGCGTTTCA TCCCTGCGGT AGAGTCATAC TACACGGATG CAAAAACCGA TACAGAAAAG CACGAGCTAC TATTGCATCC GGTTGGTGAA CCTTTGCCTA GCAAAGAGGA AATTGACGCG TCTCTCTTTG ATGCTCTGAT TCTTGGCTCA GGACCAGCTT CACTGTATAT CGCATCGTTG TTGTCGCGGG CGGGTAGAAA AGTGCTCGTT CTCTCTTCAC GGAACGACGC TAGTGGCTGC CTGAGTATAA AGCATGCCGA GTATTCAAAT GTCCCATTTG ACGTTGAAGC TTCGAATGTA GCCAAAATAA GCCGTCAGCA ACAAATCTTG GCCCCTGCTC TGTGTACCGA GACCGATACT CAGGGTGGAG TCCGATTTGC CCAGATTGGA TCAAATGAAG ATGCTCATGC TTTTGAAATA CTATCGATAC CAGGAATGGG AACAGATTCG TACGACGAAG AGTTACCATT TATTTTGAAT GCGGATGGTG GAACAGCCGG TCTCATAGAC GATGCTGCAA AGTATCTGAA TGATGGCTGG CCAGATGCGG AAGGCGGGAA TGGCAATTCT GTAACGGGAG CGTATGCCGC TGCGTGCGAA GCAATTAACA GTACAGCAAA CGAGTTCTAT ATTTCGAAGA TTCTCTCGGA AAAAGTCAAT AGTCTACGGA GCTCTCCTAC CTATCAAGAC AGTGGAATTC GTTACGCTCA GTCCTTCTTG AACAAAACAT TCACCATCAA CCCCCATACA CGGTCGTTGA TGGCGGGTAT AGGTATGAAA GGGGAGAACA TCCGACCTGG AGCGACAAGT ATGGCAGCGC ATGTCACCAA CATTAGCGCA GCTCTCAGTG GAGAAGGTAT GCACTATCCG ATCGGCGGAC CTAGGGCACT TTGCCGTGCA CTCGCCAACG TCGTTCTCCG TAGCGGTGGC CGAGTGTTGA CGTCGGTTGA TGTCGCTGAG CTAATATTTG GTGAGCCACG GGAACAAGCG AGCAAAGGAA AGCAAAAAGA 
AGGGGACAAC GACGGGCCAC CTCCACCTCG CTGCGTTGGA GTCAAGCTAT CAGACGGGCG AGAAATCAAG TTTGCGAGCG ACCGTTTTGA TGAAAAAAAT GGTTCCTGCT TACCCGCAGT TATTTCAATG GAAGGCTTCA TTTGGACATT CATAAACATG TTGCCGGATG ACATAAGGAT GAAGTACAAA GTACCACGTG GCTTGCCAGC TCTTTCGTCG CGGCGGCCTG TTTTCAAGGT TCTTTTTGCG TTGAAAGGCA GCGCCGATCA ACTCAATGTG ACGGGTGCTG ATTACTATCG GCTGCCCAAC GCAGCTGTAG CGCGAGACGA GTTTGATCAG TCCTCTGGAC AGATAAAACA CGGTGAGATT GGTTGGTCTG ATTCGGACAC TGGTGATAAC GGAGATGCTT ACGCGGATGG AGGTAAGAAT TTAATGGACG TCATCAACCA GGATCCTGGT TCCATCAGTG ATGAGCATAT TGTAAACTCC AGTAGAAAAC GAGCCCGAAA GACAAAATTT GAAGCTGGGT CTTCATGGCT CCACGTTTCT TTTCCTTCAG CCAAAGACCC TTCTTTTGAG GAACGTCACG GGAAGACCAC AACGTGCGTC GTCACTATTG AGGCGGATGA CGATTTTGTT ACCTATTTTG ACACGAAACC TAAGATCTAT GTCATTAAGA ATGCCTCGGC TACAAAGGGC GATCTTGATC GCTTGCTAGA ACGTGTCAAA AAGGATGTGT ACCATATTTT TCCTCAACTA AGGGACAAGG TGGACCACTG CGAAATTTGT GGACCTTTTC AGAAAGGGTT GAGTCACAAT CCCGAGAGAT TCGCCGCCAA AGGCATTCGA GCCGACACGC CTTATCCTGG TTTGTTCGTA GGAGGATCGG ACTTGACTGT CGGCGAGTCC TTTTCCGGTG ACATCGTCGG CGCCTGGTTG GCAGCGAACG CTGTTGAACA ATACGGCCCA CTCGATCACT TGTTCCTGCA AAAGAACATC ACAACTGACA TTGAGCAATT CTTAGAAGAA CCAGGCTGGG TTGATGAAGA GGATGTTGCA ATTCCGTACA AATCGGCAGA TGCAAAGAAG GACAAGGACG TCTAAGCGAC CAGTATGTTT TTCCAACCTT AAGGTCTGCG TCACGGCAAA TT
|
Protein sequence | MRIGKPLRKS RAWKKVGVNG TPKYVSPESV VGRINSDIIS SVLAPKIAAL ASASLERYAG ELLVYEDIMK KVNSNESIDG LLESDSSIII QDGIPQQPQR PVFHSQFSVD EAASIFSETS EHFFAKAGKW KAHANAAKFE RILDEKYGIL RPFITNHPEI EHFIRGVQRK YAMGYFSPFR QGDPPIPRST AVIILFMMQR GQMRWEIMLL TTLFFLIGLQ PWALVAVVGV LQGLLMRRKA KPLGKMKRFI PAVESYYTDA KTDTEKHELL LHPVGEPLPS KEEIDASLFD ALILGSGPAS LYIASLLSRA GRKVLVLSSR NDASGCLSIK HAEYSNVPFD VEASNVAKIS RQQQILAPAL CTETDTQGGV RFAQIGSNED AHAFEILSIP GMGTDSYDEE LPFILNADGG TAGLIDDAAK YLNDGWPDAE GGNGNSVTGA YAAACEAINS TANEFYISKI LSEKVNSLRS SPTYQDSGIR YAQSFLNKTF TINPHTRSLM AGIGMKGENI RPGATSMAAH VTNISAALSG EGMHYPIGGP RALCRALANV VLRSGGRVLT SVDVAELIFG EPREQASKGK QKEGDNDGPP PPRCVGVKLS DGREIKFASD RFDEKNGSCL PAVISMEGFI WTFINMLPDD IRMKYKVPRG LPALSSRRPV FKVLFALKGS ADQLNVTGAD YYRLPNAAVA RDEFDQSSGQ IKHGEIGWSD SDTGDNGDAY ADGGKNLMDV INQDPGSISD EHIVNSSRKR ARKTKFEAGS SWLHVSFPSA KDPSFEERHG KTTTCVVTIE ADDDFVTYFD TKPKIYVIKN ASATKGDLDR LLERVKKDVY HIFPQLRDKV DHCEICGPFQ KGLSHNPERF AAKGIRADTP YPGLFVGGSD LTVGESFSGD IVGAWLAANA VEQYGPLDHL FLQKNITTDI EQFLEEPGWV DEEDVAIPYK SADAKKDKDV