Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48786 |
Symbol | |
ID | 7195100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 235859 |
End bp | 238738 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183448 |
Protein GI | 219126404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAG TACCACCAAG TTCAGCTCAG CATGAAACGC TTTCCGAGGA GGACTTTCCG CTCACCTCCG ATCGCCGACG GCCCCGAGAA AAGACTTCGG TATCGAATTG TGTAATGCCA CGGTTTGCTT CTTGCTCTAC GATAGTTTTT GTGTCCCTTT TCTTGTTCAT AAGTGGCAGT TCATGCTATG CTTTATCGTT GACTTCATCG GCGTCCCTTT CGTCTTCTCG ACTGTCATCG AAGGCGCTTT CTTCGCAATT GCTGGCGCCG ATGGATGTTT CTCCGCCGTG GATGGAAGCG GCTCCCGTAG TTTGGAAACA TCAGTTGGCT ACCCTTCACA TGAAGGTTTC CGGACGAATG CCGGTAACCC ACATGTTCCT CCCTACACAG AAGCTATCGG CAGTGGCCTT GTCGCGCTCG GACTGCGCCG GCACACGGCA CTACCGTTCG TCGGAAGAAT TGAAAGACGA ACACCTGCAA CTCTGCCGTC GTGGACCACG GGGGACTCTC GGGTTTGAGG TCTGCGAAAG CACCGCACTC GAGGCATCCC GTTCCAGCGT CGATTCGCAT CAATGGAATG TTCCCCAAAC CTGGAACGCG CTGACACCGA GTGGACGAGG CAATCGTATT TGGGGCAGCG ACAGTGCCAC TTGGAGTGGT GCTGTTGCGG ACTCGGCCGC CGCACCGCCT CTTTCCAGTA CGACGCTTTG GTTACCGTGG ATACCCACTC GTCAACAGAT TGAAGATCTA AAAGTTGTGG AACTGAAACA AGTGTGTGCC GAACGAGGAT TGCTGCGCAC AGGGAACAAA GCAATCCTGC AAACTCGCCT TCTGGATTGG ACTCGTGAAC AAATTACTCG CAATGTACAA ATGTCGAGCT CGGGTCTCAG TTTTTTGGAT ACCAGCGAGC CAGTTCTCGA GGAGGAGGGA AGAGAGATGC GCCCATCCGA TCAACCGACC ATGCAGGCGA ATTCGCTGGC AGAATGGGCA CGTACCGTAG ATTTAGAGCC GCTTTTACAA CGGAGAGAAA CTATCCATCG CGAAAAATTG GAAGGGAAAC CTATTTTAAA GCCGTCACGC CAGACCCAGA CCGTTAACAT GCCGCGTCAA GAATACCTTT CCGTGCTAAC ACGAGTCTTT GAAAAGCCGT CGTCGCCGTA CTCGAATCGA GAAGTAAAGC AGATGTACGC AGCCTCCAAG CAAGCCGATC AAGTTGGCGA CCGAGATCTT GCGAAACGGT TACTGCAAGA ACTCGTTGTC GCAACTCCAC ACGATGCGCG GTTGTACCGC CGTTTGGCAC GAATGTACAA GGAAGAGGGC AACGTTTCAG CGGCACGCGC AATACTCCAA CAAGGCATTC GCGACCATCA CGCAAAAAAT GGCTATTTGT GGCACGGATT GGGAAGTATG GCAACCTCAG ATGCGGACGC CAAACACTAC TGGCAGAAGG CGATTGAAGT GGATCCGGCC CTGCCACATC CGTATCACTC GCTGGGGACT CTTGAACATA AGGAAGGTCG GATCGCCAAT GCGATGAAAA CCTTGCAAAA GGGTGTTGCC TACTGCCCCA CCTCTCATCG TCTACACCAC GCCTTGGGTG AATTATATCG CGACGCGAAA ATGCTCGACA TGGCGGCAAA ATCATATCAT CGAGCGATCC AGCATGGACC GCCTGTAAGC CACGGTTTTG CATTCACTGG CCTCGCATAC GTGGCGTACG AACGAGACGA TATACATGGT GCTCGAAGAT GGCTGCGAAA GGCAATTGTA CTCAACAAAG GCCGGCATGT GAACAGTTGG GTTGCCTTGG CACAGATGGA AGAAAGCATT GGTGATATAG ACTCTGCACG CGCGACTTGC GTGGCTGGTC TAGCTCAGTA TGAACGGGGC TTATTACAAC GCAGCAATCG TGGTCGACCA TGGAAACCGA CAACGGAGCG AGCCTTCCTG GAGGATCCGG TGGCTCTAAA GGACGAGTTT TTGCGGCAGG TACCCGTGTA CAGATCTGGC GATCGCTTTT TTAATTTGTA CCGCAACTGG GCACGGCTGG AAGAACGATA CGGAAATCGG GACTCGGTGA AGGAAGTTTA CAGGCGAGCG ACTGTCGCCT TTCCGAACGA GTACAGATTA TTACTAGACT GGGCGCAATA TATGGTGAAA GAACAGCGGG ACGAGACGGC TAGGCAACTC TTCGCGAAAG CTAGCACGAA AGCTGCTTCG AAACATGCAG ATCCTCATCG AGTGTACGCT GAGTTCGAAA TGTCACGAGG TAGGTATCTG GATGCCCGTG AGATTCTTTA TCGTGGTGCC ATGGTATTGT CCAAAACGAC CGATAGTGGC GGCAGCGCTG GAAATCGATA CGGACTTGCG GAACTTTTTC ATACCTGGGC TGTGTGCGAA TGGCATTTAA ACGAACTATC TCGTTCCGAA AGTCTGTTTG ACCATGCGCT GCGTATGACC AATGCCGGTG AGGACGGTTC GAAACTGCGA TCACTTATCT TATATTCGAT GGCTCGGCTG CAATATTATC GAGGCGAGCA CTTGCTAGCA CAGCATTGTA TTGGTTTATG TCTCAAGGAA AACCTAATGC CCGGAGGAAA TTCCAAGATC TGGGATCTTT GGTCTGATGT CGCTACTCAA ATGGGCAATC CTTCACTTTC ACAAAGATGC CAGGAGCAAT CAGAGGCATC GAAGTCGAAT GAAAACGCCA ACGGGACAAC CGAGCTTTCG GGACTGTTAG AGCACCCTTC AGGTTTGACG CGAATGAAGG GACCGGACAT GGAGCAGCTA ATGAGACGAG ATCCTTGGCA TCGAAAGATC TTTGGGACAC CACTGCGGCC TACTGCCTCT CTAGGGGTCA ATTTGCCGGA GGTTCTATGA
|
Protein sequence | MQEVPPSSAQ HETLSEEDFP LTSDRRRPRE KTSVSNCVMP RFASCSTIVF VSLFLFISGS SCYALSLTSS ASLSSSRLSS KALSSQLLAP MDVSPPWMEA APVVWKHQLA TLHMKVSGRM PVTHMFLPTQ KLSAVALSRS DCAGTRHYRS SEELKDEHLQ LCRRGPRGTL GFEVCESTAL EASRSSVDSH QWNVPQTWNA LTPSGRGNRI WGSDSATWSG AVADSAAAPP LSSTTLWLPW IPTRQQIEDL KVVELKQVCA ERGLLRTGNK AILQTRLLDW TREQITRNVQ MSSSGLSFLD TSEPVLEEEG REMRPSDQPT MQANSLAEWA RTVDLEPLLQ RRETIHREKL EGKPILKPSR QTQTVNMPRQ EYLSVLTRVF EKPSSPYSNR EVKQMYAASK QADQVGDRDL AKRLLQELVV ATPHDARLYR RLARMYKEEG NVSAARAILQ QGIRDHHAKN GYLWHGLGSM ATSDADAKHY WQKAIEVDPA LPHPYHSLGT LEHKEGRIAN AMKTLQKGVA YCPTSHRLHH ALGELYRDAK MLDMAAKSYH RAIQHGPPVS HGFAFTGLAY VAYERDDIHG ARRWLRKAIV LNKGRHVNSW VALAQMEESI GDIDSARATC VAGLAQYERG LLQRSNRGRP WKPTTERAFL EDPVALKDEF LRQVPVYRSG DRFFNLYRNW ARLEERYGNR DSVKEVYRRA TVAFPNEYRL LLDWAQYMVK EQRDETARQL FAKASTKAAS KHADPHRVYA EFEMSRGRYL DAREILYRGA MVLSKTTDSG GSAGNRYGLA ELFHTWAVCE WHLNELSRSE SLFDHALRMT NAGEDGSKLR SLILYSMARL QYYRGEHLLA QHCIGLCLKE NLMPGGNSKI WDLWSDVATQ MGNPSLSQRC QEQSEASKSN ENANGTTELS GLLEHPSGLT RMKGPDMEQL MRRDPWHRKI FGTPLRPTAS LGVNLPEVL
|
| |