Gene Information |
Locus tag | PHATRDRAFT_42778 |
Symbol | |
ID | 7196400 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1087639 |
End bp | 1090192 |
Gene Length | 2554 bp |
Protein Length | 675 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176720 |
Protein GI | 219109935 |
COG category | |
COG ID | |
TIGRFAM ID | |
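The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the reported Gene Length of 2554 bp is consistent with that reading). The function name `feature_length` and the variable names are hypothetical and not part of the record.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 1087639, 1090192
protein_length_aa = 675

gene_length_bp = feature_length(start_bp, end_bp)

# Coding sequence implied by the protein: three bases per residue plus a stop codon.
coding_length_bp = protein_length_aa * 3 + 3

# Whatever remains of the gene span is non-coding (for a spliced gene, e.g. introns).
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 2554
print(coding_length_bp)  # 2028
print(non_coding_bp)     # 526
```

The gap between the 2554 bp span and the 2028 bp implied by a 675 aa protein is consistent with the record's "Is gene spliced | Yes" entry.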
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00214847 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCAATCGT CGCAAAACGA AGTCATCATC CATGGCTACA CTGCTACCTA TAGAGATACA GCAAGACTTC TATACTCCTC CCTACTAAGC GATGGAGACA AGAGCAGCAA ATCATGTTGC TGCGGAGGAG CACAGGTGCG ACTTGGCAAA AAGTTCGCGA ATGAAAAAGG CAAGATCTGT CGATTTTGGA CGATTTCTTC GGTCCAGTCT TCGGAATCCG GCTGATAGCT CCACCGTTGT AGCATCGGCC TTCAGCCAAA GAGCTGCAAG GGACAACTTT TCTAACTGGT AAGTCATTGC TGTAGAATCA GATATGGCTT TTGATCGCCC TGCCGTAAAG GACGAACTTG CGATCCGCTC TATTTCCGTG GTTGATGGTC TTGGTGTCAC TGTCAGTGTG GAATAGTCGA CTGAGTCAAA ATGGTGTTTC ACCATTTTTT TTCAAACCGG CCGGGCGGTG ACGTGTTTTT CGTTACAATC GCCAGATCAA GCTCGTGCTC TGCAGAGACT AGCTTATGTT TCGACGGTGT CCGCCCAGTC AAGATAAAGG TTCTCTTTGT CTCTGGTCAA CTTCGTAAAT TTTGTGTCCC GGAGGCGTAC ACACAGGAAC AGCGATACCA CGCGCTTTTA CCGCGCCCTG ACAGTGAACA CATATTGCAT TTCTAACTCG TTTTCTTTGT AACTCTTACT TCAATGCGGT TAAACCACAG TACCTCTGTT TTCGCGCCAT CCCTGTCAGA ACAGTGCTCC GCAACAGCTA CTACAGACAA TTTGTTACTA CGAAAAGGCT GGCCGAACGG AACCGCCATT TCCGCCGCTT CTTCATTTCC CCCGTCGGCG TCCAGCTTCC CCATCGGAAC GTGGACTCTT GGCAAGCTTT CTGTCTTAGT TTTAGCGCTA GCATCTTTGA CAGATTTAGC ACTTTCTCAC ATATACCGAC TAGAATCACC AGCAGACTTC TATGACGCTC TGTCCGCCGA TGTTATTCGA CCGCCCTTTC GGTATAAGTC GAGTACATCC ATTGCACGCA CACGGTCTAG CCTTGGTTCT GCTGCGATTA GTTCTGTCAC AGAATTTCTT TCCCCAATCC TTCCCTTTAC TGGCGGGTTG GATCTGAGAA GGGAAGATTA TTGGACAACT TCAGGAAGCT GGCTAGAAAC ATTGGAGTCC TTGACGAGAC AAATCCAAGA GGCGCTGTTT TCGTCGGACG ATGACCGGAC AAGTCTACTC AGCAGTATAT CGCTTATTCG TAGCGGAAGT CTCTCCAACA AGATGTCGCC GCGGGTCCAC ACTAAGGGAA GAAATACGGG CACTTTTACG AAGCATGTGT CATCTATTTC AGCTCAGAAA CCGTTCTTCA GCGAAGATGA GATCGCTGAG CTTTCTCTCG GCGAGGTAGC GCAGGCCTTC CGATATGCGT CAGAAAGCTC GTCCACAGAT TTCAACGAGG ATAAGTTTTT GAACAGCTTG ACCACCCGAG TGCGCAGAAT GATTCTCTCC ATCAGGGAAG CTGTCTCCGA GTCGCGCGGC ATCGATGTCG AAGATGCCTG TGTTTTGACG AAGAATCATC GAGTGAATGG AAGCGTCGAT GCGCTGAAAT TTTCGGCCGC TATGAGAATA TTTGCGGAAT GGCGTATCCT CCGGCAGGTT CCAGAAGGAT ACAAGGGATA TGCTGTTGGA ATGAATCTTG GTCATAAGGA TGTCGTCCAA AACGTTGCAA AGATTGAACA GGCAGTTCAT TCTTGGCTAG ACAATCAACG AGACTTGCGT TCGCTATCAG AAATCGAATC GCAAACCGGC 
TGCCCTATAA TTGAATTGAG ATCCTCGAAT CTTTGCTCGC CAACGATTCG AGAGCTAATG CAAGACGAAG TTGACATGGA CATTCATCCA ACAAATCGTC TACCTCGCCT GAAAGAAAAA ACCGCGGCGA TGGGCATTCT TTGGGTGAGA CGACAGCTAC ATTACCAAAC AGGTGTTTTT GGCAATCTGT TAGTCGTACC GGAAAGCTTT CCGACAACAG AGAGAGCAGT GGCGTCGGCC TATAAGGAGG TTTACGACAA GTATCATGGT TGGGCTGTAC AAAAGATTTT CAGCTACTCT TTTCAATCAG CACCAAAGGC AGAAGAAATC TACCAGCATA TGAACCCAGA ACGGCTGAAA GAAGTCAAGG CTGCCGCTGA TGAATTGGTG TTGCATTTTG ACTCTGAATC ACGATGTTCA GCCAAAGGAA TGAACATTTC GCCTAAGGGC AATCTGATTG ATGTCATTTT ACTGAACGCT AGTAGGGAAT TCGAGAATCT TGTAGAGGCT TTTCTACAGC TTGTCAATTC TGGCATGGCA TCACCAGGTT CTGACGTTCG AGGTGGCGGC TGCAACATTC ACAGCTCGGA TGAAAATGAC AGAGAATCGT TCATTGCCAA GGAAATGATC AAGGATGCAC ACAAGCACAT TGAGTTTTAC CTAGAGGTTG TCCGTCCACT CTTGGATGAC CTTGCCCTAG TTTTTGATGA GCTCAATATG GACGATCCCA CCAAGGTTTA AGTCCACTTA ATGT
|
Protein sequence | METRAANHVA AEEHRCDLAK SSRMKKARSV DFGRFLRSSL RNPADSSTVV ASAFSQRAAR DNFSNCTSVF APSLSEQCSA TATTDNLLLR KGWPNGTAIS AASSFPPSAS SFPIGTWTLG KLSVLVLALA SLTDLALSHI YRLESPADFY DALSADVIRP PFRYKSSTSI ARTRSSLGSA AISSVTEFLS PILPFTGGLD LRREDYWTTS GSWLETLESL TRQIQEALFS SDDDRTSLLS SISLIRSGSL SNKMSPRVHT KGRNTGTFTK HVSSISAQKP FFSEDEIAEL SLGEVAQAFR YASESSSTDF NEDKFLNSLT TRVRRMILSI REAVSESRGI DVEDACVLTK NHRVNGSVDA LKFSAAMRIF AEWRILRQVP EGYKGYAVGM NLGHKDVVQN VAKIEQAVHS WLDNQRDLRS LSEIESQTGC PIIELRSSNL CSPTIRELMQ DEVDMDIHPT NRLPRLKEKT AAMGILWVRR QLHYQTGVFG NLLVVPESFP TTERAVASAY KEVYDKYHGW AVQKIFSYSF QSAPKAEEIY QHMNPERLKE VKAAADELVL HFDSESRCSA KGMNISPKGN LIDVILLNAS REFENLVEAF LQLVNSGMAS PGSDVRGGGC NIHSSDENDR ESFIAKEMIK DAHKHIEFYL EVVRPLLDDL ALVFDELNMD DPTKV
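The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to normalise such a display before measuring length or GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input is only the first 30 bases of the gene sequence shown above, so its GC fraction is not expected to match the 48% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three display blocks of the gene sequence in this record (illustrative only).
fragment = "CATCAATCGT CGCAAAACGA AGTCATCATC"

seq = clean_sequence(fragment)
print(len(seq))  # 30
print(f"{gc_fraction(seq):.1%}")
```

The same `clean_sequence` step applies to the protein sequence; after cleaning, its length can be compared with the Protein Length field of 675 aa.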