Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46978 |
Symbol | |
ID | 7202221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 70979 |
End bp | 73846 |
Gene Length | 2868 bp |
Protein Length | 890 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181130 |
Protein GI | 219121556 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0132592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTTCGAGT AATTACAATA CAATCGGGGC CTTACAAGGC TTTCCCCTTT ATTCATCTTA CTCTCTCATT GTATCAACAA TACATTGCGT GAGTATAGAA GAATCCGTTA GTTTATCAGA CCCAGCAATC ATCGCTGGCG TTGCTCTCGC CAACGTTCTT ACTTTACATG ATTGTCTCTC TTTTTGAACT GAGCCATGGC CCGAGTTCGC AAGGCAACCG GTCCTACCCG GAAGGGAGCG ACCGAAACGG TGCCGGAGGA GCGAGTGGAA GAAGAAACGC CCTTTGAGGC CGTTGAGTCG CCGTCCAAGG ACAGTGACAA TGAGACGCAA CCATCGTCCA TGGGCGATGA TGATGACTCA CAGTCTGAGA TCGAGTCGTA CAAGATTGAT ACCGACATTG ATTTCAAGTA CAACCCAAAC TTTTTTGAGG ACAAGAAAGC CCTTGAAAGT GTTCTAAGGA ATACTATGGG ATTTGGAGAT ATCCATGTGA AGTCACTCCA AAACGAAGGT TTGAAGACCG CAAATGATTT CTTGCTTATT TCTATGAGTG ACATCAATGA TCTTTGCGAC AAGCTTTTGT TTGCAACAGT TTACAGGGCT CGCCTACGGG CATTTGCTAC ATGGTTACGT AGTCAACCTG ACAACGTAAA TATTACCCAA GAATGGACAA TTCCAGTTAT GCAATTGGAA ATGCAGATGA AGGCGCAAGC GTCTCCATTT GGAACCTCCG AGACCAACAA AACAGACAAG TCAGTCTCCA GTCTGGTGCC TGATCCCTTT GATGGTACAC AGAAGAAGTG GCTCGCCTTT CGATACATTT TTGAGGCATG GGCCGGAGCA AGTGGGCAAT CTTTTGATGC CTGCATCTCA CATGACTCGG AGCGATATTC CCGTTCAGAA CCAACAGCGA CCTCTAATGA CATCAATGAC GAACCTGATT CATTTAAATA TGACTGGAAC GTTAAGTCAG TTCGCAATTC AAACATCTTT TTTATGCTCA AGTCGCTCAC AAGCGGCGGA GATGCATGGG GCCTTATCGA ACCTTACGAG GTTTCAAAAA ATGGCCGTCA TGCCTGGATT GCCTTGTGTG CGTTCTATGA AGGGGCCAGT CAGGTGGGCT TAACCACAGA AGAAGCTCGC ACTACAATTC TGACATCGAA GTATACCGGA CAATCCCGGA ACTTCACTTT TACCAAGTAT GTTCAAAAGC ATCTTACTGG TAACAACATA TTGGCTCGCA ACAAAGAGGC CTACACGGAC GCACAGAAAA CAAACTTTTT CCTACGGGGA ATTGTTGATC CTGAACTTAT GGCATTCAAG GCAGCTGCTG AAGCTAACCT AAATGAATGG AAGTTCGAAC GCGTAGTCAC GTACATGCGT ACTCAAGCCG CCAAGCTCAC GAGCAAGGAC GGTAAGGATT CCCGAAACAT TCGTCAGGCT ACGGGCTTGT CGAAAAACAG GAACAACAAA AACAACCGGC GCAAGCGCTC GGAATACCAA AGCCAAGGCA AAGGTAATAA AGAGTCGGGC AAAGGAAACA ATGCTCCTAG TACTCAACTC CGCAAGGACA TCTGGGATGA ATTGTCTCCC GAGATAAAGG ATGCCATCAA AGCGGCAAAG CGTAGAGCGT CTACGGACCC GCGCACGGCT AAAAGAGCCA AGACTAGTAG TACGGATAAC TCTAACGCAA GCGTTGAGTC CTACTTGCCT GATTTAAGGT CAATGTCTAC TGAAATATTT AAAGCAGATG ATGACAAGGA CTTGGCTTCA GGCCAGCCTG AGGCGAAAGA TACACCACTT CATTTGGAAC TTGAAGATAC GCTTAAGAAA CCTACATATG GAGCAGGTAC CCTATTTGGG CGATCTGCTG ACAGGGTCTC CTTTAATCGT ATGGTATGCA GTTCAGAAGA AAACAAAGTC ACTCCTTGGC GCATGTCAGA ACTACGGCTT GCGGATGCAA CAATAAGACG CATTTGTAAG AATCGCACAC GAAATCCTAC CGGCCGTTCA ACATGGGGCG AAGCTGCCAT TGATACTGGT GCCGACACAA TTTGCATTGG TTCAGGCTAT ACTGTACTTG CCCATACAGG TCGATATGTG AGTCTGCGAG GTTTTCATGA CAGTGGTGAT ACTCTTGATC GAATTCCAGT TGTGACGGCT GCTACAGCAT ATGACTACGA TGACGGAACC ACCATTATTC TGGTTTTCCA TGAAGCTTTG AATCTTGGGC CTACACAGTC CACATCTCTC ATCAACTTGA ATCAGATTCG GCACGCCGGA CATCAGACTG ATGACATTCC GAAGTTTTTA TCCCAAGGGA AATCTTTTCA CGGAATTGAA ACAATTGATG GCGACTACAT TCTTTTTGAA TTGAAGGGAC GCACATCATT GTTGTACTCA CGAGTACCTA CTCGCCATGA GCTTGAGAAC TGCCTGCACA TTGATCTTAC ATCTGATCAA CCCTGGGATC CAAACAGCAA AGACTGGGAG GATAATGAGC AGCGCTACAC GCGTCATGAC CGACAACGGA ATGCACGCTA TACCGCAACT GATAATGCGG ATGAGGAGAA CTTTTACCAT GGGTATTTCT CTCTCCCTGA CTCTAAGGAG TTCCCGGTTC TACCGGCAAA CAATAATGTT ATGAACCCAC ATGATGTCGT ACGCGAGATC AAATATGCTA CTGCACGGGT TTCAAAATCT AGCCCACGGG ATCTAGATGT CGATCGAGAC AAACTTCGCC GCATCCTGGG TCATGTTCCT ATGGAAGTAG TTGACCGAAC ACTGGAAGCT ACAACACAAC TTGCGGAACG CTCTGGCAAA ATGCTACTGC ATCGACGTAG TTTGAACAAT TGCGATACCG CCGGTTGA
|
Protein sequence | MARVRKATGP TRKGATETVP EERVEEETPF EAVESPSKDS DNETQPSSMG DDDDSQSEIE SYKIDTDIDF KYNPNFFEDK KALESVLRNT MGFGDIHVKS LQNEGLKTAN DFLLISMSDI NDLCDKLLFA TVYRARLRAF ATWLRSQPDN VNITQEWTIP VMQLEMQMKA QASPFGTSET NKTDKSVSSL VPDPFDGTQK KWLAFRYIFE AWAGASGQSF DACISHDSER YSRSEPTATS NDINDEPDSF KYDWNVKSVR NSNIFFMLKS LTSGGDAWGL IEPYEVSKNG RHAWIALCAF YEGASQVGLT TEEARTTILT SKYTGQSRNF TFTKYVQKHL TGNNILARNK EAYTDAQKTN FFLRGIVDPE LMAFKAAAEA NLNEWKFERV VTYMRTQAAK LTSKDGKDSR NIRQATGLSK NRNNKNNRRK RSEYQSQGKG NKESGKGNNA PSTQLRKDIW DELSPEIKDA IKAAKRRAST DPRTAKRAKT SSTDNSNASV ESYLPDLRSM STEIFKADDD KDLASGQPEA KDTPLHLELE DTLKKPTYGA GTLFGRSADR VSFNRMVCSS EENKVTPWRM SELRLADATI RRICKNRTRN PTGRSTWGEA AIDTGADTIC IGSGYTVLAH TGRYVSLRGF HDSGDTLDRI PVVTAATAYD YDDGTTIILV FHEALNLGPT QSTSLINLNQ IRHAGHQTDD IPKFLSQGKS FHGIETIDGD YILFELKGRT SLLYSRVPTR HELENCLHID LTSDQPWDPN SKDWEDNEQR YTRHDRQRNA RYTATDNADE ENFYHGYFSL PDSKEFPVLP ANNNVMNPHD VVREIKYATA RVSKSSPRDL DVDRDKLRRI LGHVPMEVVD RTLEATTQLA ERSGKMLLHR RSLNNCDTAG
|
| |