Gene PHATRDRAFT_36106 details


Gene Information

Locus tag: PHATRDRAFT_36106
Symbol:
ID: 7201173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 549445
End bp: 551928
Gene Length: 2484 bp
Protein Length: 827 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002180459
Protein GI: 219119395
COG category:
COG ID:
TIGRFAM ID:
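
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS length includes a terminal stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 549445
end_bp = 551928
gene_length_bp = 2484
protein_length_aa = 827

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS (the record says
# "Is gene spliced: No") whose last codon is a stop codon, so the
# protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks print True for the values listed in this record.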


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAG CCGAAAGCGT TGCTTCAGCA GCTAAACTGA TTGCTTTGGT TGTATTGATT 
CTTACAACTG CCAATATGCT GGCATTGCGG TCATTTTTCT GCTTGCCTAT GCCTGGCGAT
GGTTATGATG AGTACCCAGG GAGGCCTGTT GAAGCTTCCG GCAAAGAAGG CGCTAAAAAA
CAAAATATTG ATGTATCCAT CGAGGGCATA GATCCCTTTT ACACGAGTGA TGAAACACTG
GAATCGATAA CAAGAAGCAA GCAAGCCAGC GGACAAACAA CTTCCTCTTT ATCAAACGGA
TCGTACACTC GGTCTATGGT CGTTGTACGG GCCATCGGGA ATCCTCTTTC CTCGACCAAC
TCATCGAACA AACCACTACG GATTCTCGAA TATATTCTTG AAAATGAGCC TGCCTTTCCC
AATACAACCC GTCATTGGTT TTTGAATCGA ATTACTGATG CACAGGTCGA GAAGGCAATC
GTGCAACGCT TAAAGTCTTC TAATGAAACG TACACCATCA TCCCCTTTTC GCTTCGCGAA
TACGACAACG TTTTGTATTC TTTCGACAGT AAGGATACAA TTCATTCGAA GCACTATATC
AAACGCTCGG CGGAAGAAAA GGCTGCGACG CTTGTTGAAG AAACTGTGCA ACAAAGCAAA
ATACGATATG TTGCCAACAT CAACGGCGCG AGAAATGCCA TGCTACGCTA CGGCAAAAAG
AACAGCGCTG CGGAATACAT TCTTCCATGG GACGGACACT GTTTTCTCAC GCGGGAAGCT
TGGGATGCAA TCCAGTTTTC TATGCAAAAG TATCCGGAAG CAAAATACTT TACTTCACCC
ATTCGTTTAC ACGAAGGATC TATCGATGCC AATGTCACAG AAGAGCCACA AATTGCATTT
CATCGTACAG CCCTGGCACA ATACAACGAA TATCTTCGGT TCGGCCGCCG CGACAAAGCA
GAGTTATTGA CCCGGATTGG TGTCAAAGGA CAATGGGACG AGGGCATTCC CTGGGAAGAC
TGGGAACTTG ACTTCGTAGA ACGTGAGCAA GCAGCCGATT CCGTTAGCGG TGTTCCAGAT
GCGGGTTGGG CTGCTTGTGT GCATTCTGGA ATTGAAAATG GCGGCAAACT AGGAACAAAA
ATACTAAAAC TGAACCGACG CGGTCATAAC CTATCTGCTC AGTTGGTTAG GTTGGATATC
CGAGCGTCAC AGGAGCTTCA TGGGCTCTCA TCTTCAACAC TGTTGTACTA CAAGGAGAGC
CAACTGGCAG AGGAGAGGGA GCTTTGGAAA GCAGGCAAGC GATTGCCCCT TGTGAAAGAG
CTTTTGGAAT TGGCAAATAA AGCATTGTCG TTCGGACCAT GGTCAGTAAT GGACAAGCGC
GGTTTCGGTT GCGGCGTTTC CGGTGACTGT CACGATTACT TCCATGTGGC GCCATATCAA
TGGCCTACGT TGAACAGTAC AGGATATACG GACTACTCGA AACCATTTGT TCTACGAGAT
GGTGAGCGTG CGCCTGGAAC CGTCGCATTT AGTGAAGGGA GCGAAAAATA TGATCGGACA
AAATTGCTGG CAATGAAGTT TAACACTACT GTCTTGGCCT TGGCGTACTC GGTAACTGGA
AACATTACGT ATGCCCGCCG GGCAGCGGAG AACCTCAGAC ATTGGTTCAT CTACAATGAG
ACAAGAATGA ATGCAAACAT CAACTTTGCA CAGATTAAAT GGGACGTAGA AAAACGAGAA
ATGTTTGGGT CTCCGTGTGG CTTAATCGAA ATGAAGGATC TATATTTCTT TTTGGATGGG
GTGAAACTCA TTGAGAAGTC TGGCGCGCTA TCTGAATCTG AGATTGATCA GCTACGCAAT
TGGTTTGCAA ACTATCTTCA GTGGTTGCTT TCCAGTGAAC AGGGTAGGTG GCAGATTTCT
GCCAACAACA ATCACGGTCT GTTTTACGAT GTTCAAGTTG CGCCACTTGC TCTGTATGCT
GGTAATCTAC CTCTGGCAAT TTCGCGAATG CAACGGTCGA TTTCGCGCAT TCGACGACAG
ATGAATGCCA CCACGGGAGC CCTACCACAC GAATTGAGAC GGCCAATATG CGAACACTAC
CAGGCATTTA CGCTCCAGGG ATGGATCACA ATGGCAAGAA TGGCGGAAAA GATCGGTTTG
AACTACTGGA AACGATTTGC AGATTCAGAT GCCCCAAACA AAGAGACGGC CCTTTGTCGA
GCAGTGCGCT ATGCAAACCC GTATCTAAGT CGTCGCGCCG TATGTCCCGG CAATATTGAC
GGTATCGACG CGCGGCGCTG GCAACCAATA CTTTTGGACG CACTGCACCA CTGTCCTATG
CTAGACTACA AATCGACGTC CGGACAAAAC AACGTCCTGA TCCCCCCTGA ACTGATAGAT
CCGCCTTTGA ATCACTACGA GATTACTGGA TTGTTCAATA TGTCGGACGG GATAGGACCT
TTCTGGAACT TGGGACTGTA CTAG
 
Protein sequence
MRRAESVASA AKLIALVVLI LTTANMLALR SFFCLPMPGD GYDEYPGRPV EASGKEGAKK 
QNIDVSIEGI DPFYTSDETL ESITRSKQAS GQTTSSLSNG SYTRSMVVVR AIGNPLSSTN
SSNKPLRILE YILENEPAFP NTTRHWFLNR ITDAQVEKAI VQRLKSSNET YTIIPFSLRE
YDNVLYSFDS KDTIHSKHYI KRSAEEKAAT LVEETVQQSK IRYVANINGA RNAMLRYGKK
NSAAEYILPW DGHCFLTREA WDAIQFSMQK YPEAKYFTSP IRLHEGSIDA NVTEEPQIAF
HRTALAQYNE YLRFGRRDKA ELLTRIGVKG QWDEGIPWED WELDFVEREQ AADSVSGVPD
AGWAACVHSG IENGGKLGTK ILKLNRRGHN LSAQLVRLDI RASQELHGLS SSTLLYYKES
QLAEERELWK AGKRLPLVKE LLELANKALS FGPWSVMDKR GFGCGVSGDC HDYFHVAPYQ
WPTLNSTGYT DYSKPFVLRD GERAPGTVAF SEGSEKYDRT KLLAMKFNTT VLALAYSVTG
NITYARRAAE NLRHWFIYNE TRMNANINFA QIKWDVEKRE MFGSPCGLIE MKDLYFFLDG
VKLIEKSGAL SESEIDQLRN WFANYLQWLL SSEQGRWQIS ANNNHGLFYD VQVAPLALYA
GNLPLAISRM QRSISRIRRQ MNATTGALPH ELRRPICEHY QAFTLQGWIT MARMAEKIGL
NYWKRFADSD APNKETALCR AVRYANPYLS RRAVCPGNID GIDARRWQPI LLDALHHCPM
LDYKSTSGQN NVLIPPELID PPLNHYEITG LFNMSDGIGP FWNLGLY
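
The gene and protein sequences above are shown in space-separated blocks of ten characters. The following sketch shows one way such text could be cleaned, its GC content computed, and the nucleotide sequence translated for comparison with the listed protein. It assumes the standard genetic code, because the Translation table field of this record is empty. The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical and do not come from the source database.

```python
import re
from itertools import product

# Assumption: standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Drop spaces, line breaks, and digits; return upper-case letters only."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def gc_content(text):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text):
    """Translate in frame 1 and stop at the first stop codon.

    Codons that are not in the table (for example, ones containing N)
    are rendered as X.
    """
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 24 bases of the gene sequence listed above.
print(translate("ATGAGAAGAG CCGAAAGCGT T"))
print(translate("TTCTGGAACT TGGGACTGTA CTAG"))
print(gc_content("ATGAGAAGAG"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would be the natural next step; the short fragments here only illustrate the approach.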