Gene PHATRDRAFT_49898 details

Gene Information

Locus tag: PHATRDRAFT_49898
Symbol:
ID: 7198602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 232208
End bp: 234640
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002184756
Protein GI: 219129144
COG category:
COG ID:
TIGRFAM ID:
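
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 232208
end_bp = 234640

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in a
# single stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2433
print(protein_length)   # 810
```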


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.412602
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTCAC AGTCTGAGCG GTCCCGTCGT TTACGGCAAG CTACAGGAAC GCCGCCGTCA 
ATATCTGCAC TGACAAAGGA ACACTCTCGT ATTGGAGTTC GACGCAGCAA GAATACGGGC
TCTTCCCCCA GCGCGTGGAT ATTCATGCTC TGTTTCGGTG GTTGCTTAAT TCTGCTACTT
CACTCTACAC AGCAGCTTCC GTCTTCGTCC ACCTACTACG AACCTTTAGA ATCAATGCTG
CCTTCATTGC AGTCCACCGT CGGCTGGGAC GAACAAAACC CGTTGGCCAT CCCCAAGGGC
CAGGCTCCAA TTCTACCATC GCTACGCACC AATGGTGTCG ACAACCAGCG GAAAGGGTAC
GGTGGACACG GAGACCAGAA GCACCTCGGT GGATTTACGG AATACGACGG CATGGGCGTC
AGTCCTCACA CTTGGAAACA CATGATCCAA GACTACGGGG TGCATTCGCT TTTGGATGTG
GGATGCGGAC GCGGTACCTC CACGGCGTGG TTTCTCATGC ACGGCGTCGA TGTTCTGTGT
GTAGAAGGGT CACACGACGC GATCGAACGG TCCGTACTTC CGGATCCCGC AACCCTAGTG
GTGGAACACG ACTTTTCGCG AGGACCTTGG TGGCCCGCCA AGACCTACGA TGCTGCCTGG
GCCGTCGAGT TTCTGGAACA CGTCAACGTC CAGTATCATT TCAACTACGT TACGGCCTTC
CGCAAGGCGG CACTGATTTT TGTTTCCACT TCTCAGTGGG GAGGCTGGCA TCATGTCGAA
GTACATGGAG ACGAGTGGTG GATTCGAAAG TATGAAGCCT ACGGGTTCAA ATACGATGCA
AGCCTTACCA CGCAAGTCCG GAAATGGGCC AGAGAAGAAA AATCTTGGGC GAACAACACC
GGTCCAGACG GAAAGCCCCA CAATGCTCAG CATATTTGGC TGTCTATGAA AGTGTTTGTA
AATCCCATTG TCGCGGCTCT TCCGCAGCAT GCTCATTTGT TTCCGGAGGA CGGCTGTTTT
CTGCGAAGGG GCGACGATGG GGAAATCCTT CACAAGGAAT GCGGAACTGG CAAAGATGCA
GGATTGGAGA CACCATTGGC GCAGGGATTT CGTCCCCTCG CTCTGAATCC CGCGATGGAC
CAGAGATGGC TACGGCACAT TCAAAAGCAT GCCTCTCGTC TTCATGAGGG GAAAGAGAAG
GATGAAGCCA GTCCTGATGA TGACGAGACA AGCGCCATAC AAAGTGAGCC GACCAGGCCG
GCAGATTTGC TCCGAAAAAT TGATGAGAAA AACATTACTG ATATACTTCC ACTTCATGTC
GTCGCGTGGC CGTACTTGGA ACACGGCATC AGGACGGCTG AACATCAGCA CATCGAGGAA
AACGGTATTA GCGAGTCGTC GTATTTGAGG CTTAGCGAAG ATATGTTGGA TTTTCATCCC
AACGTTGTTT GGCTTGGGGA CATTGGTTGG GGCTTTCCTT GGAAAGAATG GTGTCAGGAA
TACACCAAAC ACATCAAAAA GGCCAAAAAT ATGCGGCGCG AGAAAGGGCT GCCGGAGCAG
TGGCCAATCT TCATTGCCGC CTTCACCGAT GGACCATCTC TACCGAGATG CCAAAATGTA
GAAGCTGAGG TTGGTAAAGC GAACGTTCGC TACACAAGTC GGTCAATAGT GCACAATCGG
CGCTGGAATG AAGCAAAGAA ATGGGTGGAA ACGGGCGAAA AGTTGAATAT GATGAAGAGC
GGTATCATCT ACCGGCACAC GCCTCTGGTT GTTCGAACTG ATACGGTGAA GTTTTTGGAA
GAAGCCCTGC GGAAGCGCAA CATGACGCTG GCTGATCCTA TAGAGCGTTT GCAACGCGAC
GTAGATGTAG CTCATTACTG GCCTCATCAA AGAGACCTAG ACAAGGTTGG TACAGTTGGA
TCGCTTTTAC GCCAGGAGAT CAGTAAGCTT CTTTTTGCTT TTGGGAAAAA TACAAATTTT
AACGTTTTTG TTGGACTGAA GGGCGAAGCG GTTCGCAAAG GTCGTCGTGG TGTCGCATCT
GATTATATTG AGTCCTTGTT GGAGACCAAG ATTGTTGTTG TCTCACAAAG GGATCGATGG
GAAGACCACT ACCGACTTAT GGAAGCTCTG GTTGGTGGCG CTTTGGTTTT GACGGATCGC
GTTCTGGGAA TGCCGGCAGG TCTAGAGAAT GGCACTTCGG TTGTTGAATA TGAGAGCGCG
GATAGTTTAT TGTCTTTGAT CCAGTACTAC CTTACGCACA CGGAAGAGCG GCTCTCCATT
GCCCGCAAAG GAAGGGAAGC TGCAATGAAG AAACACAGAA CATGGCATCG TATCGAAGAG
ATCATTTTTG GCGAGAGCTT GTCGGATTGT AGGTTTCAAG GCTTGGATAG CCCTTGTCCG
TATGTTGTGC ATGGCGTCGA GTCAAAGCGC TGA
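
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The sketch below shows one way a GC percentage like the one reported in the record could be recomputed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full listing to
# recompute the value for the whole gene.
first_line = "ATGGAGTCAC AGTCTGAGCG GTCCCGTCGT TTACGGCAAG CTACAGGAAC GCCGCCGTCA"
print(len(clean_sequence(first_line)))
print(gc_percent(first_line))
```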
 
Protein sequence
MESQSERSRR LRQATGTPPS ISALTKEHSR IGVRRSKNTG SSPSAWIFML CFGGCLILLL 
HSTQQLPSSS TYYEPLESML PSLQSTVGWD EQNPLAIPKG QAPILPSLRT NGVDNQRKGY
GGHGDQKHLG GFTEYDGMGV SPHTWKHMIQ DYGVHSLLDV GCGRGTSTAW FLMHGVDVLC
VEGSHDAIER SVLPDPATLV VEHDFSRGPW WPAKTYDAAW AVEFLEHVNV QYHFNYVTAF
RKAALIFVST SQWGGWHHVE VHGDEWWIRK YEAYGFKYDA SLTTQVRKWA REEKSWANNT
GPDGKPHNAQ HIWLSMKVFV NPIVAALPQH AHLFPEDGCF LRRGDDGEIL HKECGTGKDA
GLETPLAQGF RPLALNPAMD QRWLRHIQKH ASRLHEGKEK DEASPDDDET SAIQSEPTRP
ADLLRKIDEK NITDILPLHV VAWPYLEHGI RTAEHQHIEE NGISESSYLR LSEDMLDFHP
NVVWLGDIGW GFPWKEWCQE YTKHIKKAKN MRREKGLPEQ WPIFIAAFTD GPSLPRCQNV
EAEVGKANVR YTSRSIVHNR RWNEAKKWVE TGEKLNMMKS GIIYRHTPLV VRTDTVKFLE
EALRKRNMTL ADPIERLQRD VDVAHYWPHQ RDLDKVGTVG SLLRQEISKL LFAFGKNTNF
NVFVGLKGEA VRKGRRGVAS DYIESLLETK IVVVSQRDRW EDHYRLMEAL VGGALVLTDR
VLGMPAGLEN GTSVVEYESA DSLLSLIQYY LTHTEERLSI ARKGREAAMK KHRTWHRIEE
IIFGESLSDC RFQGLDSPCP YVVHGVESKR
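
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code; the function name `translate` and the use of `X` for unrecognized codons are illustrative choices, not part of the record.

```python
# Standard genetic code (an assumption; the record does not state the table),
# written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start and end of the gene sequence listed above.
print(translate("ATGGAGTCAC AGTCTGAGCG GTCC"))  # MESQSERS
print(translate("GTCGAGTCAA AGCGCTGA"))         # VESKR
```

Both fragments agree with the first and last residues of the protein sequence listed above.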