Gene PHATRDRAFT_48799 details


Gene Information

Locus tag: PHATRDRAFT_48799
Symbol:
ID: 7195107
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 266415
End bp: 269613
Gene Length: 3199 bp
Protein Length: 892 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183332
Protein GI: 219126161
COG category:
COG ID:
TIGRFAM ID:
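
The gene length in the record matches its coordinates if the start and end positions are read as inclusive: 269613 - 266415 + 1 = 3199. The sketch below shows that check, along with a way to strip the 10-character blocking from a sequence and compute a GC fraction. The function names are hypothetical helpers and are not part of the source database. It is also an assumption that the GC content field is computed over the gene sequence in this way.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Remove spaces and newlines from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the record above.
print(gene_length(266415, 269613))  # 3199

# The first two blocks of the gene sequence, used only as a short demonstration.
first_blocks = clean_sequence("GCTTCACACC AGGTTCGTTT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Running `clean_sequence` over the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field.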


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCTTCACACC AGGTTCGTTT CGATCGATCG GCGACACTAG TAGTGTCCAC GAAATGGACA 
AGACAGGTGA TGCAAATTTT CCTGCATGCA CAAACAACAA TTACCTAAAT ATCATTTAGC
TCGGAAGAAA GAGTCGGTGA AACTTAGATT TTGAGTGAGA CTCTGCTAGT TCAGCATTAA
CAACATTGCA TTGCCGAGAG AATTGCCTTC CGATTGCCAG AAACGTCCCA CTAGAGTTCT
CAAAATCTCA GGTTGACGAA ACCGGCATGG AGATTTTGAT TGATGTTCCA CCGTATTTGC
ATCCATACGT CGAATGGCTG CTGGATTGGA CCAGTTCTAT TTGGTCGTCT CTATCTATGT
ATTGGGCGAA TGTGGGCCAC TGTTGGAGTA TGCTCATTAC CAATGCGGAA CGAGTCTTTC
AGATCCCCCG GAACGCACTC GTACCACTTT TGGCTTTCAT TTTCTTCTTT TGGCCTATTC
TGCTCAGCCT CATCATGACC CTGGTCACTG CCTGGGCTTG GATCTTCTGG ATTGTCACTT
CTGTCTCGTT CGGCTTGATC CAACTCGTCT ACGTGTCCTA TCAGTTCATC ATGATCACCT
GCGATATTTT TGGACTAAGC TGTTTGAAAA CCTACAGTAT GCTACGCCAA CAGCTTTTGC
ATATTGTTGA CAAAACCACC AGTGCGGTGG GAGTAGAAAA CGCCACGCAC CGTCGGGGTG
GCAAATCCCG GCGTCGACAG TGGCGTCAAG ACGTGGAACA GGCGCAAACC TACGAACGCT
TTTTGCAGAT ACGCATCCAG TCCAAAGAAC ACGCACAGGT GATTTCGAAA CAGACTCTTG
TCAAGAAAGC GTCTCTGGAC ACGGCGTTGC CGCCGCCGAT TCCACGGAAT CGATCCTTTT
CCGTAGAACA CTCGCCAGCC AAACATGCTC TGAGGCGAAA TCAAAGTTTT GCTTCCGCCG
ATGAACGTCG AATCAAGTCG AAAGGATCGT CTTCATTCCA AAATCGCGGT ACTAGTATTG
ATGATTTGGA TTCAGTGGTC GTGGATGAAT TGGGCGAGAA ATTATCCGAT TTACTCGTGA
GCACAACGCG ACGCTTGCGC GAAGCCCGCC GTTCGGCACA GAACACACCC AATGACGCGA
ACGCAGCATC CTTGTGTTAC TTGCTTTCAG GCGTTCTTAA ACGTAATCAT TTACAATTGG
ATGATTTATT GATTGAAAAC GCCCGAGCTG TTGCCGAACG GGGTCAATAC GGCCTGACGA
ATGAATCGCG GAGTGTGGTC CGGGCCTATT TTCAACAAGT AGAGGAGGGC TTGGACTGGA
TTGCGGAAGC GCCTGTTCTA CAAAATTTAT CATCGCACCA GTGCTCAGAA GGTGAGAATG
GAAAGGAGGC AAAACATATG CACGAGTCAT CTCGAAGCAG CAGTGCGGAG CTCACGGGCT
GGGCCGAAAG TAGCAGTAAA CACAATGACC TTTTGGAACG TGTAACTTTG ATACGGAAGA
TGAAACAAAA TATGGGTCGG ACAGCGCTGA TGTTGAGCGG CGGAGGAGCA CAAGCCATGT
ACCACCTAGG TATAATCCGA ACTCTGCTCG AATCAAAACT ATACCAAGAT ATAAAGGTGA
TTTCGGGAAC GTCGGGAGGT AGCATTATTG CCGCAATGTG TGCTACTAAA ACGCCTGAGG
AACTTTATAA CAATATATGC ATTCCAACAG TGGTTGACGA TTTCACCAAA ACAGTAAGCC
ATTAATGTGA ATTTTTAAGG CCTGGTGGCT CTGGATGGTA TCCTAACTTT CCTATCTCTC
ACTCAGGGCG AGCAACGACG AGAGAATATT CGATGGTTTC CTCCGGTTAC AGAAATGGCA
GCATATTGGT TGAAGCACAA ACTTCTGGTG GACAGTGCAT ATTTTCGACG TACATGCGAC
TTTTACTATA GCGACATGAC TTTCGATGAA GCTTTCGAGC GGACAGGCAA GCACGTTTGT
ATCACTGTGT CGGCCAGCAG AGCAAGCGGT GGAACCGCGC AACGCTTACT CTTAAACCAC
ATATCCACTC CACATGTAAC TGTAGCAAGT GCGGTTGCTG CTAGCTGCGC GCTTCCCGGA
GTCATGGCCC CGGCTAAGCT GCTTGCCAAA AACAGCTCTG GAGTGTTGGA ACCGTTCGAG
GTTGATGGTG TTGAGTGGAT TGACGGTTCC GTTCAGGCTG ATCTTCCGTT CCAGCGAATT
GCAACTCTAT TTGCAGTATC GTCTTTCATT GTTTCACAGA CAAATTTTCA CGTTTTGCCA
TTTCTCAATA AAGAGTATCA TCCGAACCAA AAAAGCTTGT ACTGGCAGCT ATTTCAAACC
CTAGAATGGG ACATTCGAAG CCGTGCCCTC AAACTGAGCC GACTTGGACT CTTTCCTCGA
CTTTTCGGAC AGGACATCAG CAAGATCTTC AAGCAAAAAT ACTATGGAAA CCTGACAATC
GTTCCCCGCT TTACGACAAT GCAAACATTT GGTCTAAAAT CTCTTTCCAA TCCGACAATA
AAAGATATGG AGGGGTATCT CAAGTACGGC CAAATTGCTG CATGGCCCTA TCTAAACGCC
ATACGCGATA TGATCCGACT AGAAAAAGCT CTGGACGATT GTCTTATGCG CTTGGAAGCA
CGAGTTCGAG CGCTGAATCC CGACGTTGAC TGGCTCAACC CTGACGATGT TGAGTCTATA
GCAAGTTCGT CAGCTGTGTT TTCCAATTCT CGAGTACGAA TAATAGGACG ACCCCCAATG
GTTGATTCCG CAAGGCAGCG GGAAAGTGAT TTAGTTCGAA AACTCGAAGA CGAGAACCAG
GTGCTAAAGG AACAAGTACA GCGACTTCGA GCTGAACTGC TGGCACAAGT AGGCACTGAT
GAAAATGCCA ACAGCAAATT GGATGAATCA AGCCACTATC CCGTAGCTCA ACGTTATCTA
ATACAAAGCT CAGAAGGACG CCAACCATCA CTGAAGAATG AGCAAGAAGT ATTGACTCCA
AGAGGAGCTT TGATCTGACT TCCCTGGCAA TATCTCCATT GTCTAGTTTG TTACTCAACC
GTTTGCAAGA TCGACTTACT GTTAGTACCT CATCTGCCAC TGTGTCTGAT GTCTACATCA
GCATCGGTCT TTATGGCTTT CGATTTGTAA TCGAAAAATA GCTTTCAAGT TCGCCAGAAG
CTTACTGTTA GATCTCATT
 
Protein sequence
MEILIDVPPY LHPYVEWLLD WTSSIWSSLS MYWANVGHCW SMLITNAERV FQIPRNALVP 
LLAFIFFFWP ILLSLIMTLV TAWAWIFWIV TSVSFGLIQL VYVSYQFIMI TCDIFGLSCL
KTYSMLRQQL LHIVDKTTSA VGVENATHRR GGKSRRRQWR QDVEQAQTYE RFLQIRIQSK
EHAQVISKQT LVKKASLDTA LPPPIPRNRS FSVEHSPAKH ALRRNQSFAS ADERRIKSKG
SSSFQNRGTS IDDLDSVVVD ELGEKLSDLL VSTTRRLREA RRSAQNTPND ANAASLCYLL
SGVLKRNHLQ LDDLLIENAR AVAERGQYGL TNESRSVVRA YFQQVEEGLD WIAEAPVLQN
LSSHQCSEGE NGKEAKHMHE SSRSSSAELT GWAESSSKHN DLLERVTLIR KMKQNMGRTA
LMLSGGGAQA MYHLGIIRTL LESKLYQDIK VISGTSGGSI IAAMCATKTP EELYNNICIP
TVVDDFTKTG EQRRENIRWF PPVTEMAAYW LKHKLLVDSA YFRRTCDFYY SDMTFDEAFE
RTGKHVCITV SASRASGGTA QRLLLNHIST PHVTVASAVA ASCALPGVMA PAKLLAKNSS
GVLEPFEVDG VEWIDGSVQA DLPFQRIATL FAVSSFIVSQ TNFHVLPFLN KEYHPNQKSL
YWQLFQTLEW DIRSRALKLS RLGLFPRLFG QDISKIFKQK YYGNLTIVPR FTTMQTFGLK
SLSNPTIKDM EGYLKYGQIA AWPYLNAIRD MIRLEKALDD CLMRLEARVR ALNPDVDWLN
PDDVESIASS SAVFSNSRVR IIGRPPMVDS ARQRESDLVR KLEDENQVLK EQVQRLRAEL
LAQVGTDENA NSKLDESSHY PVAQRYLIQS SEGRQPSLKN EQEVLTPRGA LI