Gene PHATRDRAFT_21982 details


Gene Information

Locus tag: PHATRDRAFT_21982
Symbol:
ID: 7202993
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 285730
End bp: 288271
Gene Length: 2542 bp
Protein Length: 641 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182362
Protein GI: 219124126
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATTT TACTTGAAGC CGGTACGGCT AATTTCTTTG CAACATTTCT CAGAAATACA 
TGCTACTTGA ATTCTTCGAT TCAATGTTTG AGTCACACAC CAATTTTTCG GGAATACTTT
ACGTCCAAGG CGTACCTCAA TGACATCAAT ACCACCAATC CACTAGGCCA CCAAGGGCAT
TTGGCTCAAG TCAGTGCAGT GCTCATTAAT TCGTTATGGA AGCAGTTCAA TCAAACTCCT
CAGGTACCTC TTCGTCGTGT CCGGGCACCT GGCTCGTACG CAATGGTCAA CGCGCCGTCT
CTGACGCCAA AGACATTCAA GGATTCGCTA GGCAAGTTCA ATGATCATTT TGCTGGCAAT
GAGCAACACG ATGCACAAGA GCTGCTCGCC TTTTTACTCG GTGGTCTTTC GGAAGATTTA
AACCGAATAA TGGACAAGCC ATATATTGAG GCACCGGACT CGGACGGCCG ACCGGATCAC
GAGTTAGCTG ATATTTGGTG GACAAATCAC TTGAAACGAG AAATGTCAAT CATCGTAGCT
TTATTTACCG GTCAATACAA GAGCTTGTTG ACCTGTAGAT CCTGCAAATA CGAAAGCGCC
CGCTTCGAGC CATTCTCGTT TTTGCAACTT CCGCTGCCTG AGGATGATCA GCTGACAGTT
TCCCTGGTTG TGTATCCGTT GAAAGACGGT ACGGATACGC TGAAGTATTG TGTTCGTGTC
AACAGTGATG GAAAGCTTCG CGATGTGTTG CTGGCACTTG CAATGTTACT GTATGTTGAG
CAGAATGGGA AGGCCGTGTC GTCAAATTCA GCAGCAGACG AGGAAAGCGA AAAAGAAAGG
AGTGAAAGAG AAGCTTTATA CCAAAAAATG GCACAGAATT TCTCTGTTGT GGACATGCGG
GACGGTTACA TTTCAAAGAT AGCACCGGTA CGTATTCCCA AATGCGTTTA AGGCGAAGGT
GATGAAAAAG ATTTTGTGCT GACAGTCCGT TTGATTCTTG CAGAATACAT GGTCTCTACA
AGACCTCCAA AACAAAGAGA CCGGAGACTT ACCCCTCTTA CATGTTTACG AACTGGAAAG
TCCGATTGAA GACTCGCCGC TGCAAGGTAA TGCAATGACA GAGAGCGACG GTGTGTCAAC
AGATGAAAGT TCGGACGGCG AGGTTCTCTT CGTCAAACCC CGGGCATCCT TTTTGGCAAT
TGCACAGCGG CGCTCGGAAC TTGTATCGCA AAATTCCCTG CACCCTTTGG CCCATCGTGT
TTTTGGGACT CCAATTCTGA TGCGTGTGAA TGACCTTCAA ACTTGTACTG GGCGTGACTT
ATACGACCTG ATTGCAGCAC GGGTTCGAAA CGTCGTACCC AAACAAGCGA TTCGGTTCCT
TTCCGAGATT TCTTCCTCTA AAAAAACTGT CAATCTCAAA GAGCAATCAG TTGAACTCAC
CAAAACAGGC AAACGACAAT CCGTCGGCAG AACGACAACA GACATGGAAG AAGTGTCTGC
TGGACCTGTC CCTCGCTATG GATTCCGATT GCGCATTACG TCTCGTGATG GGCGTCGCTG
TCTCATTTGT CGCTGGTTCG ACTGCTGCGT GGGTTGTCTC ATTCCAGATG ACGATGAATT
TACCACTGTG TTGGACGGCG ACAGCATAGT GGTTGACTGG CATTTTGCGG TGGATTTAGC
AACAGGTGGC TTTGGGCAAC GATTGACGCA ACCGGGGAGC TCGGCATCAA ATACGCAACA
AACATTGGCA CGCACGCGTC ATTCAACAGT GTTCGTTAAG AATCACAGTT CTTGTGGCGG
AGTAAAGGGC AACCACGCTG GTTCAATAAC GCTGGAGCAA TGTTTGGACG CATTTGCCGA
GGAGGAAAAG ATTCCGGAAG CCTACTGCTC ACGGTGCAAA GACTTTCGTG TACAAACGAA
ACGCATGAGT CTTTGGCGAT TGCCGCCGGT GGTGATCATC CAACTGAAGC GATTCCAATT
TACGCAACAT ATGCGTCGCA AGCTTCGTGA TTTGGTTGTC TTTCCCATAG AAGGTTTGGA
TCTATCACGC ATCATGGCTC CGGACTCGGT TGCTCCCAAA ACGGTCCTGA AAATGGAGAA
CGACGCTGAA TCCAATGGTG AAGAGAGCAA CGGTGATACG CATATCGTGG GGCAGGATAG
GCAGACCAAG GATGATGGTC GTTCTGAAAT GCTGTACGAT TTGTACGGCG TAGTGCACCA
CCAAGGCGCT CTCTCGGGTG GACATTACGT AGCCTCGCTC AAATCGGAAT TCGATGGTCA
GTGGCGGCTG TTCAACGACG CACAGATTTA TGAGATTCAC GATCGCGATG TAGTAGATGC
GAGCGCGTAC ATTTTGTTCT ACATTCGTCG GGACGTTTCG AAGGCACATC TTTCCAACTT
TTGGGAGACT TCGAAGGAAG GGACATTAAG CGAGGAAGAT ATGGATACTC TTCTCAAGGG
CCGATCTGAT CGCTGCGTCA TTAGCTAAAA TAAGTTAAAA GATAAGGCTG TCGAGTATTG
TAGTAAATGA AATCCAATAT TT
 
Protein sequence
MLILLEAGTA NFFATFLRNT CYLNSSIQCL SHTPIFREYF TSKAYLNDIN TTNPLGHQGH 
LAQVSAVLIN SLWKQFNQTP QVPLRRVRAP GSYAMVNAPS LTPKTFKDSL GKFNDHFAGN
EQHDAQELLA FLLGGLSEDL NRIMDKPYIE APDSDGRPDH ELADIWWTNH LKREMSIIVA
LFTGQYKSLL TCRSCKYESA RFEPFSFLQL PLPEDDQLTV SLVVYPLKDG TDTLKYCVRV
NSDGKLRDVL LALAMLLYVE QNGKAVSPIE DSPLQGNAMT ESDGVSTDES SDGEVLFVKP
RASFLAIAQR RSELVSQNSL HPLAHRVFGT PILMRVNDLQ TCTGRDLYDL IAARVRNVVP
KQAIRFLSEI SSSKKTVNLK EQSVELTKTG KRQSVGRTTT DMEEVSAGPV PRYGFRLRIT
SRDGRRCLIC RWFDCCVGCL IPDDDEFTTG NHAGSITLEQ CLDAFAEEEK IPEAYCSRCK
DFRVQTKRMS LWRLPPVVII QLKRFQFTQH MRRKLRDLVV FPIEGLDLSR IMAPDSDDGR
SEMLYDLYGV VHHQGALSGG HYVASLKSEF DGQWRLFNDA QIYEIHDRDV VDASAYILFY
IRRDVSKAHL SNFWETSKEG TLSEEDMDTL LKGRSDRCVI S
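
The derived fields in this record (gene length, GC content) can be checked against the coordinates and the gene sequence. The Python sketch below is one minimal way to do that. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the listed length (288271 - 285730 + 1 = 2542). The helper names `clean_sequence`, `span_length`, and `gc_percent` are hypothetical and are not part of any database tool. Because the gene is spliced, the 2542 bp gene sequence includes introns and is longer than the coding length implied by the 641 aa protein.

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(raw.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demo on the first line of the gene sequence from this record.
first_line = "ATGCTGATTT TACTTGAAGC CGGTACGGCT AATTTCTTTG CAACATTTCT CAGAAATACA"
seq = clean_sequence(first_line)
print(len(seq))                      # 60 bases on a full line
print(span_length(285730, 288271))   # 2542, matching the listed gene length
print(round(gc_percent(seq), 1))     # GC percentage of this one line only
```

To check the whole record, pass the full gene sequence block to `clean_sequence` and compare `len(...)` and `gc_percent(...)` with the Gene Length and GC content fields.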