Gene PHATRDRAFT_21970 details


Gene Information

Locus tag: PHATRDRAFT_21970
Symbol:
ID: 7202974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 198131
End bp: 199793
Gene Length: 1663 bp
Protein Length: 495 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002182346
Protein GI: 219124093
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00490278
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCTACTGTTG ATATCAGTGA AGACAAAAAA CTAATGCTGG CAGTCATTGG CTCGAACGAA 
ACATTGCGCT CTCTCGGTCC CGAGTCAAAT GATATAGCCC ACTTCGAGGC GACTGGACTC
GATGGCGGAG ACGAGTTCAC GTTCAAGGGA TGGGTGCAAC AAAAGCTCTT CCTTGGTATT
GAGCCATCTC CAGATATTAT AGCGATCGCA ACGATATACT TTGTGGAAGG CGCTTTGGGT
TTGGCGAGAC TGGCACAAAC GTATTTGCTC AAGGACGAGC TTGCACTTGG ACCGGCGGAA
ATGTCGGCAC TCACGGGGGC ACTGGTGTTA CCCTGGACTA TCAAGCCACT CTATGGATTT
TTGAGTGATG GATTTCCATT GTTCGGGTTT CGGAGAAAAA GTTATCTGAT TGTAGCTGGT
CTTAGTGGCG GGCTCTCTTA CTCTGTCCTT AGTTTGTCTG GCTTTTGGGA AAGCCTCGAC
AAGGGCGTTG CCATTAGTGG TACTGTTGGT GCATTACTAC TGAGTAGCGC ATGCATAGCC
ATGTCAGATG TTGTGGCCGA TGGAATAGTC GTGACTCGGA CTAGGCAGGC AAAAGATCCT
GCAATAGCAG GCGGTCTTCA GTCTCTGTGC TGGGGATCGG CGGCTGTCGG AGGTTTGTTG
TCGGCGTACT TTTCTGGAGC TTTGTTAGAA GTTATGTCTA TTCGGAGTAT CTTTGGTATT
ACAGCTGTGC TGCCATTTAT GGTCGCATTG ATAGCGCTTC AAATGGAAGA GAAGCCTTAC
GTAAAGGAAG AAGGGCACGA AGGATTGGTC ATGGGTGTCA AGGACCAAGC GAATGCTCTT
TGGGAGGCAC TCAAACAGCC TTCCATTTGG AAACCTACGC TGTTTTTGTT TTTATGGCAA
TCAACACCAA CATCCGATGG TGCGTTCTTT TATTTTATGA GCAATGATTT GGGTCTGGGA
CCGGAATTTA TGGGACGTGT TCGACTGGTT ACATCGCTCG CCACTTTGGG CGGAGTTGTC
GTATACAACC AATATCTGAA ACGAGTGCCC ATAAAATCCA TTTTGTTTTG GTCTACAATC
GCATCCTTCC CGCTCGGCAT GCTGCCCGTT CTACTTCTCA CCCACGTGAA TCGCGAATTG
GGTATTCCCG ATCAGGCCTT GATTTTTGGA GACGACATTG CCTTGGCGGC CCTCGGTGAA
ATCGCCTTTT TGCCGACTCT TGTACTGGCC GCTCGTCTTT GTCCACCAGG GGTCGAAGCC
GTATTGTTCG CCACACTCAT GTCGGTATTC AACGGTGCCG GCACGGTGGG AACCGAACTT
GGCGCTCTTT TGACCAAGTT GTTTGGTGTG ACGGATAGCA ATTTTGACAA CTTGGTGTGG
TTGACTGTCC TTTGTAACGT CACTTCTTTG TATCCACTTT TCTTTATCGG GTGGCTCGAC
AAGATAGGGG ATGTCTCCGA AGAGGAGATG GAAAGCAAAA AGGGTGTGAT TGAAACAACG
GCAAGAACTA AAGAAACGTA GAGGTAGATC AATAGGCCGC GAATGAAAGT GTTCGTAGTC
GCAATTAGAA GATCCGGTGC ATCGTGTGGT GTCTGTCACC CAGCTGCAAC TCTTTACTGT
TCACGAATGC CGATTTGTTG CTACCTCTAC TCGTACACTA ATT
 
Protein sequence
MLAVIGSNET LRSLGPESND IAHFEATGLD GGDEFTFKGW VQQKLFLGIE PSPDIIAIAT 
IYFVEGALGL ARLAQTYLLK DELALGPAEM SALTGALVLP WTIKPLYGFL SDGFPLFGFR
RKSYLIVAGL SGGLSYSVLS LSGFWESLDK GVAISGTVGA LLLSSACIAM SDVVADGIVV
TRTRQAKDPA IAGGLQSLCW GSAAVGGLLS AYFSGALLEV MSIRSIFGIT AVLPFMVALI
ALQMEEKPYV KEEGHEGLVM GVKDQANALW EALKQPSIWK PTLFLFLWQS TPTSDGAFFY
FMSNDLGLGP EFMGRVRLVT SLATLGGVVV YNQYLKRVPI KSILFWSTIA SFPLGMLPVL
LLTHVNRELG IPDQALIFGD DIALAALGEI AFLPTLVLAA RLCPPGVEAV LFATLMSVFN
GAGTVGTELG ALLTKLFGVT DSNFDNLVWL TVLCNVTSLY PLFFIGWLDK IGDVSEEEME
SKKGVIETTA RTKET
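
The sequences above are printed in space-separated groups of 10 characters, so they need cleaning before use in other tools. The following is a minimal Python sketch of that cleanup together with two simple consistency checks. It is not part of the record: the helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the stated Gene Length (199793 - 198131 + 1 = 1663).

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) nucleotide sequence."""
    seq = clean_sequence(block)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence from this record.
first_line = "TCTACTGTTG ATATCAGTGA AGACAAAAAA CTAATGCTGG CAGTCATTGG CTCGAACGAA"

print(len(clean_sequence(first_line)))   # 60 bases per printed line
print(span_length(198131, 199793))       # 1663, matching Gene Length
```

Passing the full gene sequence block to `clean_sequence` and comparing `len(...)` against Gene Length, or passing it to `gc_percent` and comparing against GC content, gives a quick check that a copy of the record is complete.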