Gene PHATRDRAFT_21873 details


Gene Information

Locus tag: PHATRDRAFT_21873
Symbol:
ID: 7202907
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 661654
End bp: 663039
Gene Length: 1386 bp
Protein Length: 406 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002182112
Protein GI: 219123603
COG category:
COG ID:
TIGRFAM ID:
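
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence includes exactly one stop codon. The variable names are hypothetical and are not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 661654
end_bp = 663039
gene_length_bp = 1386
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: coding bases = three per amino acid plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not codons (for example intronic or
# untranslated sequence), consistent with the gene being marked as spliced.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # prints: 1386 1221 165
```

Under these assumptions the coordinate span matches the reported gene length, and 165 bp of the span are not accounted for by the 406-residue protein.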


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTCGAAGGG TATCCACGAG ACGACACTCC TTGCCACTGT CGCCGTTTGC CTCGTCAGGG 
TCGTCCTCCA TTTACACGCA ACTATTCAGT TCACAAAAAC CAATGTCGTC TTTGTCCACC
GCTTCCACAT CCTCCACAAC CACATTTACG GATACCGAAC TCACCGAGGT AAAGCGAGAT
TTTCAATCGG CAACCGAATT GTATCGAAAG AATCTAGCCT CCACGACCAA ACTACCGGCA
CTACAAAGTC AAATTGCCGA TTTGGAAACG GAGCAGTCCC AACCCGATTT TTGGGACGAA
GCCAACACGA GTCGCGCCGC CATCGTCAAC GCTCAAGTCT CCACCGCCAC GAGACTCCTC
ACCCGGATAC AAGCTTGGCA AGAGTGGCAC GGAGACGCCC AGGCGGCGCT CGAAATATTG
TCCCAGTCGT GGACCGCATC CTCCGACGAC GCCACCGCCG TTACCAGCGG GGTGAACAAT
TCTTTGAGAT TGGCTTCCGA GGAACGTGCC ATGCTCTTGG ACGAGTTCCG TTCCGCCATT
GCGCGCTTAC GCGAAGACAG CGATCGATTC GAATTGGAAT TACTCCTGAG CGGCCCGTAC
GATCACGCCC CGGCTCGTCT ACTCCTGACG GCCGGGGCCG GAGGTACTGA AGCCAACGAC
TGGGTGGGCG ATCTGAAACG CATGTACCAA CGACACTGCG AGGCAATGGG ATTGTCGTGC
GTCGTACAGG ATGAACAGGC CGGGGAAGCG GTGGGCTACA AGAGCGTCGA ACTGCTCGTA
TCCGGTGACA ACGCCTACGG TTGGCTGCAG GGTGAAAAGG GGGCGCACCG CATGGTCCGC
CTCAGCCCGT TTAACGCCAA CAACAAGCGG CAAACGACCT TTGCCGGAGT TGATGTGGCG
CCGGATATTT TGAATCAAGA TGATGATGCT TACTGGAATA CGATTGATGT TCCCGAATCA
GAGTTGGAAA TTACTACCAT GCGCGCCGGG GGCAAGGGTG GACAGAATGT GAACAAAGTC
AACTCGGCAG TACGCATTAA GCATTTGCCG TCAGGTTTGC AGGTAAAATG TGCTCAGGAG
CGGAGCCAAA GCATGAACAA AAATATTGCC TTGAAGCGTC TAAAAGCGCA ACTCTTGGCC
ATTGTCCAAG AACAGCGCGT GGCCGAAATC AAGGAGATTC GAGGGGATAT GGTGGAAGCT
TCGTGGGGTG CGCAGATCCG GAACTATGTT TTCCATCCTT ACAAAATGGT CAAAGACCAA
AGGACGGGTT GGGAAACGTC CAACGTACAG GCCTTTATGG ATGGTGACCT CCTAGAAGAG
TGCATTGGCT CTTTCCTACG ACATAAGGCT GAAGAACAGC GAAAGGAACA AATAGCTAAC
GAGTAG
 
Protein sequence
MSSLSTASTS STTTFTDTEL TEVKRDFQSA TELYRKNLAS TTKLPALQSQ IADLETEQSQ 
PDFWDEANTS RAAIVNAQVS TATRLLTRIQ AWQEWHGDAQ AALEILSQLA SEERAMLLDE
FRSAIARLRE DSDRFELELL LSGPYDHAPA RLLLTAGAGG TEANDWVGDL KRMYQRHCEA
MGLSCVVQDE QAGEAVGYKS VELLVSGDNA YGWLQGEKGA HRMVRLSPFN ANNKRQTTFA
GVDVAPDILN QDDDAYWNTI DVPESELEIT TMRAGGKGGQ NVNKVNSAVR IKHLPSGLQV
KCAQERSQSM NKNIALKRLK AQLLAIVQEQ RVAEIKEIRG DMVEASWGAQ IRNYVFHPYK
MVKDQRTGWE TSNVQAFMDG DLLEECIGSF LRHKAEEQRK EQIANE
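
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before their length or base composition can be checked. The following is a minimal sketch of how that might be done. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation assumes the sequence contains only unambiguous A, C, G, and T bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "CCTCGAAGGG TATCCACGAG ACGACACTCC TTGCCACTGT CGCCGTTTGC CTCGTCAGGG "
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence would allow the Gene Length and GC content fields to be recomputed from the sequence itself.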