Gene PHATRDRAFT_50075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50075 
Symbol 
ID7198757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp317396 
End bp320387 
Gene Length2992 bp 
Protein Length955 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184860 
Protein GI219129363 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTCG AGAACGGAGA TGGAACGAAC GGCAGTAGGA GCGAAGAAAT CGGGGCAAAG 
GCTGCTCCCC ACGCGACTAC GTCGACAGAC AATCGCACAG CATCGTTCAC TTCCACATCG
TCACCACCAT CCTTTGCTCC ATTTCAATCC CAACGAAGCG TTCAGCATAA GAGCAGCTAC
GATGTGTCCA AGTCCATGGA ACGGCTTCGA TCTTCAATGA TTGGTAATGT ATCGTGGATT
CCTGGTCGTC ACGAGAAGGA GGACTACGAT GAACTTGGTC AAGTACCGAC CACAAGGACT
TCAACGACAA CTACTGCATC ACCAGCTACG CCCTTGCCTC CAAGTTTACA CCGCGCGTTG
GGACAGATAC CAGCAATCGT GTTGATTGGC ATATTTCATC TCATGATTGG AATCCCCTTT
GGTGCTTCTT ACTTTCCAAT TGGTTGGAAG GCTCCTGGGA GTGCCGACGA CGAAGACGAG
AATGATGACG ACGGGGTCCA TGGAATTTTT CCGTTACCAG GCAAAGAAGC TCTGGGAATT
CGAATGTTTT TGTTCTCCAC AATCCTGGGA CAAATTGTCT TTACGGGTCT TTCTGGCTTT
CGCAATCCTG TCGGCTTACA AATGGTCGAG AACGTTCCGT TCTGTCACGA GCTGGCCACA
ATCGCCGTGC GTCATCAAGG ATACACCCGA GAAGCTCTGT CAACGCTCTT TGCCATGTAC
GGATTCTCGA GTCTGCTCGT GGGTGCGGTC TTTTATGCTC TGGGACGATG GAAACTAGGC
AACGTTGTTT ACTACTTTCC GAAACACGTT TTGGTGGGAT GCATTGGGGG AATTGGCTTG
TATATCGCCA AAACGGGTGT GGAAGTTACC AGGAATGCGG AATTCTCACT ACGGGCAGCC
ACGGTCACGT ACGGACTTTT GTTGGTCGTC GTCCTTGCCT TTGAAGTCGT CCTGCGTCTA
CTGGAATTTG GAACGCGTGA CGTTAGCGGG AACGCAAGGT ACCCATTGTT GTCGCCGTTA
TATTTTTGCA GCATTACACC CGTCTTTTAT ATGGCACTCT TTGTGCTTGG CGTGAACATT
GAGACTGCCA CGGAGGAAGG ATTCTTCTTT CCCGCATTGG ACAAATGTAC TATTGGAGGA
GGTGAAAATG GTGAAGCCTG CTCCACGTCC TTGTGGGATT CCATTTTTGA TCAGAATCTG
TTCAATATTT GGAAAGTCGT AAACTTCTCG ACAGTCTCCT TTCCCGCGCT GATGGACGCC
ATTCCGACCT TGGTCGCACT GACTTTGTTC AGTCTCATTC ATGTTCCCAT TAATATTCCC
GCCTTCGCAA ACTCCACGGA CACTGATGTA GATATGAACA AAGAACTGAT TGCTCATGGC
TACTCCAATT TGCTGGTCGG CATTTTTGGC GGCTTGCAAA ACTACATGGC CTATACGCAG
TCGGTCTTGT ACGACAAATC AGGGGGAACG GGAAAGGCCT CGGGCTATGC TGTCGCCGGC
ATTACGTCGG TGCTTTTCTT GATTGGGCCC ACCATTGCTT CCTATATTCC CCGGTGTATG
GCGGGGACCT TATTGGTCCA CGTAGGTGTG GATTTGTTTC TGGAAGGCGT TTACGAAACA
TGGGGAAAGT TCGACGCACT GGAGTACGGT GGTATTTGGC TTATAACAAT GGTCATGACA
CTGTACGGGA TGGAGGCCGC CATGATTGCC GGCTTCATCA CGGCTCTTTT TACATACGCC
GTGCAAAATA CGACGTACGT TCATATCCTG CGTGGATCCA TGTCCGCAGC TACGTTGCGC
AGTAGCAAAT GGAATCGCAG TACCCGGGCC AACGCTATTT TGGCGGACGA GTCGACCGGA
CGCAATCGTA TCCTGGTGGT CCAACTCCAG GGACACTTGT TTTTCGGCAA CATGGTGCAA
CTCACCCAGA GTGTAAACGA TGTGCTGAGT GAGAAAGCGA AGCCTCGTAC GGAACCTTGG
ATTGTGATTA TTGATTTTGG TTTGGTACTG GGGATTGACT CTTCCGCGGC ACAATCGATC
AGCAAACTAT CCAAGACACT GCAACACAAG CACGGTGTCG ATCTTTGCAT TTTCGTGACG
GGTTCTGGGG AAGGCTTTCC AACGGCCTAC AGTTTGTCCA AGGAATTGTC CACTTTATCA
TCGACCACGC CAGTGGTTGT TTCGGATGAA GACGTGCGGA CAACCGAAGC GACACCCTTA
TTGGCACCGT TCGCGACACC GAATCCCGAT ACATCATCAT CATTGTACAC GGGCAGTCGT
GTATGCACTA CGCTGGACGA TGCGCTGGTG TTTGCCGAAG ACGCGTTACT GGCGCGCACC
GATTGGTCGT TGTTGGAAGC AGACCGTCAC ATTGGCGATC CCCTCCGCGG CGGCGTGTAC
GATATCACGG ACGAAACGCG AGTGGCTTTG CGGTATTTGG AAAATCTGTG TCCACGCGGG
GTGGACCAAG CGCACGTGCG TTTGCTGTGG AAGTGCATGA CACGGGAAAC GTACGTATGC
GGCGATTCCG TGTGGTTGCA GGGTTCGGAG AGTGACTGTA TGAAACTGTT ACTGCGTGGA
ACCTTATTGG CGTCTCTCGA GAACGAAGCC GGGACGAACG AAAGCATCGC GGCGGGCAAT
ACGATTGGGG AATTGGGTTT GGTGGAACAC ACGCCACGGA TGAGTTCCGT CACGGTAGTG
TCGGCGGACG CTGTCCTCTA CAGTCTACAC CGCGAGCGGT GGCGGGAATT GAAGGCCGTG
TCCCCCCACG CCGCGTCACT GACGGATCGT ATCTTGATTC GTTACCTGTC TGCACGTGTC
CAACACGTGA GCAATCGTAT CTACGAAACA CGGTGTTTGC CGATATAGAC GTTGGGTTGG
GGGCACCAGG CCATCGTTTG TGGGAATGGC AAAACAAGTC GACCAATCAC GTGGGTGGCC
GGGGTTTGCT GGTATCCATC CTTCCATGCC CGTAAGGATA CCATGTCACC CT
 
Protein sequence
MEVENGDGTN GSRSEEIGAK AAPHATTSTD NRTASFTSTS SPPSFAPFQS QRSVQHKSSY 
DVSKSMERLR SSMIGNVSWI PGRHEKEDYD ELGQVPTTRT STTTTASPAT PLPPSLHRAL
GQIPAIVLIG IFHLMIGIPF GASYFPIGWK APGSADDEDE NDDDGVHGIF PLPGKEALGI
RMFLFSTILG QIVFTGLSGF RNPVGLQMVE NVPFCHELAT IAVRHQGYTR EALSTLFAMY
GFSSLLVGAV FYALGRWKLG NVVYYFPKHV LVGCIGGIGL YIAKTGVEVT RNAEFSLRAA
TVTYGLLLVV VLAFEVVLRL LEFGTRDVSG NARYPLLSPL YFCSITPVFY MALFVLGVNI
ETATEEGFFF PALDKCTIGG GENGEACSTS LWDSIFDQNL FNIWKVVNFS TVSFPALMDA
IPTLVALTLF SLIHVPINIP AFANSTDTDV DMNKELIAHG YSNLLVGIFG GLQNYMAYTQ
SVLYDKSGGT GKASGYAVAG ITSVLFLIGP TIASYIPRCM AGTLLVHVGV DLFLEGVYET
WGKFDALEYG GIWLITMVMT LYGMEAAMIA GFITALFTYA VQNTTYVHIL RGSMSAATLR
SSKWNRSTRA NAILADESTG RNRILVVQLQ GHLFFGNMVQ LTQSVNDVLS EKAKPRTEPW
IVIIDFGLVL GIDSSAAQSI SKLSKTLQHK HGVDLCIFVT GSGEGFPTAY SLSKELSTLS
STTPVVVSDE DVRTTEATPL LAPFATPNPD TSSSLYTGSR VCTTLDDALV FAEDALLART
DWSLLEADRH IGDPLRGGVY DITDETRVAL RYLENLCPRG VDQAHVRLLW KCMTRETYVC
GDSVWLQGSE SDCMKLLLRG TLLASLENEA GTNESIAAGN TIGELGLVEH TPRMSSVTVV
SADAVLYSLH RERWRELKAV SPHAASLTDR ILIRYLSARV QHVSNRIYET RCLPI