Gene PHATRDRAFT_45431 details


Gene Information

Locus tag: PHATRDRAFT_45431
Symbol:
ID: 7200676
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 131608
End bp: 135232
Gene Length: 3625 bp
Protein Length: 1158 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002179586
Protein GI: 219117587
COG category:
COG ID:
TIGRFAM ID:
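The Gene Length value can be reproduced from the Start bp and End bp fields if the coordinates are taken to be 1-based and inclusive. That convention is an assumption here; it matches the numbers listed but is not stated in the record. A minimal Python sketch, where `gene_length` is a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
length = gene_length(131608, 135232)
print(length)  # 3625
```

The result agrees with the listed Gene Length of 3625 bp.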


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.693925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTC CAACCGCATC CCCCAGCTAT GATTCGGACG ATCCGCCATC TCATACACAG 
TTCTTGGATT TGACCAACGG CAAAGGGGGT GATCTGCTCC CGCATCCATA TCTTCCCCCT
CAGCCGGAGT CCGATTACGC CCACGCAACA GAGACTCCTG TGGACGCGCA GGATTCCGAC
GATCAAGACG ACAGCAACAT GGGCGACGAC GAAGACGACG GGGGTATGGA GGGATCCCCC
ACAATCCGCG GTACGGTACC ACCCGGTAGC GGTGCAATCG GTATTAATCC AAACACTCCC
GGTAAGGTCG TTGACGTTGG GCAGGAACAC ACGGGCCGCT GGACCAAGGC CGAACACGAA
GCTTTCTTGT CGGCGCTGCA AACGTACGGT AAGGAATGGA AAAAGGTGGC TGCCAAGGTC
AAAACTCGGA CGGTTGTCCA AACTCGTACG CATGCACAAA AGTACTTCCA GAAACTCCAA
AAGACAATCG AAAGCACAGG GAAAGACGAC GTTACTCAGG TTCACATGGG CATTGATAGC
GGTGTCCTTG ACAAACAAGG TAGCGGGAGT GCTGCCGGAA GTTCACACCA GAAGAAGCAA
CGATGTCCCG CTCCGGTCTC TCTTCAAAAG CCTGAACGTC GGTCCAGTAG TGCCACTATT
TCCGCCGCAC AGGTCATATC CAACCTTTCG TCCCATACGA GCACACAACC TTCTTCTATG
GGCCCATCTG TGGCTGCAAG AGGTAGCGCT CCGCTAAAGT CCAAGTCAGC CGATCCGCAG
TATAGTGCAA TGCGTCCACA TGGCTTTTCA ACGGAGGTTT CTTCATCGTC TTTCCCATCC
TCTTTTTCTT CCTGGATGGG AAACAATTCC ATGAAAATTA CAGCTCCAAA TCCGGAAGAC
ACCAAAAACA GCTTTCCGGA ACCGTCTCCT GCAGCTACTG GCAAACGAAA ACTAGCGGAA
ATTGCCGCTG CTAGAATGTT GGCCGGTGTA GGGCAACAGC AACAAAGGCA GCTTCAACCT
CTGGTGGATC GCAACGATGA AGCTCCGACA CCGCCTCTAC CCGACACAGA AAATAGTAAA
GGTTCAAGTC TAAATCTCCA TGAAGCCCCA CCGCCACCAT TACTATTTGG GGATGGGTTC
AATATGTCAA GTTTGACGTC GAAAAAGGGA GTGGCTCTAC AAATTGTGAA TCCAGAAAGC
TTGGGCGTTT CGCACGATAA ACCTCGTCGT GGAGGTGGAG ATTCGCCTGT CACACCCTGG
GATGGGCAGC TCGAAGCCTT AGTCTACGAA AAGGCCAAAG TTGAATCTAA GGAGGAAGAG
ACAGGAGGAT CAAAACCTGC TGCATTGCAT CCGGTATGTG GCCCGAGTAC AGCATATGGT
CGAACGCCAC TACATCAAGC TGTCTGCGAA ATGGATTTGG ATGGCGTAAG GTGCCAATTG
CAGGATATGC CCAGCCAAAA TGTCAGTGTG CTGCATGGCC TCGATGAAGC AGGCTATTCT
CCGTTGCACA GTGCCTGTGC TTTACGATTG AGCTACGGCC AAAGCGCTAT CGTGGCTCCC
CAACTTGTCA GACTCCTCTT GTCTGCCGGT ATCTGCGATC CCTCTCGACC CGACATAAAG
GGAAATACAC CACTACACTG GGCCGCACGT TCCGGGGATC GAGATGTTGT GGAAATTTTG
CTTCTGAAAA ATTCTCTACT GGATGCCAGA AACCAGGCGG GCGAGGCTCC CCTTCACTGG
GCGATGCGGG CAGGTGAACG AGGGACTACG GTCGCTTTAT TACTTTTGGA AAACGGTGCT
CGACCTAGCT CACTGAGCAA AGAGTACCGT CGACCCTTGG ATGTAGCAGC GGATGGATTC
TTAGACGAAG AAAGGTCGTT GGCTGTTCTG CGGGTCGCGG AACAATCATA TCGAGGGATA
AAGCCAAGCA AAGCATTGAA AAAACGGCTA AAAGAAACCG CAAGCGAACG GAGAGATGCG
CGAGCTGCTT TGCTAATTCG GTCCGCACAG TCTAGAACGC TCGTATTGCA TCACCCCGAA
TGTTTGGAAC ATCACCCGAA ATCAGCTACG GATTGGGAAG CGCCAGATCG AATAAGGAGT
ATTATGCGCC GAGTACTGCC TGCAAGTGAC CCTACCGGTG CGACCGAGAC ATCGGGCATT
TTCCCTCACG AGGTAACGGT GTCCAAAGAA TTTGAAAGGG CAAAGCTTGA TCTCCTCAGT
CGAGTGCATA GTACAGATTA TCTATCATTT GTCAATGCAT TGAGCAAAGA CCTCGAAAAG
CAATTGCGAG AATCAGGGGG GAGCTTCAGC GCAATGGACG AGTCTGACAA TGGTTTTGGA
TCACCACCGC CGGTAGTTCC GTTCACACCG CTCGTTCAGA GATCGCTCAT TAAAGTAGAT
GAGTCTAGAA TCAAGCTGGG TGTAAACTCC GATACATCGT TCAGTGCAGG GTCACTCCGT
GCTGCACGGC GCGCAGCTGG GGCAGTGCAG CATGCAGTGG ATTGGTAAGT CATGACTCCT
GAATTTTCAA AGCGTGCTTG CAAATCACTT TCTAATTTTA AATCTTCCTT GTCTAGCGTT
TTGGTTGGGA GAAATCGCAA TGCTTTTTGT GTAGTTCGGC CACCCGGTCA TCATGCCGGC
ATAAATGGTT TGCTGGATGG GGGTGAATCT TGTGGATTCT GTATTTTCAA CAACGTTGCC
GCAGGCGCTC TTTATGCGAT TTCAGAAGAT AGGCTCCTGT GTGGCCGGTG CGCAATTGTT
GACATTGATG TCCACCATGG AAATGGAACT GAAGACATCG TTCGAAAATG CCACGACCCT
AGCAAACTTT TGTTCTTCTC AATACATCTC TACGACAACG ATAGGAAAAA GAGGGGTTCA
AATCAGTTTT CCTATAAGTT CTACCCTGGA ACCGGTTCTG AGGATGACCT TGCATTGAAT
ATCATCAACG TGCCCATTGT ACCTTTGTGG AAAGAACACT CCGCTACTGT GCAACCTTCG
ATAAAGACCC ACAACACAAG ACGGAAAACT CGAACATCTC AGGAAGGTCC AGACGAAGAA
AGTGATACCA CGCCAAAAGA TAGTTCACGT ACAAGCGATG TTGGCAGCGA AGAAGGCTCT
ACCGCTGCGT CTAATTCATC TCCCAGACCT GGAGGACTGT CATCCGGAAG AACTGCGTAT
CGAAATGCAA TCCAAAATCG CTTACTACCT GCGCTTCGGG CTTTCAACCC TGATCTCATC
CTCATAAGCG CCGGTTTTGA TGCAGCAAAA GGAGATGTGG GAAATGCTCG ACACGAGCGA
GGCGGAGAGA AAGTTGGGCT CGACTTAGAA CCCGAAGACT ATGCATGGAC AACAAGAAAG
ATTCTGGAGA TTGCCGATAT TTGTTGCCAG GGCCGCGTTG TTTCGGTACT TGAAGGGGGA
TATGGAAGAA CGCCAGCTGC CTTGCCCACA GGCTCGTCCG CCCTGGATCG CACCTTGTTT
GCCGAGTGCG CCATCCGGCA TTTACACGCC ATGGTTGATC CGTACGACAC CGAGCAGCGA
TTTAGCTGAA TTTTGCAGTT AGCTTGAGAT GGTCCGAAAT TTGATTTGAA AAATTATGAA
TGTATAGCAA TAGTGGAATT AGAAA
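
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before counting length or GC fraction. The following is a rough Python sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a short sample.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short sample.
sample = "ATGTCTTTTC CAACCGCATC CCCCAGCTAT GATTCGGACG ATCCGCCATC TCATACACAG"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Applying the same two functions to the full sequence text would give values to compare against the Gene Length and GC content fields.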
 
Protein sequence
MSFPTASPSY DSDDPPSHTQ FLDLTNGKGG DLLPHPYLPP QPESDYAHAT ETPVDAQDSD 
DQDDSNMGDD EDDGGMEGSP TIRGTVPPGS GAIGINPNTP GKVVDVGQEH TGRWTKAEHE
AFLSALQTYG KEWKKVAAKV KTRTVVQTRT HAQKYFQKLQ KTIESTGKDD VTQVHMGIDS
GVLDKQGSGS AAGSSHQKKQ RCPAPVSLQK PERRSSSATI SAAQVISNLS SHTSTQPSSM
GPSVAARGSA PLKSKSADPQ YSAMRPHGFS TEVSSSSFPS SFSSWMGNNS MKITAPNPED
TKNSFPEPSP AATGKRKLAE IAAARMLAGV GQQQQRQLQP LVDRNDEAPT PPLPDTENSK
GSSLNLHEAP PPPLLFGDGF NMSSLTSKKG VALQIVNPES LGVSHDKPRR GGGDSPVTPW
DGQLEALVYE KAKVESKEEE TGGSKPAALH PVCGPSTAYG RTPLHQAVCE MDLDGVRCQL
QDMPSQNVSV LHGLDEAGYS PLHSACALRL SYGQSAIVAP QLVRLLLSAG ICDPSRPDIK
GNTPLHWAAR SGDRDVVEIL LLKNSLLDAR NQAGEAPLHW AMRAGERGTT VALLLLENGA
RPSSLSKEYR RPLDVAADGF LDEERSLAVL RVAEQSYRGI KPSKALKKRL KETASERRDA
RAALLIRSAQ SRTLVLHHPE CLEHHPKSAT DWEAPDRIRS IMRRVLPASD PTGATETSGI
FPHEVTVSKE FERAKLDLLS RVHSTDYLSF VNALSKDLEK QLRESGGSFS AMDESDNGFG
SPPPVVPFTP LVQRSLIKVD ESRIKLGVNS DTSFSAGSLR AARRAAGAVQ HAVDCVLVGR
NRNAFCVVRP PGHHAGINGL LDGGESCGFC IFNNVAAGAL YAISEDRLLC GRCAIVDIDV
HHGNGTEDIV RKCHDPSKLL FFSIHLYDND RKKRGSNQFS YKFYPGTGSE DDLALNIINV
PIVPLWKEHS ATVQPSIKTH NTRRKTRTSQ EGPDEESDTT PKDSSRTSDV GSEEGSTAAS
NSSPRPGGLS SGRTAYRNAI QNRLLPALRA FNPDLILISA GFDAAKGDVG NARHERGGEK
VGLDLEPEDY AWTTRKILEI ADICCQGRVV SVLEGGYGRT PAALPTGSSA LDRTLFAECA
IRHLHAMVDP YDTEQRFS