Gene PHATRDRAFT_49409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49409 
Symbol 
ID7195903 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp180415 
End bp182886 
Gene Length2472 bp 
Protein Length606 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184076 
Protein GI219127716 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCGCG AAGACAGCAC CCATGAAGTG ATAGTCTCGC AATGTTGATG CTAGCCAAGA 
TGTCTCAGGA TAGAGGATTG ACATTGTACT GTCTTTCTGG GAGGGTTGCT ACCCTAAAGA
CATAGGTAAT GTAAGTACCT TCGATTTGCA GTTACAGTCA ATCATACAAT TCACCGATGG
TTTGTACCAT TTGGTCTCTG TATTTTTTGT AGAAGGTATT CATTGTCTTA TTTTATGAGG
AAATTCCCTA TTCGGGCAGC GCCTCTCTAG CTAGATGGCG CAATTCCCCT ATTCCGGCCA
AATCCGAGAT TTCTAACTGT AAGTAATATT TTTTGAGCGC GAATTCCAGG ATCCGACAGA
CATGTCACCG AAAGAGCGGT TGCGAGTGTT AGTGGAGATG ACCGCTTTAA CGCGTTCTCA
CTTCTCTACG TTTGACCATG ACGATTACTC ACATACTTAC AGTTAGGACA GTCGCTCTGG
TCGCCCTTTT CCTTTTCCCG ACCCTCGCAG AGGTAAGGAT TTCTGCGCAG AGCATTGATG
GACGTTCAAT TGTCACGTCG GAAGCAGAAG AAGATGAAGC TGTGGTTTCC TGCACGCTCG
ACAGCCTGTC GATCGGGGGA CAGCAATTGA ACGTTTTGGA TTCGATTTCA TTTTATCTAC
ATCGAAATGG CGAGCCAGAA GCATGCGGAG CTTCCGTTTT GTCAGAGTCT ACGATTGAAA
AAGGTTTATC GCAAATGGAG TGCAAGAAGG AGGTTGACAA GTACCAATTG GAAGCGTTGT
TGACAGCAGT TTTTGCTGAA GAGCTCGCCG CGTCCGATTG TGGATCGAAC GACAACAGTA
CTGCCCCGGA AGGCTTCCTG CACTACTGTG ATATGGGTGT TGATCGTACT ATTGTACAGA
ACGACTATGA TCATTTGGTC CGAATACCAA GGAAAGGGAA TCTACCCTGT CGATTCTTTT
CCCGAGAAGG TGATCGAATT TCGTCGTTTG AGGACATTTT GAGAATGGCC GAAAAAGCCG
AAACGCGGGA ATGTACTGAA GAAAACGATG TATGCACAGA CAGGATGGAT TTGCATTTGT
ACGCCGCCCC GGCCGGTCGC GTGTTCATGT TTGCGCCCTC ATACGTCGGA GAAATATTCC
AAATTTCTCA TGTCAAAGAT AGTCTTGGAC AACCTATCTC GCTTGAAGTA CTGTCGCTGG
ACCCTCGCGT TTTTGATATT TATAATTTTT TTTCTGCTGA TGAAGCACAC AGTTTGATTG
ATAAAGCAGT TAAAGAGACT TCACCCACGT ACAGACTGCA TCGCAGTACC ACCGGCAGCG
CTACGGCCAG CATCTTCAAC AAACGCACCT CGGAAAATGC GTGGGATACT CACGGAACCC
TGGCACAGAC GATCAAGCGG TAAGAATCCA CATCTTCTAG TTCAGCGACT CCTTCGATTC
ATTTGTTTAA ATATTCTCTT TCCCTTTTCA GTCGATGTCT CTCAGTTCTT GGTATTGACG
AATACCAGGA GTCGTTGACT GATGGGCTTC AAATCTTGCG CTACAACCAA TCAAATGCTT
ATACCGCCCA TATGGATTAC TTAGAGGACA AAGGTAAAAT TTTTGCAGGC ATGATGTTTT
TGATTTACCA CGGTTATAGA TCTGATGGCT TTTTATGTAC AATTGAATGA CAGACGGATC
TCAGGTGTAT GACTATGAAA GCGTTGGAAA AGGTGGAAAT CGGTTTGCAA CGATTCTGCT
TTACATGAGC GATCTTGGCG AACATGATGG AGGCGAAACG GCTTTTGTTA AAGCGGATTC
ACCAGGCAAG GAGCATGTTC CTCTCCATCG CGCCATCCAA CAACTACGTG ATTCAGGCGA
TGCAAGTACA TTGACTCCTG ATTCATGGGA AGAAATAATG GCTGCACAGT GCCGAGCACG
CCTTTCGGTC CGACCCAAGC TAGCTCGGGC TGTCCTATTC TATTCGCAAC ATCCCAACGG
AGCCGAAGAC AAAATGTCGT TTCATTCCGG TTGTCCAGTT CTCGGAGCGA CCACGAAATG
GGCGGCAAAT TTGTGGGTCT GGAATGCACC AAGGCCTGAA TTTGAAGGAG CGCCTTTGAA
AAAAGATCTC AGCAATGTGA CAAAAGCCGA ATCGGCTGCA TCACAGCAGC TTCGGGCTGT
TTTTAGAAAT AGTGGAAAGG ATCGTCGTTT CGAAAAGGCA GAACTTTATT TTGACGAAGA
TGGATTCTTT GGCAAGTTGG GACCAAAAGA TCCTCCTATT AGCGTTAACA CATACGAGAC
ACATCGCTGG AACGTCAAAG TTGACGGTGA AATTTTAGTC TCTTTTTACA TCGATGATCA
ACCTGTGCAG GAATTCACTG TCTGAAAAGT TCGATCTTAT TTTAGGTACG ACCAAATCAC
ATGCTCTGTC AACTGGCGTG ACCATTCGTA GAGTAAAAGG TTGTACTTTA GACTTGATCG
CCGTTGTAGA AG
 
Protein sequence
MVREDSTHEV IVSQCNKVFI VLFYEEIPYS GSASLARWRN SPIPAKSEIS NFRTVALVAL 
FLFPTLAEVR ISAQSIDGRS IVTSEAEEDE AVVSCTLDSL SIGGQQLNVL DSISFYLHRN
GEPEACGASV LSESTIEKGL SQMECKKEVD KYQLEALLTA VFAEELAASD CGSNDNSTAP
EGFLHYCDMG VDRTIVQNDY DHLVRIPRKG NLPCRFFSRE GDRISSFEDI LRMAEKAETR
ECTEENDVCT DRMDLHLYAA PAGRVFMFAP SYVGEIFQIS HVKDSLGQPI SLEVLSLDPR
VFDIYNFFSA DEAHSLIDKA VKETSPTYRL HRSTTGSATA SIFNKRTSEN AWDTHGTLAQ
TIKRRCLSVL GIDEYQESLT DGLQILRYNQ SNAYTAHMDY LEDKDGSQVY DYESVGKGGN
RFATILLYMS DLGEHDGGET AFVKADSPGK EHVPLHRAIQ QLRDSGDAST LTPDSWEEIM
AAQCRARLSV RPKLARAVLF YSQHPNGAED KMSFHSGCPV LGATTKWAAN LWVWNAPRPE
FEGAPLKKDL SNVTKAESAA SQQLRAVFRN SGKDRRFEKA ELYFDEDGFF GKLGPKDPPI
SLTVKF