Gene PHATRDRAFT_45650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45650 
Symbol 
ID7200440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp818119 
End bp819345 
Gene Length1227 bp 
Protein Length408 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179921 
Protein GI219118286 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00296757 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAC GGCGAAATGA GCAGGCGCAG ATGAGCAAAG AAGATTTCGA AGCACAAATG 
GACCGCGAAG ACACCGACAG TGTCCCTGCC GTGTTCGAAC AGGCCTCGGC CGAACAACTC
GAGCAGCGTC GTATTGTTCG TCGATCGACA CGCCCCCCAT CCATGTCACC AGCCACGAAT
ACGCCGGTCT CATCGTCTGT GCCGCTGTCG TCCAATCCTT TCGCTGGCGT CAAGCTATCT
TCCGATTGCA AACCTCTTTT TTCGTTCGGA TCAAAGCCAA CCGACGGCAC CAGTGTCCAC
GACAACCAAG AGCAACCGAA GCCATACGCT ACACCTTCCC CCTTTTTATT TGGTACTTCT
GCTCCCGCGC CAAAAATCAG CACTATCAGT TTCGCCCCAC CTCCAAGTGG TGGCATGTTC
CAATTCTCTC CAGCAGCCAG TACAACATCG GCATCCGCCA TATCGACGAT TTCAACGACC
AATTGGGGGG AAAAGAGTGC CGAGCTGGAT AGGACCTTTC GTGAAACGGT GCTGTCGTCT
AGCTGGGACG GTCCCCAATA TCATAGCAAA ATAGACAAGG CCTATCTTAA GGACTGCTTC
GCCCAGGAAA CGGCGTACGT CAAGGAGAAA GAAGCGGCTA CAGCAACCAT TCGATACTCG
CCATCCAAAC CAGCCACTTC CGAGTCTACG GCATTTGGTA CGTTTGGAAA GTTTGCACAA
TCCACCAGTA GCAACTTTTC CAAAACACCG ACTCCAACTG CGTTTGCGCC AATCCCCGAC
TCGTCTACGA CAGACAACGA CAACAACGCT GGAGAAGCCG AGGACGAAGA CACAATGATC
CAACCAGCGT CAGATCCCGA CTGGGAGATG GACTCCGAAT TTGGCCGCGT TTTTTTCTAC
CACCTGGTGG ATACCAAAAA GCCGGAATCT GGATACGCGG GGTTTGGCTC CGGCACGTTG
CGTATCCAAA AGAACAGCAA GACCGGAAAA TATCGCATGC TGATGCGAAA TCCCGCCGGC
ATCAAGGTTT TGATCAATAT TCTGATCACC TCCGATATGA CGTTCAAGTT GACTGCGTCT
AAACGAAAGG GTCAAGACGC CACCGAAATT TCTTTTTTTG CCACGACTAG TGTGGATCGA
GGCTACGAAC AGTTCCGGGT AGTATCGCTT GCCGAGACGG GAAAGAAGTT GCACAAAAAG
CTCGAGTCGT TGGCCTCCGT AAGCTAA
 
Protein sequence
MGKRRNEQAQ MSKEDFEAQM DREDTDSVPA VFEQASAEQL EQRRIVRRST RPPSMSPATN 
TPVSSSVPLS SNPFAGVKLS SDCKPLFSFG SKPTDGTSVH DNQEQPKPYA TPSPFLFGTS
APAPKISTIS FAPPPSGGMF QFSPAASTTS ASAISTISTT NWGEKSAELD RTFRETVLSS
SWDGPQYHSK IDKAYLKDCF AQETAYVKEK EAATATIRYS PSKPATSEST AFGTFGKFAQ
STSSNFSKTP TPTAFAPIPD SSTTDNDNNA GEAEDEDTMI QPASDPDWEM DSEFGRVFFY
HLVDTKKPES GYAGFGSGTL RIQKNSKTGK YRMLMRNPAG IKVLINILIT SDMTFKLTAS
KRKGQDATEI SFFATTSVDR GYEQFRVVSL AETGKKLHKK LESLASVS