Gene PHATRDRAFT_42552 details

Gene Information

Locus tag: PHATRDRAFT_42552
Symbol:
ID: 7196258
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 395498
End bp: 397188
Gene Length: 1691 bp
Protein Length: 526 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002177082
Protein GI: 219110661
COG category:
COG ID:
TIGRFAM ID:
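
The gene length listed above is consistent with the start and end coordinates if both are treated as 1-based and inclusive. A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers rather than stated by the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(395498, 397188))  # 1691
```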


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAATCGCTCC ATGGACCAGC AAGTTATCCA AGAAGTATGA ATGGTTGTAA TGGAGGTTAT 
CCTACCTTAT CTCCGTACGT TGGCAACGAT TTCGGGGCAA CGCCTCGAAC CACGCACATG
AATAGGTCAC CTCCACTTCA TCATGTTGTG CACACCGGCG GGTCTTCACC GGACCACTTG
ATCCCATATT GGAAGCCCAC ACATAACCAG TATATTTCTT CCCCTGTACC ACGTCCTTTA
AAAGAGTCGT CTCCAATAAC CAACATTCAA TCCGAAAGTG AAAGAGACTT CTCCTCGCCT
CGGACTCTCC CTTCCTATTG TTCGAGGTCT TTCGATAGCC CCCCGGATCC TAACACGACG
GAACGCTTGA TGTCATCTAC CATGCAAAAG CGCGTCTCGC CCAAGGAAGC CCGAAACAAT
GTCCCCGAGA CGAGTGTACC CTTGAAAAAG CGAAAGACTG TGATGCAGAT GCATCGCAAC
CCCGTGGTAT CACCTTTTCA CGCCAGTCCA ATATCTCATA GTAGTAAGGC GGTCGCCTCT
ATCTCACTGT CTTCTAATCG AATCCCATCA TACGACAGTC GGGTATCATC ATACGACTCT
AAAGACGGAC AAGCCCTTCT AGAAAGTTCC AAGATCGTGG ATCCGACGGA CATTAAGGCG
GAGGAACCTG GAATGAAAGA CTTTCCTAAT GTCTTACATA GCGTACTTTC AGATTCCGAA
TTTGCCGGAA AAGTTGTGCA ATGGCTGCCG CATGGGAAGG CTTGGAGGAT TGTACGATTT
GACGCTCTCA GGAAGCTGGT GCTGCCGAAG TTTTTCGCCA ACCTTCGCCA ACCTAACAAT
AACGAAACAA CCGGTTCCAT CGACACTTTC CTCAAGTATC TTTCTTCCTG GGGATTCGAG
GAAGTTACTG ATGGCCCTGA CGTTGGTGCA TACACTAATG TGGTAAGTAT CTAATTTCAA
GTTTTCAAAT GGGAGAGATA ACGTGGACCC TCATACCAAC TCCATTCTTG TCGCAGCTCT
TCCGACGTGG CCTTCGTCGG CTTTGCTCCG AAATGAAGTT CAAACCTTGG GGAAAGGAAG
ACTCAATACA GATAATTGAA TCATCGAAAC AGCCTCAATC GATTCTTCGT GTGCCTTCGC
TTGCGTCAAC GGTGGATACG TTGGAATGCA CCTCAAGCAA AGAGATGGAT GGTCAGTTTG
TTCAAATGAA CCCTTCGGGT AGCTCAGAGC GTCCGGAGTC GTGGCGCACC AACCAGTGGG
AAAGATCTCC TGATAATCGA TTTCTCCGAC AGACACCACC GTCTGAAGCA TGGCCATGCA
ACTATCAATC TGCCGTTTCG AACAGCTTTA AAGGGTCGGA ACAATCACGG AATATTCAAT
ATTCTCCAGT GCGTATACGT TCCTCTCGTG GAGCGCCTCG CACTTTGAGC AGAACGAAAG
CACCCGCTCA GTATCAGAAC CACCCACAGC TTCAAAAGCG TCCATGTGCC TTCCCAGTAT
CTAATCGTGG CCGTGGAAAG GTATGGAGCC CTCGTCCATT CTCTCCGTCC GTTCACCCGT
CGACATCTCC AGTTTCAATT GAGACAAATG TTATGTCAAT CGGTCAATCA CGCTCGACTC
TCAACCGTAA GGAGCCACTG GCGTCCTCAC CGGAAGAAAC GATACGAGGA GAAAACGTCA
CTGCTGTGTA G
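
A GC percentage like the one reported in the gene information can be estimated from a sequence printed in this 10-base block layout. The sketch below is only illustrative: `gc_content` is a hypothetical helper, and the record does not say exactly which sequence or rounding rule was used for its own figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Usage: paste the whole gene sequence (blocks and line breaks included)
# into a string and pass it in. Shown here on the first two blocks only.
print(round(gc_content("CAATCGCTCC ATGGACCAGC"), 1))
```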
 
Protein sequence
MNGCNGGYPT LSPYVGNDFG ATPRTTHMNR SPPLHHVVHT GGSSPDHLIP YWKPTHNQYI 
SSPVPRPLKE SSPITNIQSE SERDFSSPRT LPSYCSRSFD SPPDPNTTER LMSSTMQKRV
SPKEARNNVP ETSVPLKKRK TVMQMHRNPV VSPFHASPIS HSSKAVASIS LSSNRIPSYD
SRVSSYDSKD GQALLESSKI VDPTDIKAEE PGMKDFPNVL HSVLSDSEFA GKVVQWLPHG
KAWRIVRFDA LRKLVLPKFF ANLRQPNNNE TTGSIDTFLK YLSSWGFEEV TDGPDVGAYT
NVLFRRGLRR LCSEMKFKPW GKEDSIQIIE SSKQPQSILR VPSLASTVDT LECTSSKEMD
GQFVQMNPSG SSERPESWRT NQWERSPDNR FLRQTPPSEA WPCNYQSAVS NSFKGSEQSR
NIQYSPVRIR SSRGAPRTLS RTKAPAQYQN HPQLQKRPCA FPVSNRGRGK VWSPRPFSPS
VHPSTSPVSI ETNVMSIGQS RSTLNRKEPL ASSPEETIRG ENVTAV
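
The start of the protein can be checked against the gene sequence by translating from the ATG at base 37 of the displayed sequence. The sketch below assumes the standard genetic code, because the record leaves the translation table field blank; `translate` and `CODON_TABLE` are hypothetical names. Since the gene is marked as spliced, translating the full genomic sequence this way would not be expected to reproduce the whole protein, so only the first few codons are compared.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop, 'X' an unknown codon."""
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop a trailing partial codon
    return "".join(
        CODON_TABLE.get(cleaned[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First line of the gene sequence as printed in the record.
first_line = "CAATCGCTCC ATGGACCAGC AAGTTATCCA AGAAGTATGA ATGGTTGTAA TGGAGGTTAT"
# Index 36 (0-based) is the A of the ATG at base 37.
print(translate(first_line.replace(" ", "")[36:]))  # MNGCNGGY
```

The printed residues match the first eight residues of the protein sequence above.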