Gene PHATRDRAFT_42887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42887 
Symbol 
ID7196528 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1419979 
End bp1421835 
Gene Length1857 bp 
Protein Length374 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176785 
Protein GI219110066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0365267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATGCTGTA CATGTAAATA CCAAATTCGT GAACCTAGTT TCGGAAAGTG GCTTAAAAGT 
TATATGTGAC CATAAAATTG AAATCGAGCT AAAAAAGTTT ATTTCGACGA GGAGATATGT
GCTCACTGTA AAAGACACCT GAGCTTGATA CCACTGCCGG ATTAATTGTG TGCCAGAAAT
TCCCATGTAT ACCAGTGTTC GCATTACTTT GTTGAGCTGA AACAAAGCTT CCGCGAAACC
TGTACTGATT GAGAATTGGA ATTGTTTCCA CAAGATGATG ATGAGCTTGA GAAGTCTGGT
AAGTGTAATT GCAATCGCGT TCACATTGGG AATCGGGATG TACAATGGCG TGCACGATAT
GAGTCATGTG GTCGAGACTT TGTCGTCGAT TGAGAAAGAG CTAAATCCCT TGCAGCCCAT
GACGGGATCG GAGCAAAGCA AGATCGTTAG CATGCCTTAC AAAAACTGGA GGAAAGACAT
TATACGGGAA CCAATTCCTC GACCATCAGC TAAAGCTCTT TTCGACGGAT CAAATGTAAC
GGGCGACGTA TCGTGGCTCC TAAATCTGGC TATTATAGGA TTCGGCAAAT GTGGCACCTC
GTCTATGGTA GGCCATTTCA GCCAACACAA ACAAATATCC ATGATGCACA GAGAGCACTG
CGAGTTTACA TGGCGTGACG ACGACACCAT TCTTCTTAAA GCCCTGGAGT CGGAGCTTCC
ACATGGCAAC TATATGCGTG GGCTGAAGTG CCCCAGTCTC GTGAGAAGCC CTTTGGGAAT
GCAACGGTTA GCCAAGTATT TTCCGAACGT TCGACTCATC GTTGGAGTTC GGCATCCAAT
CCTTTGGTAA GTCATTGTTG ATGGAACAAA TTGATTCTTA CTTGTCAGCT AACCTGGGTT
TTGGTATACC TGGACAGGTT CGAATCTTTG TACAACTGCA AGTATATTTT TTTCAGTGTC
TAGTAAACTG TCTTTCCCAA TCGCTGTACC TTTCTGCTCA CCATCAACAA TACTTGTTAC
TCTACAGTTC GCCAGAGACA GTTTGGATAC AGCCTCTTCC CAGCCCACAA ATTGATTGGC
AAATGCCAGG ATTTGGGACC CAATAAAAAG GTGCACGGTG TATGTACAGA AGAAGCGAGA
TTTCAAGAAG CATTAATTGG CTTGGGAAAG ACTAGCATGA GTACAACGGA CGAAATGCAG
TACTTTTTAT CCTCTGAAAA GAAACCTGAG AATCTTACCG TTATCTCTCA AATGAAAGTA
TTTGTCTACG ACATAGCGCA GGTAGAGGAT AAAGACGAGG AACGCTCCCA ACTGTTTATG
GACGACATGC AAACATTCCT CCAGATGACG GAGCCTTTCA AGCCGATGGG TCAAAAAAGC
GGCGGGAAAA CGAAGCAGTC GCGCATTGAC ATTTGCGAGC AAAAGTACGA CCATTTGCGC
GAGGCTCTCT TGGACATTGG GGTGAATGCA TCGAGGTGGA TCCGTCGCTT TTTCGTACCT
GCCGAGGGTG TGACCGTCTC ATCACCCAAA TTTTTCGAGC AGTCGTTGGC GAAGTGGGAA
ATCGATCCGT GCGAAGAACG CCGAGCAAAT AACACATTCC CTCCCAAATG ATTTGATTGT
GTCATTATCC CGACAATTTG GTTGTAAACT CTAGGCTGCT TGTCAAAGCA ACATGGAAAA
TAGCCAAGAA GCCATTGTCG ACAAATTGGG ATGCCCAGGA TCACGCACCT ATGCAGTGCA
CAGATAAGGT TTCTGAAAGG GGGAAAACGG ACAGCACCAT CACTCTCGAC ATTCGTGTGT
ATGGGTTGGT ATTTACAGTT CAATAATTTA CACCGCAATC TTAAAGTTAA TTTTTTT
 
Protein sequence
MMMSLRSLVS VIAIAFTLGI GMYNGVHDMS HVVETLSSIE KELNPLQPMT GSEQSKIVSM 
PYKNWRKDII REPIPRPSAK ALFDGSNVTG DVSWLLNLAI IGFGKCGTSS MVGHFSQHKQ
ISMMHREHCE FTWRDDDTIL LKALESELPH GNYMRGLKCP SLPSIFRTFD SSLEFGIQSF
VRQRQFGYSL FPAHKLIGKC QDLGPNKKVH GVCTEEARFQ EALIGLGKTS MSTTDEMQYF
LSSEKKPENL TVISQMKVFV YDIAQVEDKD EERSQLFMDD MQTFLQMTEP FKPMGQKSGG
KTKQSRIDIC EQKYDHLREA LLDIGVNASR WIRRFFVPAE GVTVSSPKFF EQSLAKWEID
PCEERRANNT FPPK