Gene PHATRDRAFT_42245 details

Gene Information

Locus tag: PHATRDRAFT_42245
Symbol:
ID: 7195096
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 218397
End bp: 220210
Gene Length: 1814 bp
Protein Length: 575 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002183320
Protein GI: 219126136
COG category:
COG ID:
TIGRFAM ID:
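The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (an assumption, though it matches the listed gene length) and that the coding region ends in one stop codon. The function names are hypothetical and used only for illustration.

```python
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
def span_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1


# Assumption: the coding sequence carries exactly one trailing stop codon.
def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(218397, 220210)   # values from the record
cds_len = coding_length(575)             # Protein Length from the record
non_coding = gene_len - cds_len          # bases in the span not used for coding

print(gene_len, cds_len, non_coding)     # 1814 1728 86
```

Under these assumptions, the 86 bp left over within the gene span would be consistent with the record's "Is gene spliced: Yes" entry, since intronic sequence is included in the gene span but not in the translated protein.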


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTTTG ATGTCCTCAT TGTGGGTGGA GGGCCAGCAG GATTGGCCGC AGCGATACGT 
CTCAAGCAGC TGAGTCTAGA AAAGCAGAAA GATCTTTCAG TCTGTGTGAT TGACAAGGGA
AGGTACGCAT GCATAAACAT TCTAGCGCCG CTGCGATTAA AGAGTACATC CCTTTGTATA
CTCACTTTGT TCTGTGGGTC ACGTCTAGTG AAATTGGTGC CCACATTCTA TCTGGGAACG
TCTTTGACCC CAAAGCCATG CATGAACTCT TTCCCGACCA GGCAGATCCG TCTTCCGATA
CACACTGGAC GAAAGAACTG GAAGCGACCC AGAATTCCGT CGCCACGCCG GTGACTGACG
ACGAATTTCT AGTCCTGACC GAGACTGGGA GCACCAAAAT TCCGAACTTT TTGTTGCCAC
CCCAGCTCGA CAATCACGGT AATTACATTG TCTCCCTTAG TCAAATTTGT CGCTGGATGG
CGGGTAAAGC GGAAGAACTA GGTGTGGAAA TCTATCCCGG CTTTGCAGCC TCCGAAGTGT
TGGTGGACCA AGAAACCAAC GCTGTCAAGG GAATTGCTAC GCGAGATGTG GGCATCGCCA
AGAACGGAAC CCACAAACCC ACATTCGAAC GAGGAGTAGA ACTACACGCC CGACAAACTC
TCTTGGCGGA AGGGGCCCGC GGATCGTGCT CCGAATACGT CATGGAAGCC TTTGATCTGC
GCAGGGATTG TCAACCACAA ACGTACGGTC TGGGACTAAA GGAAGTATGG CAGGTTCCAC
CTGAAAGCTT CCAAAAAGGA TTAGTACAGC ATACACTTGG GTACCCTCTT CAGTCCGGCC
CTTTGGATAA AAATTTTGGT GGAAGCTTTT TGTATCACCA AGAACCAGAT TTGGTGTTGA
TTGGTTTGGT GGTTGGTCTC GACTACGCCA ACCCGTATTT GAATCCTTAT CAGGAGTTCC
AAAGATGGAA ATCCCATCCG GATATTCGTA AGCATTTGGA CGGTGGAACG TGTGTCTCGT
ATGGCGCCCG AGTTTTGAAT GAAGGCGGAT GGCACGCTGT TCCAAAACTC AGTTTTCCAG
GCGGGGCACT TTTGGGGTGT GGCGCGGGAT TTTTAAACGC AGTCAAAATC AAAGGTTCAC
ACACGGCTAT CAAATCTGGT ATTTTGGCGG CAGAAGCCGC CTTTGATGCA TTAAAAGATG
GAGACTCCGT AGCTGAAATT GGGGAATTAC CAGAGACTGG TCCTATTGAA TTGACGACGT
ACGAAACTGC AGTTAGATCG TCCTGGATTA AGGATGAGCT GTATCAAGTC CGAAACACTC
ACGAGGCATT TTCGCGCTGG GGTGTTGGTG GTGGGCTTAT CTACACCGGA TTGACAACTC
ACGTGTTGAA AGGCCAGGAA CCGTGGACAT TGAAACACTT GACAAAAGAC TGTGAAAAAA
CGGAGGCGGC GGCCAATCAT AAGCCCATCG AATATCCCGC ACCAGATGGA AAGCTAACGT
TTGATTTATT AACGAATCTA CAACGAGCTG GCACCTTCCA CGAAGACGAC CAACCAAGTC
ATCTCCGAAT TAAACCTGAG CAAGCCGAGA TTCCGAAAAA GACATCGCTA CAGGTATATG
CTGGTCCTGA ACAGCGCTTC TGTCCAGCGG CTGTGTATGA ATACGTCGAC GTCGTAAACA
CAAAAGGAAA AGAGCTGGTA ATCAATGCGC AGAACTGTAT TCATTGCAAA TGCTGTTCAA
TTAAAACGCC GAAAGAATAT ATTCGATGGT CTGTCCCAGA AGGGGGCGGA GGTCCGCAAT
ATCAGATTAT GTGA
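
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the record. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCCGTTTG ATGTCCTCAT TGTGGGTGGA GGGCCAGCAG GATTGGCCGC AGCGATACGT"
seq = clean_sequence(first_line)

print(len(seq))                              # 60 bases on a full line
print(f"GC content: {gc_fraction(seq):.0%}") # GC share of this line only
```

Passing the complete sequence text to `clean_sequence` should yield a string whose length can be compared with the listed Gene Length and whose `gc_fraction` can be compared with the listed GC content.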
 
Protein sequence
MPFDVLIVGG GPAGLAAAIR LKQLSLEKQK DLSVCVIDKG SEIGAHILSG NVFDPKAMHE 
LFPDQADPSS DTHWTKELEA TQNSVATPVT DDEFLVLTET GSTKIPNFLL PPQLDNHGNY
IVSLSQICRW MAGKAEELGV EIYPGFAASE VLVDQETNAV KGIATRDVGI AKNGTHKPTF
ERGVELHARQ TLLAEGARGS CSEYVMEAFD LRRDCQPQTY GLGLKEVWQV PPESFQKGLV
QHTLGYPLQS GPLDKNFGGS FLYHQEPDLV LIGLVVGLDY ANPYLNPYQE FQRWKSHPDI
RKHLDGGTCV SYGARVLNEG GWHAVPKLSF PGGALLGCGA GFLNAVKIKG SHTAIKSGIL
AAEAAFDALK DGDSVAEIGE LPETGPIELT TYETAVRSSW IKDELYQVRN THEAFSRWGV
GGGLIYTGLT THVLKGQEPW TLKHLTKDCE KTEAAANHKP IEYPAPDGKL TFDLLTNLQR
AGTFHEDDQP SHLRIKPEQA EIPKKTSLQV YAGPEQRFCP AAVYEYVDVV NTKGKELVIN
AQNCIHCKCC SIKTPKEYIR WSVPEGGGGP QYQIM