Gene PHATRDRAFT_39117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39117 
Symbol 
ID7194877 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp370582 
End bp372582 
Gene Length2001 bp 
Protein Length587 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183083 
Protein GI219125639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.294419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTC CTTGTTTGCT CCTTGTTGTT TCGGCTGTAT TGTCTCTAAG CAAGGCGGAA 
GAACAACAGA AAAGTAACGG TACAAGGCGG ACCCGTCAGT TAAAACTCAG CGAAGCAGAA
TCATTTCTTC ATGCTGAACA GTTACTGTGG TCGGGATCGG GACGACATAC CCAAGAGGAT
GCTTTTCTCA GCTTTGAAAC CTCTTCCCCG ACTCCAATCA CTTCAGTCTC GTTTACTGAT
TTTCCATCCA TAGCTCCGGT CGCTAATTCT GCGCCGACTG TTTCATCATC TAAATATCCC
ACTCAGGTTC CGATCCGATC GACTGATCCA TATCCCGGAA CGTCTGCCAC AGAGCAACCG
ACAAACTTTA TCTGCGACGG TTTTAATCGT TCAGATATTC TTCTCGGACT TTTGCAATCA
GTAACTTCAG AGGATCTTTT GCTAAACAGA TCCCTGCCTC AAGGCCAAGC CTTTTTGTGG
CTACTCGACA CTGATATTAC AACAGACCCT TGCATATACC CATCTGTTAA GCAACGATAC
TCTGTTGCAG TGATCTATTT TGCATTAGGA GGCAACAATG GGGCGAGTTG GATTGAAAGT
ACGGGCTGGT TGTCGTCCAT GGAGGAATGC GACTGGGCAC GCGTGAACTG TGACGAAAAA
GGCGGAGTCA CTGGCCTCCA ACTAGGTAGG TACTCAAACT AGAAGGAATG CCGTCTTAGC
AGATTCAAAC TGATGCTTGG AAACTGTACT GAACAGGTAG AAACAACCTC ACGGGGATGA
TTCCGAAAGA AATTTCAGAA TTGACTGCGT TGCAGGCGTT AGTTATGAAC GACAACACCA
TAGAGGGTCC GCTTCCAGAG ACCATTGGCA GTTTACGTAA TCTGACAGAT CTAGACCTGG
AAGACAATTT TCTAGAAGGT AACCCTTTGC TCACGCTATC TTCACTCAAG AAGCTACGCA
GTCTTCGATT ATCATTCAAC TCTTTCGATG GTTCGGTACC TTCATCAATT GGTGGCTGGA
ATGAGCTGCA GGAGATGTGG ATGGCTGGCA ACTTTTTCAG AGGAGAACTT CCTACGGAAA
TCGGCTTGAT TGGGAATCAA TTATGTAGGT CGTTGCTTCG TCCGGTTTTA AAAATTATCC
TTCCCTTGGA AACCTCACAT TTTGTCTTGA TGTCGATTTA GCATCCCTCT TTATTTACGA
AAACGAATTG GAGGGTACGC TCCCGTCCGA GCTCGGCAAC TTGGGACTTT CTGAATTCCT
GGGTCAGGCA AATATGTTCG AAGGCAGTAT TCCCCAGGAG CTGTTTAGAA ACTTTGATCT
TGTGGTACTA CGACTTGACC AAAACAAGCT TACGGGAACT GTGAGCAACG CGATTGGTGG
TTTGAACAAC CTCCAAGATC TACGTTTAAA CCTTAACTCA TTGTCTGGAA ATCTTCCAAT
ATTGCTCTAT GGGCTGAGCA ATATACGTAA GTTCTTCCAG GCCGCCGTCA AAGGCGGCCC
GGATTATTCC AAATCTTCTC TCCTTTTGTA ATCCTCACTC TCCTACCTAC CCCAGAAAAT
CTGCTTTTGA GCAACAATCG CTTCGATGGA CAGATTCGCA ATGCGTTCGG AAACTGGAAT
GCTTTGGATT TCGCAGATTT TGCACAGAAT AGGTTCACAG GGTTTATCCC ACCAAGCTTA
TTCGAAGCCG AATCGCTTCG AATTCTATAC CTTAACAACA ACCTGCTGCA AGGACCAATC
CCCTTAAATT TCGGCAAACC TCGAAAGCTC CGTGATCTTT ATTTGAACAG CAATATTTTG
ACCGGAGAGA TTCCTTCAAT CCCTACAGGA AGTTTGTTGA ACTTATCTGA GTTTCTTTTG
CAAGACAACC AACTCCAAGG CATCACAATG CCTCCTTCTG TTTGTTCTTT GATCGAGCAA
GATGGCGAAC TAGAGGATCT GTGGGCTGAC TGCCTTGATA CTGATGATGT TGATTGTCAA
TGTTGTACCC AATGCTTTTA A
 
Protein sequence
MRFPCLLLVV SAVLSLSKAE EQQKSNGTRR TRQLKLSEAE SFLHAEQLLW SGSGRHTQED 
AFLSFETSSP TPITSVSFTD FPSIAPVANS APTVSSSKYP TQVPIRSTDP YPGTSATEQP
TNFICDGFNR SDILLGLLQS VTSEDLLLNR SLPQGQAFLW LLDTDITTDP CIYPSVKQRY
SVAVIYFALG GNNGASWIES TGWLSSMEEC DWARVNCDEK GGVTGLQLGR NNLTGMIPKE
ISELTALQAL VMNDNTIEGP LPETIGSLRN LTDLDLEDNF LEGNPLLTLS SLKKLRSLRL
SFNSFDGSVP SSIGGWNELQ EMWMAGNFFR GELPTEIGLI GNQLSSLFIY ENELEGTLPS
ELGNLGLSEF LGQANMFEGS IPQELFRNFD LVVLRLDQNK LTGTVSNAIG GLNNLQDLRL
NLNSLSGNLP ILLYGLSNIQ NLLLSNNRFD GQIRNAFGNW NALDFADFAQ NRFTGFIPPS
LFEAESLRIL YLNNNLLQGP IPLNFGKPRK LRDLYLNSNI LTGEIPSIPT GSLLNLSEFL
LQDNQLQGIT MPPSVCSLIE QDGELEDLWA DCLDTDDVDC QCCTQCF