Gene PHATRDRAFT_39274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39274 
Symbol 
ID7195021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp94823 
End bp96934 
Gene Length2112 bp 
Protein Length703 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183419 
Protein GI219126343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.070802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATACG ATTTGCTCGT CTTTCTCGCA GCCTCCGTCG TCGTCACCCC CCTCGCCAAA 
TCTTTGAACG TCACACCTAT TTTGGGATAC CTGTTGGCGG GGGCAATCCT GGGACCCAAC
GGACTAGACC TCTTTGCCAA TTCCAAAGCC GACATTGAAC TCGGGGACGT CGGCATTTTG
TTTTTACTTT TCAGCGAAGG CCTCGAAGTC ACGCAACCGC GCCTGAAAAA ACTCACCAAT
TACCTCCCAC TCGGAATTGC GCAAATCTCA CTCGTCACGG CCGTCATTAC CGGCGTCTTC
ATGTCCGATT TACCCGAATG GACGGCGACC GTCTTTGCAA TCGATCCCAC GCTACTCGGG
ATTCCCCCGA CCGAAGCCGT CGTACTGGCA CTCATTGGTT GCTTGTCCAC ATCGGCTTTC
ATTTTCCCCG TCCTGAAGGA ACGCTCCTGG GAAGACGAAG AATCCGGCGA AGCCGCCACG
TCAATTTTGC TGCTGCAAGA CCTTTTGGTC GCACCACTCT TGGTCCTTTT GCCATACTTG
GCCGGCGATA CCGCCACCGA TTACCTCGCC ATTGGATTTC TCACGGCCAA GGCAACGCTG
GGCTTTGGTG CCGTACTTTA CGTGGGTAGT TTAGTCCTAC AGAATGTCTT TCGTGTGGTC
TCCACCGCAC AATCCAGTGA AACCTTTGTC GCGCTCTGTT TACTGGTTAG TGCCGGTATG
GGTGCCATTG CCAAATACTT TGGTCTGACC GACACCGCCG GGGCTTTTGC CGCCGGTGTT
TTGCTCGCCA ACACCAATTA CCGCGCCCAA ATTCAGGCGG ACATTCTGCC CTTCAAGGGT
ATCTTATTGG GAATTTTCTT CATGGGCGCG GGTTCCAATT TTGACGTCGA ATTGGTCGTT
CGCGAGTGGC CCACTATTGC GATTGGTTGT GTCACTCTGT TGGCGCTCAA GGCGTTTACC
TTGGGTGCCG CCACACGAGT TCCGCACTGG ATGGAACCCA ACCGGCTACC CACCGCGGAT
GCCATCCGGG TGAGTCTCTT GCTGGCCGGC GGTGGCGAAT TCGCCTTTGT CGTGCTGGCT
CTGGCAGAAT CACTCGATAT CATCCCGGCC AGTTTATCCG CCATTGTTAC GGCAATTGTG
TTAATTACTA TGGGATTGAC TCCTTTGCTC GGAGATCTGG CGGTAATATT GTCCGAGCCG
CTGCTACCGT ACAAGGAAGA AGAGAAGTTA ATGGCCCTTA ACGGGAGTAA CGGTGCGGTT
GTCCCGGAAC GCGAAATTCC ACACGTGGCG AAGGAGGCGG TGGTGATTTG TGGCTATGGC
GAAGTCGGAC AGAATACCGT CCGCGTGCTG GGCAAACAAA AGGAAAAGGC CGGCATGATC
AAATCCAGCT ACCTGAAAGA TGAAGTTCCC AGCGTCGTGG CCTTTGACGT CGATCCTTCC
CTATCGGATA CTGCGCTACG ACCGTCCCGC AATACCGCCG TCTTATTCGG TAACGCCGCC
AACCCCGAAG TAATCCGTTC CAGCGGCATT TCTCAGCCTT CCGCCATCTA CGTCACGTAC
GAAGACTTTG GTCGTGTCCA ATCCGCCACA GCGCGTTTGC GGGCCGCCTT TGCCACCGTC
CCCATTTTTG CCCGCGCCGC TACCCGGGCC GAAGTAAGTG CCTTGGAAGA GGCCGGGGCA
ACACAAGTCG TGGTAGAGTC CGATGAATTG CCCCGATCCG CCTTTGCTCT GCTGGAAGGT
GTATGGCAGG GCAACTTGCC CGGCAAGATT TTCAATTCTC CGGAACATTT TCGGGCAGAA
GCCGCAGAAG CGGCTGGTAT CTCCACCGGT GAAGTCGAGG ACCTGCTGGA AATGTACCAA
GCCATGGATC AGGATAAGAC CGGAACTGTC TGTCCAACCG AGATTGAAGC ATTGTTGGAA
CGTTCCGGAA CTTGGATTGC CTCGGACGAC GAAATTCGAC AGCTGGATTC TTGGATTGAA
AGTACCTTGG CCGGAAAGGA TCCAGTGGAT GCTCTGGAAT GGTGTCGACT GTACGGACGC
GCCCCCGATT TTGTAAAACG AGCATTCGGT GGTGATCTAG CCCGCAAGCG GAAACAAAGA
GAAGAAGCAT AG
 
Protein sequence
MGYDLLVFLA ASVVVTPLAK SLNVTPILGY LLAGAILGPN GLDLFANSKA DIELGDVGIL 
FLLFSEGLEV TQPRLKKLTN YLPLGIAQIS LVTAVITGVF MSDLPEWTAT VFAIDPTLLG
IPPTEAVVLA LIGCLSTSAF IFPVLKERSW EDEESGEAAT SILLLQDLLV APLLVLLPYL
AGDTATDYLA IGFLTAKATL GFGAVLYVGS LVLQNVFRVV STAQSSETFV ALCLLVSAGM
GAIAKYFGLT DTAGAFAAGV LLANTNYRAQ IQADILPFKG ILLGIFFMGA GSNFDVELVV
REWPTIAIGC VTLLALKAFT LGAATRVPHW MEPNRLPTAD AIRVSLLLAG GGEFAFVVLA
LAESLDIIPA SLSAIVTAIV LITMGLTPLL GDLAVILSEP LLPYKEEEKL MALNGSNGAV
VPEREIPHVA KEAVVICGYG EVGQNTVRVL GKQKEKAGMI KSSYLKDEVP SVVAFDVDPS
LSDTALRPSR NTAVLFGNAA NPEVIRSSGI SQPSAIYVTY EDFGRVQSAT ARLRAAFATV
PIFARAATRA EVSALEEAGA TQVVVESDEL PRSAFALLEG VWQGNLPGKI FNSPEHFRAE
AAEAAGISTG EVEDLLEMYQ AMDQDKTGTV CPTEIEALLE RSGTWIASDD EIRQLDSWIE
STLAGKDPVD ALEWCRLYGR APDFVKRAFG GDLARKRKQR EEA