Gene PHATRDRAFT_50271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50271 
Symbol 
ID7199035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp155374 
End bp156653 
Gene Length1280 bp 
Protein Length421 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185219 
Protein GI219130117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0291461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTACG GTATTCGTTT CCGGAACAAA AGAAAAATTT GGGTTCAACC CTGGGCATCA 
AACCTACGCT ATCGAACGAT GCCATCCGTA AGACACCGAA CCGATGAAGC GATCGACCCC
AGAAAGACTT CGCCATTGCT GATTCTTGCT CTTGACATCG GTGTTGGCAG TGCTCGGTTA
TTGCTTTTTT TCAGCGTTGA AAGATTCTTT TTCGCAGCTA TGGTTGGGTT GCGGAAACTT
GACGCGTTCG TCAAGACGCG GCCCGAATTG CGATCACAAA GTGCCGTTGG TGGAATGATC
ACACTGGTGG CTGCGACGGT GTCAGCCTTT TTGTTTGTGG GTCAGATTAT TCATTACATT
ATTGGAAATC CGAAAGACTC TCTTCTGCTT TCCAAATCCG TATCAATTCC GCTCATTCCT
CTCACCAGCA ACTACCTGAC AACAAAGATT CTGGAACGAG CAGCCAAACT CCCTTTGGAT
ATGCTGATAA CTTTCCCCTA TTTACATTGC AGTCAGCTGG ATTTCAATCA CGATGGAGCA
TCGCTGGCAA CAAGCGAATT CCAAAAGCTA CATCCCAAAC ACTCTCTCAC GATGCGAACA
CCATTCCAGC ACGAATTATC AACAGCAAAG TTTGAAACCA AAAAGGGACA GGGTTGTACC
ATCGAGGGAC ACATCCGTGT ACCTGTGGTC GCAGGAAAGT TCGAGATTAC TCTCAACAAG
CGCACGTGGC AGCAAGCTGC CAGTATTCTG AATCGCCAAA TGTTGATGCA AGTTCTGGGT
GCCACATCCG AGCACACTTC ATCCAATGAC GAGCTCGGTG ACCGCTACAA CTCCACACAC
TTTATCCACT ATATTCGTTT CGGAGATTCC TTTCCACTCA ATATAGAGAA GCCCTTGGAG
AAACGACGTC ACATCTTCCG TAACAAGTAT GGCGCAATGG CGGTGCAAGA GATGAAGATC
GAGCTCGTAC CCACCTACAC GTCCACATGG TTGCCGACGT CCAGTCGACA AACCTACCAA
GCGTCCGTTG TAGATAGTAC GATAGAACCG GAGCACATGG CGCAAGCCGG TGCCTCTTCG
TTGCCTGGCC TTGCTGTCCA GTATGACTTC TCGCCGTTGA CAGTTTATCA TACCGGTGGT
CGTGACAACA TATTGGTGTT TTTGAGTTCA CTGGTGAGCA TTGTGGGTGG TGTCTTTGTT
ACCGTCGGCC TCGTGAGTGG CTGTTTGGTA CATTCGGCTC AGGCTGTAGC GAAGAAGATA
GACTAACGTT AACATATTTC
 
Protein sequence
MGYGIRFRNK RKIWVQPWAS NLRYRTMPSV RHRTDEAIDP RKTSPLLILA LDIGVGSARL 
LLFFSVERFF FAAMVGLRKL DAFVKTRPEL RSQSAVGGMI TLVAATVSAF LFVGQIIHYI
IGNPKDSLLL SKSVSIPLIP LTSNYLTTKI LERAAKLPLD MLITFPYLHC SQLDFNHDGA
SLATSEFQKL HPKHSLTMRT PFQHELSTAK FETKKGQGCT IEGHIRVPVV AGKFEITLNK
RTWQQAASIL NRQMLMQVLG ATSEHTSSND ELGDRYNSTH FIHYIRFGDS FPLNIEKPLE
KRRHIFRNKY GAMAVQEMKI ELVPTYTSTW LPTSSRQTYQ ASVVDSTIEP EHMAQAGASS
LPGLAVQYDF SPLTVYHTGG RDNILVFLSS LVSIVGGVFV TVGLVSGCLV HSAQAVAKKI
D