Gene PHATRDRAFT_47095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47095 
Symbol 
ID7202011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp386582 
End bp388144 
Gene Length1563 bp 
Protein Length520 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181369 
Protein GI219122054 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAC GCAGAAAAAT GGAAACGGTG GCATCCACGA CCATAGACGA AGCGACGTCC 
TCACTGGATC TTCGCATTCT AGAGCTGATA CAAGCCACTC AGGATCGCAC CATACGACCT
GCCAAAGTCT CCACAGAATT GGGTATTTCT ATCAACGAGG CCACCGCGGA GCTTTGCGGC
CTATTGGCTG CTGTTGGCGG TGGAATCGAT GGCGCCGCTT TTCGATTTGA ACAAGCCGAT
GGAAATCCGG TGATGGTCTT TACATTTCCC GAGGACTTTC GTGCCCGAGC TCTACGGAAG
CGGCGTCGAC AAGACTTTCA CGAAACAATG CAAACTTTTC TTGACATCGT TGGAAAGATA
CTGAAAACGG TGACCGCTTT TGGTTTGATA CTCTCACTTC TGATTGTTTC TCTTGCTGCC
ATGATGGGGC TTGTCGCAGC GGTAATCGGT TTGTCTCGAA CTGGCAATCA AGGGCATCGA
AATATAGTTG TACGCCAACT CCGTTCTATG TTTTATACCA CGCGGCAGTT GCTATGGTGT
TATGCTGTCT TCGGTCCTGC AGGCGATGAT GGACAAGATC TGTTCCTAAG GGAGGTTGCC
TATGATACAT CTCTCGCGTG TTCCGTCTGT TATGGGAATC CGGCCAGCTT TTTTTACTGG
ATCCGTGCTC AACAGCTGGC AAGACGAAGT CGAGTTCGTG GGTGGACCGC GTTTGCTCGT
TCACAGGGAA CACTCACCAA TAACGAAGGA GCCGCGCTTC TTCGACCACG TTTGAGACCT
AACAACCTTC CAACCGATAC TCCAGAAATC TCACAACGGA CGCTACTACC CGTCGCTGTT
GAATTTTTGT TTGGGCCACC TTCGTATCTA AACGAGGAGA CCGAGAAATG GAAGCTACGA
GCACATGCTT TGATCCAAAA GTCAATGATA AGTAGCAGTA GAGGCGTCTC TTTGGAAGAA
ATGAGTCCTT ACGTAGATCA TCCACCAGCA ACGCTGAGTG AATCGTCGAA GATCGTAGAA
CAGGGTTTAA TATTGGTTGC TTACTTCAAT GGAGTTCCAA TGAAAAATTG CACGGACCAA
CCGGCCAAGG CCCTGTTTAA TTTCCCAGAA TTGCTATCCG AAAGCAGCTC GATAACTAAA
TTTGATTCTT CACCGGTGTA TGACGATGAT GGAAGTTGGA GCTCTGTTTT GTACGCCAAG
GAAGCAGGCA GCGGACGCTT ACGAAGCTCA TTTGATGTTG CAGAGAGCAT AGAAGAACCT
CCTTTACGAT TCACTCAACT TCCAAAGAAA GACTTCGTGA GGTGCATCGG TCTAGGGCTC
CTAAACTTGA TTGGAGTGTT ATGGTTAGGA CAGTCAATTG GAGTGGGAGG TGCGCTGGAG
TTGAAGAGCG GTATTTTGCT GGAACGGTTT TTGCGTAGAT GGGTGGTCCC TATTCTTCAG
TTCTATGGAT TTCTGTTCTT GGGCCTACCT GCCGGTCGGC TTTGCATCGT TATTCTGCGA
AACAAGCATC GATATGTGCG CAACAGGAAG CGCCGCTCTC TAGTTAAAGA GCTTACAGCG
TAA
 
Protein sequence
MTQRRKMETV ASTTIDEATS SLDLRILELI QATQDRTIRP AKVSTELGIS INEATAELCG 
LLAAVGGGID GAAFRFEQAD GNPVMVFTFP EDFRARALRK RRRQDFHETM QTFLDIVGKI
LKTVTAFGLI LSLLIVSLAA MMGLVAAVIG LSRTGNQGHR NIVVRQLRSM FYTTRQLLWC
YAVFGPAGDD GQDLFLREVA YDTSLACSVC YGNPASFFYW IRAQQLARRS RVRGWTAFAR
SQGTLTNNEG AALLRPRLRP NNLPTDTPEI SQRTLLPVAV EFLFGPPSYL NEETEKWKLR
AHALIQKSMI SSSRGVSLEE MSPYVDHPPA TLSESSKIVE QGLILVAYFN GVPMKNCTDQ
PAKALFNFPE LLSESSSITK FDSSPVYDDD GSWSSVLYAK EAGSGRLRSS FDVAESIEEP
PLRFTQLPKK DFVRCIGLGL LNLIGVLWLG QSIGVGGALE LKSGILLERF LRRWVVPILQ
FYGFLFLGLP AGRLCIVILR NKHRYVRNRK RRSLVKELTA