Gene PHATRDRAFT_50630 details


Gene Information

Locus tag: PHATRDRAFT_50630
Symbol:
ID: 7199474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011701
Strand:
Start bp: 18925
End bp: 21887
Gene Length: 2963 bp
Protein Length: 971 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002185607
Protein GI: 219130934
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0698106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACCATGGTA GTGGAAGCTA TCAAAACCAA ACCACGACGC CGTCGTGGTT CCAAATCGGA 
AAGTAGGCAA CACAGTCAAA GCGAGAGCGC CGCGGTGGAA AGTTCGTCAA CGTATCAGCC
GCACAGTACC TTGCTGATTC AACTGAACGA AGATGCCCCA ACCTGGTACC GCCTCGGCAA
CAAGCAGTAC GCGGAAGAGC GAGACGCTAC CACCGCATCC AATCCACCCG AAAAGGGACA
ACGAGTTAGC AAAGATCTCG TTCTAAAGTA TCGGGCCCTG GGAGACGCAA TCTATCGACG
GGAGGTGCAG CTCTTTGGCA AGGAATCAAA CACTTCGGAT GATCGATGGG TAGAGTCCAC
CATGAAGAAA GGTACACTCA AGGATCGCGT CGCTGCTATG AGTGTGGTCG TTAGCACTGA
TCCGGTACAT AAGTTCTACG TATTGGATGG ACTGTTGAAC ATGGCGGGCT GTTCCGATTC
GAATTCTTCG ACTCAAACAA ATTCTCGTGT GGCTCAGCTT GCTGCAGAAG CACTGGAAGA
CCTCTTCGTT AACACCTTTC TACCGAATCG GCGCAAACTG ATATCAATGG AACAACGTCC
CCTATACCTG TACGAGAGCG AAGGCGTAAA GAACACGACG AAAAAGACCC TGTCCCCGCG
GATTCTTCTT CTATGGCGCT TCGAAGAAAT GGTCAAGGCC AAGTATCATC TGTTTCTACG
TCAATACGTA TCCCTAATAC TTCGAGAAGG AACGGAGTTA CAAAAAATTC CAACTATTCG
TCTTGCTGCA GTTCTTTTGC GTTCCATTCC AGAAGGAGAA TCTACATTGT TACCAGTAAT
TGTCAATAAA CTTGGCGACC CAGCCAAAAA AGTTTCGGCT GGCGCTGCCT TTGAGCTACG
AAAACTTCTC CAACAGCATA CAGCTATGCA AGTGATTGTG GCGCGAGAAG TGCAACAGCT
TGCTCACCGA CCACATCTAT CATCACGGGC TTTATATAAT TGTATCACCT TTTTGAACCA
GCTGAAATTA AAACGGGAAG AGACTCAAGG GGGAGCCGAC GAAGCGACCG AGCCGTCACT
ACCTGCGTCT CTGATCAGCA CGTACTTTCG TTTGTTCGAA CTTGCCGTAC AAAAACCTAA
AAAAAAGGAA ACAGCGAATG AGGAAGAGGC AGGAATGAAA TCTCGACTTT TATCGGCACT
GCTGACAGGC GTGAATCGAG CCCACCCCTA TTTGCCTCAT CATGATTCAA CAATGGAACA
GCACATCGAT GCCCTCTACC GGGTCGTGCA TACAGCACCG GCCGCTGCCA CGACACAGGC
GCTACTACTG CTTTTTCACT TGTCCGTTGG TGCGGAATTC GAACAGGACC AGCGACAAAC
GATTTCGCGC AAATTACGCC CTGAAGAGCA TGCCCGCCGT GATCGCTTCT ATCGAGCCTT
GTATTCAACG CTGGCGCAGC CCTCCCTTTT GGGTACCGGC AAACACCTTA CAATGTTCTT
CAATCTTCTT TACAAAGCCA TGAAATATGA CAATGACCAA ACTCGCGTGG TTGCATTCGC
GAAACGTATT TTATGTACAA CCATTCACTG TTCTTCATCG GTAGTTGCTG GTTCTCTTTT
TCTGCTTAAC GAAATAACGA AACACCACGG GAACCTACTA TCCTGCTTTC AAGATGTCTT
GGAAGGATCC GACGCTTTTC GTGTTTTGGA TCCAACCAAA CGAGAACCTC GCGGTGCTTT
AGTCTTATCG GAGTATGTCG ATGCACCTGA AATAGCTTCC GAAGAAAACG AACAATCAAT
CGAGAAAGCT ATAACGAAAG CTCCGCTGTG GGAATTGACG TTGTTGTTGA AACATTTTCA
CCCATCGGTC TCAAGGTTCG CCAGTGCAAT TGGGAACATT GACTACAGTG GCGACCCTTT
GCGCGATTTT GGTGTGGGAC CGTTCCTGGA TAAGTTCGCT TACCGTAATC CAAAGTCGAT
TGATCGGGTA GCTGGCAAAT TTCAACGCGG TGAGAGCGTG GCAGAGCGAA AGAGTGGTAC
TGGGCTTTTG GTAGAGTCAC AGGTTGCGCT ACCTCTGAAT GACCCAAGCT TTTTAAGCAA
TCCCAACGTC GACGCACCTG ATGACTTTTT CCACAAATTC TTTTTGGAGC AAGCTCGGCG
TGACAAACTC AAAGGCATTG TCCGGCATAA ACCAAAAGTT GATGCTGTAG AACATTTGGA
GGAAGATGCT TTTGACGAGG CCGAAGTAGC AACTTTGGAC GTACAAAAAT TTGATGATTT
GGAGCAAGGC TGGGAAACCG ATGATGATGA GGAGGCCTAT GTCGACGCTT TGGCACAAAA
AATTATCGAA GACTCGATTA ACGAAAACGG GCCCGCGGAT CTTGACGAGG AAGATCCAGA
TATGGAAGGT TGGGGGGATA TGTATAGTGA TGAAGAACTA GAGGATGAGA GCGACGACGA
AAGTGAGTCG TCACAGAAGG GCAAAGCCCT GACCAGAAAC GACACCATTG TAGGCGACGG
TATCAAAGAT CTTGACAGCG AAGAAAATGA TGTCGATGCG TTCATGGATA TTGACGGAGC
CGATAGTAGC GTAAGTGACG ATGACGAGCT GTTCATGGAC GAGCAGTTGG TGTTGATGAG
TGTCGACTTG GATGGTTTGG ATTCGTCGGA CGACCACATC TCCGACAGCG GTGTGGATGG
CAACTTGACG TTGATAAATG AAGAGAAATC TAATGACGGC GAAGTTGTCG AGGAAGGAGC
CTCGTTGAAC CATACAGTAG AGGGATTAGC CACCTTTGTA GATGCTGATG AATATGAAGC
AATGATAACG AAATCCTGGA ACGAGAAAAA GCGCTCGAGA AAACGAATCG ATGGAAAAAG
TGACCATGCA AGAGCGACTT CACCAAAGAG AAGGATGTAA ATATTGCATA GCTTACAACA
GCTAAATCAC CACCGTGCAG TAC
 
Protein sequence
MVVEAIKTKP RRRRGSKSES RQHSQSESAA VESSSTYQPH STLLIQLNED APTWYRLGNK 
QYAEERDATT ASNPPEKGQR VSKDLVLKYR ALGDAIYRRE VQLFGKESNT SDDRWVESTM
KKGTLKDRVA AMSVVVSTDP VHKFYVLDGL LNMAGCSDSN SSTQTNSRVA QLAAEALEDL
FVNTFLPNRR KLISMEQRPL YLYESEGVKN TTKKTLSPRI LLLWRFEEMV KAKYHLFLRQ
YVSLILREGT ELQKIPTIRL AAVLLRSIPE GESTLLPVIV NKLGDPAKKV SAGAAFELRK
LLQQHTAMQV IVAREVQQLA HRPHLSSRAL YNCITFLNQL KLKREETQGG ADEATEPSLP
ASLISTYFRL FELAVQKPKK KETANEEEAG MKSRLLSALL TGVNRAHPYL PHHDSTMEQH
IDALYRVVHT APAAATTQAL LLLFHLSVGA EFEQDQRQTI SRKLRPEEHA RRDRFYRALY
STLAQPSLLG TGKHLTMFFN LLYKAMKYDN DQTRVVAFAK RILCTTIHCS SSVVAGSLFL
LNEITKHHGN LLSCFQDVLE GSDAFRVLDP TKREPRGALV LSEYVDAPEI ASEENEQSIE
KAITKAPLWE LTLLLKHFHP SVSRFASAIG NIDYSGDPLR DFGVGPFLDK FAYRNPKSID
RVAGKFQRGE SVAERKSGTG LLVESQVALP LNDPSFLSNP NVDAPDDFFH KFFLEQARRD
KLKGIVRHKP KVDAVEHLEE DAFDEAEVAT LDVQKFDDLE QGWETDDDEE AYVDALAQKI
IEDSINENGP ADLDEEDPDM EGWGDMYSDE ELEDESDDES ESSQKGKALT RNDTIVGDGI
KDLDSEENDV DAFMDIDGAD SSVSDDDELF MDEQLVLMSV DLDGLDSSDD HISDSGVDGN
LTLINEEKSN DGEVVEEGAS LNHTVEGLAT FVDADEYEAM ITKSWNEKKR SRKRIDGKSD
HARATSPKRR M
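
Both sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal Python sketch for that, assuming the sequence text has been copied into a string. The function names (join_blocks, gc_percent, span_length) are hypothetical and not part of this record or of any library. The 1-based inclusive coordinate convention is also an assumption, although it agrees with the listed values (21887 - 18925 + 1 = 2963 bp).

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


if __name__ == "__main__":
    # First three blocks of the gene sequence, used here only as a short sample.
    sample = "CACCATGGTA GTGGAAGCTA\nTCAAAACCAA"
    dna = join_blocks(sample)
    print(len(dna), round(gc_percent(dna), 1))
    print(span_length(18925, 21887))  # 2963, matching the listed gene length
```

Running join_blocks on the full gene sequence and comparing len() of the result with the listed gene length is a quick check that nothing was lost when copying.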