Gene PHATRDRAFT_50806 details


Gene Information

Locus tag: PHATRDRAFT_50806
Symbol:
ID: 7197801
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 903075
End bp: 905137
Gene Length: 2063 bp
Protein Length: 588 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002178326
Protein GI: 219115061
COG category:
COG ID:
TIGRFAM ID:
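
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes 1-based, inclusive coordinates, which the record does not state explicitly; the function name `span_length` is hypothetical. It also compares the span with the coding length implied by the 588 aa protein, assuming one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(903075, 905137)
print(gene_length)  # 2063, matching the Gene Length field

# Coding nucleotides implied by the protein length, assuming one stop codon.
coding_nt = (588 + 1) * 3
print(coding_nt)                # 1767
print(gene_length - coding_nt)  # 296 bp not accounted for by the coding sequence
```

The 296 bp difference is consistent with the record flagging the gene as spliced, although the record does not say how much of it is intron and how much is untranslated sequence.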


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCAATTCCT GGAACAATAT TCCAATTTTC AGGAGAGAGA AAGAACTCTT GACCGCAACA 
ATGAGGGCGC AGGAGCGGAG TTTAAACTCG TATAAGTCAA ATAATTGAGA GCAGCGCGAT
GAAGAAAATT TCTGTTATTC ACTGCTGGTC TGCTCCGCGA AGTCGATCTA CGGCGCTAAT
GTATAGCTTC GAGGCTCGAG GATCTGACTG CGCTGCGCTG GATGAACCGC TGTACCGCGA
ATGTCTGATT CAACGCGGCG ATGCGGTTGC CCGGCCATAC CGTAACGAAC TCATTAACGG
CACACCTCCA TCTGGGAGCA TAACAGATCA AGTAGATGTG TGGGTGCGGG AACTTTCTAG
CTTGGAGGAG CGCATTCGAT TGTGCGCACA GCGTTTGCCT GAAAACGGTG TCGTCTTTTG
TAAACACATG GCGAAACATT CGTTTCTTTA CGACTTTCAG GAGGAATTTT TCGCGGATGA
CCTGAACATT AAGCTCATTC ACAAGCATCT CTTCTTAATT CGCGATCCCG TGGCAGTCCT
ATCGTCTTGG GGAGCGTCGG ACTCAGTGCA TGGAAGCAGC GCCACTCCTG ACGAAGTTGG
GATTGTTCCC ATGCTCTCCA TCTTTTCGGC TCTCTGCAGC CGTCCCCACA GAATACGCTC
TATTGTTTCG TTCCTTGATT CGGACGAACT AGTCAAAGAT CCGGAGAGAA CTCTGGGATC
AGTTTGCGAA GACCTGGGTA TCCCGTACAA AGAATCAATG ATGTCGTGGC CAAGCGGTCC
TCATGCTTGC GATGGTACGT TTGAGGTCGC CGGTCAAGAT TTGATAGATA AACAAAAAAC
CATCCGGACT GCCATCTCAC TTTGTTAAAA TTTTAGGACC CTGGGCTTCC TGGTGGTATA
GCGACGTGCA TCAATCGACA GGATGGAAAC GGAAAACACC CGACTATGGG GCTAGTCGGT
ACCGAATTCT GAATCCCGAT TTAATGGATG CGCTAAAGGT CTCCTATCCT GCCTACGAGT
TCTTGAGTAA ACTTACTAGG GGATATCAAA AGCGGGGTCC CTCAACCAAA ACACTGTATG
AAGACCCAAG AAACGAGCAT TTACTGACTT ACATCGGCGC CCCAGGTCGA GGCAGAATAA
TTCCGAGATC CATGGCTGGT GTGAGTCCGT GGGATTCATC AGTACAAGGC GGAGACGCTG
CATGGGAAGG ACTCCGAGTA TACCGTGGAA AAGTGCTTTC GTTGGATAAA CACCTTCAGC
GTCTTTTTAA ATCGTCGAAA GCGCTAGGTT TTGAGAACGT GCACACCAAG GCAGAAGTAG
TGGAAGCCAT TTTCCGCACG CTGGCAGCGA ACGGTATGCG GGACGGGGCA CACATGCGTC
TTACATTGAC ACGAGGTGAA AAGTGTACCA GCAGTATGAA TCCAAAGTTT AATGTATACG
GAACGACGTT GATCATTCTA GCCGAATGGA AGCCCACGGA GGGTGCCACG ACCTACAACA
ATACTTCCGG TATTGCGTTG ATTTCTGCGT CTCAACGGCG AAACTCACCT CAGACGGTGG
ACTCCAAAAT CCACCACAAC AATTTGATCA ACAATATTCT GCCAAAGATT CAAGCAAACT
TGGCGGGATG CGATGATGCA ATTATGCTCG ATCTCGAGGG TTTTGTATCG GAGACAAACG
CTACCAACAT TTTCATGGTT GATAATGGAG TGCTGTTGAC GCCGCATGCT GATCATTGTC
TGCCAGGGAT CACTCGAGCT ACCGTTTTGG AACTGGCGAA AGAAATCAAT ATACCTACCG
AAACTCGTCG GATTTCTCTT GCCGAATTCC ACGCCGCGGA TGAGGTCTTT ACTACGGGAA
CCATGGGCGA ACTGACTCCG GTTCGCATGA TCGACGGTCG GGTCATTGGT ATCGAAGGAA
AGCGGGGTCC GATTACTGCC AAACTACAAA AAGTCTATCA GAGTTTGCCG GAACGTTCTG
GTTGGGCTAC GGAGATTCCG CCTTTTGAAG CCTAAGTTTT TGGCAACGGA ATAACTACGA
GTATAAAGGA TGAATTTATT GCG
 
Protein sequence
MKKISVIHCW SAPRSRSTAL MYSFEARGSD CAALDEPLYR ECLIQRGDAV ARPYRNELIN 
GTPPSGSITD QVDVWRLPEN GVVFCKHMAK HSFLYDFQEE FFADDLNIKL IHKHLFLIRD
PVAVLSSWGA SDSVHGSSAT PDEVGIVPML SIFSALCSRP HRIRSIVSFL DSDELVKDPE
RTLGSVCEDL GIPYKESMMS WPSGPHACDG PWASWWYSDV HQSTGWKRKT PDYGASRYRI
LNPDLMDALK VSYPAYEFLS KLTRGYQKRG PSTKTLYEDP RNEHLLTYIG APGRGRIIPR
SMAGVSPWDS SVQGGDAAWE GLRVYRGKVL SLDKHLQRLF KSSKALGFEN VHTKAEVVEA
IFRTLAANGM RDGAHMRLTL TRGEKCTSSM NPKFNVYGTT LIILAEWKPT EGATTYNNTS
GIALISASQR RNSPQTVDSK IHHNNLINNI LPKIQANLAG CDDAIMLDLE GFVSETNATN
IFMVDNGVLL TPHADHCLPG ITRATVLELA KEINIPTETR RISLAEFHAA DEVFTTGTMG
ELTPVRMIDG RVIGIEGKRG PITAKLQKVY QSLPERSGWA TEIPPFEA
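
The sequences above are laid out in space-separated blocks of ten residues, six blocks per line. Before checking their lengths or the GC content field, that layout has to be stripped. The following is a minimal sketch of one way to do it; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is just the first three blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, split across two lines on purpose.
example = "CGCAATTCCT GGAACAATAT\nTCCAATTTTC"
print(len(clean_sequence(example)))  # 30
print(round(gc_fraction(example), 3))
```

Applying `clean_sequence` to the full gene and protein sequences and taking `len` of each result gives values that can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the gene sequence can be compared with the GC content field.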