Gene PHATRDRAFT_50520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50520 
Symbol 
ID7199242 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp298480 
End bp300656 
Gene Length2177 bp 
Protein Length657 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185413 
Protein GI219130523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGAGTCTTT GTTGGTATCG TATCGTAGTG TACGTCCTTC CTTGCGAGTA CATGCCGGAT 
TACAGTAACA ACAACAACAA CAACAACAAC AACAACAACA ACAACAACAA CAACAATAGT
ACCATCAGCA TCAGTAACAA CGAGTCCGCT GCCGGCAGTA CCAATCCAGT CTCGACGCCC
CGCAAAGCGC CTCCCCACAC CACGACATTG TCTCGTGGAC GTCGTTCCGC ACGGGTGGTG
GAGCACGATC CCGTTGTTGC TCCCGGGTAC GGCACGCGCA AAGACTTGCA CCGCTTGGCG
GCCGATCTCT CCTCCAGAGC GGGCGTCGTA ACGGTCGCCG CGGTCGAAGC CATTATGAAG
GCTCGCGGAC GCGGATTGCG GGGGAATAGT AACACTCCGC ATACCGACCA AGACGCTCCC
AGCAATGCAC CGCCGGCTTC GTCGGCGTGG GCGGATTTCA TGGCGGACGA AGGCGTCAGC
AGTAACAACA ACAACAACAC GAGCAACAAC GGTACCGGAA AAGGTCCGCG CCCGCAGTCC
TCCACGCGGG CTTCCGGTGC CAACGCCAAC AAGACCCAGA ACGTGACCAA GTCGCCCGGA
ACTCAACGTG ACTACGAGTT GCCCATTCTC AGTGCTTATA CTTTTCATGG TACCGCCGGC
ACGCATGGAA ACGCTACGGA ACCCGCCAAG AAAAAAGCCA AGACTCCCAA ACAGAATACC
CAGCAACCGG GGTACACCTT GTTGCAAGCC GGAACCCTCG ATAGTACCAT TGTCGGACGC
GTCAAGATTA AAACACCAGA GCCCTACCAT CTGATGGTTC CCACTCGTAT CGCTATGGAT
CGCAAGTTCA CCAAAATCTT CACGTCCTGC AATGCCGTTC ATTCCATTGC GATTGACGAA
GCGGGCGTTG CCTACGGTTG GGGCCGGAAC GAGAGTTCCC AACTCGGAGC AAGCTTACCC
AATGTGGTCG TCCTCCCTAC CGAACTCGAA CTGCCCGACA AGGTCGTGGG CGCCGCCCTC
GGCAAGTCGC ACACGATTCT GCAACTCGCC GACCAGTCCC TATGGGCTGT CGGGGCCAAC
AAGGCCGGCC AGTGTGGCGT CCGCGTGGGG ACGGAAGTCA TTCCCAACTT TCGTAAATGC
GTCGTTCCAG AATCTGTAAC AATTGTACAG GTACGTTTTC TTTCTTGTTT GCTTCTTTGG
TACAACCGTA TGGCGCACAA AGGTGACTCA CGCTTTTTCC GTTGGATTCT ACTCTTTATC
ACAGATTTCC TGTGGCGAAG ATTTTTCGGT CGCTCTCGAC TCCGAGGGCT ACCTCTATTC
GACCGGCTCT TCGGAATACG GACAACTCGG TAACGGCGAG ACGGGGGAAT ACTTCATCGC
CGCCAACAAG CTCGGCTTTG CCAATTGCAA CGTCTTTACG AAAAGATCCG TGTTTTGTCA
CACTCCTGGT GAAAACGCGC ATTCCAGCAA TGCGAAAGAT AAGGTCGTCC CTCTCGCAGA
GGATGTTCGT ATTCAATCGA TTGCCTGCGG AAAACACCAC GTGGTTGCCG TCGAAGCACC
GTCGGACCAA AAGCCTCGAG TATTCTCTTG GGGTTCCGGC GACTACGGCT GTCTCGGACA
CGGTGTACAA GCCGACGAGT ACTTTCCCCG TATGATTGGT GGATTCATTA ACACTCCGCT
CGGAAACAAT AAGGATGTTG TCGTTACTGC TGGTGCGCAC TGCAGCCTAA TTCGCACATC
CAACGGACAC GTGTACTACT GGGGCAAACA CCGGCCCGTG GGCGAAGCCG TTATGAGACC
GCAACTCGTG GATGTTCTAG CCAACAACCA GCACGATGTC AGGCACTTTG CCGCCGGAGC
GCAAACGGTA GTGTGCAGCA CCAGTTTAGG ACAAACTGTT GCTTGGGGAC AAGGACCACA
CGGTGAATTA GGTCTGGGGA CGCCGAAATC CAGTGCCAAA CCGAGCTTTG TCTCCGCGTT
GGACGGCGCT CAAGTAATGG ATGTGGTCTG CGGCTACGGG CACACGTTGT ATTTGGTGCG
GGGAGAAACA CCGGAAGACA CCAAAATTAT TGCGGGTCTC GCGGAACTCG ATCTGGATTC
CGTGCAAGAC TTGATTGCCG GCGCGGTGGG AGTCAAGTAG GTATAAACAA TAATCAAAGA
CGAATCGTTT TTGTAAG
 
Protein sequence
MPDYNNNNNN NNNNNSTISI SNNESAAGST NPVSTPRKAP PHTTTLSRGR RSARVVEHDP 
VVAPGYGTRK DLHRLAADLS SRAGVVTVAA VEAIMKARGR GLRGNSNTPH TDQDAPSNAP
PASSAWADFM ADEGVSSNNN NNTSNNGTGK GPRPQSSTRA SGANANKTQN VTKSPGTQRD
YELPILSAYT FHGTAGTHGN ATEPAKKKAK TPKQNTQQPG YTLLQAGTLD STIVGRVKIK
TPEPYHLMVP TRIAMDRKFT KIFTSCNAVH SIAIDEAGVA YGWGRNESSQ LGASLPNVVV
LPTELELPDK VVGAALGKSH TILQLADQSL WAVGANKAGQ CGVRVGTEVI PNFRKCVVPE
SVTIVQISCG EDFSVALDSE GYLYSTGSSE YGQLGNGETG EYFIAANKLG FANCNVFTKR
SVFCHTPGEN AHSSNAKDKV VPLAEDVRIQ SIACGKHHVV AVEAPSDQKP RVFSWGSGDY
GCLGHGVQAD EYFPRMIGGF INTPLGNNKD VVVTAGAHCS LIRTSNGHVY YWGKHRPVGE
AVMRPQLVDV LANNQHDVRH FAAGAQTVVC STSLGQTVAW GQGPHGELGL GTPKSSAKPS
FVSALDGAQV MDVVCGYGHT LYLVRGETPE DTKIIAGLAE LDLDSVQDLI AGAVGVK