Gene PHATRDRAFT_50037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50037 
Symbol 
ID7198733 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp219998 
End bp221295 
Gene Length1298 bp 
Protein Length368 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184842 
Protein GI219129326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAACATTTT TCGACGGAGT CTATTGGAAC GACTAGGGCT ATCTGCGAGA AAAGGTCGAA 
GTTTTTTTTG CAACGTGCTG ATCCACAAGC AACCAGCTTG TATATGCTAT TTGGCGACAG
CGCACTACAC GCACGTTCCA TATAGTTCCA GCTTACACTG GAAGCTCTCC ATGATGGCGG
ACGACTGCCC CATTTGCTGC GAGCCTTTTT CGCAGGCCGA TCACGCTTAT CCATTGCATT
GCCCAACCCC CACTTGTGCC TTCAATTTCT GCTGCAATTG TGTCACTTCC ATCCAAAAGT
CTGCTGCAGA TGGCTATCAA GAAGCCTCCG ACGGTTCTCG ACAACTTAAA GTACAAGTCC
AGTGCCCCCA GTGTCGGGGC AGGTACGTTT GTGCAACCTA CAGCAGCACG TCCAATAATG
CAATTGTTCC TGCCGTTCTC TTAATGCGGC AAGCTTCCGA GTTGGAGGCC GTAGTTTCGA
CGAAAGACTC AGACCTATCG GCTACCGAGC TCGCCACCAA ACACCAATTT TGTCAATCGT
GGAGCCTGCG CGATTTGAAA GATGCCTTGG AAACGTTAGA AACGTACCAC TACGAAATCG
GGAAAAATAT CGGTCGGTCG AGTTTAGCGA CGCTAGACTG GGAGTCCTGG GCCCACGCTT
TACCGGAACA GGCCTCCGGC AACAATATGA GTTGTTTACC ATCATGCATG ACCGGAGATG
GTGCCAAACA CCCTAGCTCG GTAGAGATAG ACCCTTCATT GTTTTTAGGA CTGGACGAGT
TCGTGACGCG AGATGAGCAA GTCTTTGTTC ACAATCTCCT AACATCCGGT GATGTACAAG
GTCTGGTTCA GGCAGCACAA ATATTGCAAT CCATCTTGCA ACTTGCTCAA TCCGGTACCG
CTACGATACA ATCGGCATCA ACCAAGACAC CAGTGCAGTT ACAGAGCTTG CGCGAACGCT
TTCCTCTTCC AGCCCGAATG CCTCGTTCCG TCAATTTGCC CGTCTATGAT CCTATGGCAA
AATACAAGTT GCTCAAGTTT GACAACAAGA ATACGCTGGA GATTGCCTCA CTCCACCACG
GGGCCGGTAA ACTGGGCTTG CGCAAGCGAG ACGTGGTAAC GCATCTGGAA GGCGAAGCAA
TCTTGGATTA CGATGCCTTT GTCAGTATGC TACAAGCCTA CTACGAACAA GATCCGGAAA
CCTCTCTAGC CTTGGTTGTC AATGCAGACA AAGAAACGGC ACAAGCTCTG CAAAGACGTT
CGCAAACCAT TATTTGCGCA TCTACGCGTA GGCTTTGA
 
Protein sequence
MMADDCPICC EPFSQADHAY PLHCPTPTCA FNFCCNCVTS IQKSAADGYQ EASDGSRQLK 
VQVQCPQCRG SSTSNNAIVP AVLLMRQASE LEAVVSTKDS DLSATELATK HQFCQSWSLR
DLKDALETLE TYHYEIGKNI GRSSLATLDW ESWAHALPEQ ASGNNMSCLP SCMTGDGAKH
PSSVEIDPSL FLGLDEFVTR DEQVFVHNLL TSGDVQGLVQ AAQILQSILQ LAQSGTATIQ
SASTKTPVQL QSLRERFPLP ARMPRSVNLP VYDPMAKYKL LKFDNKNTLE IASLHHGAGK
LGLRKRDVVT HLEGEAILDY DAFVSMLQAY YEQDPETSLA LVVNADKETA QALQRRSQTI
ICASTRRL