Gene PHATRDRAFT_49878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49878 
Symbol 
ID7198594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp171242 
End bp172996 
Gene Length1755 bp 
Protein Length533 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184664 
Protein GI219128952 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0245361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATCTTATC CCTTCTGCGA AAGACTTTCA CGTTCGTACC AGCGAACACA GCACGGAGAT 
TTCGTATTTG AAAAAAACGT ATGGACCAGA AGGCAACAAA CATAGTTTGA CATTGTTCGT
GTTGCTCACT TGGAGAGATC ATACCGCACT GCAATGAAAG ACAGAGCCAT GCCTTCCTTC
CGTTTGCCTC TACTTATGCT TTTGTCCCTT TACTTTCCTA TGGATATACT CTCAAAGACT
GATCTATCAA GCTGTATGGA AGCTATGCAA CTAGCCGATC AAAACAGAGA CGGTCTTATT
TCCAGATCCG AGTACGTCAA CCTCATGACG ACCCTGAGTC CGTACGAGTC ATGTCCAGAT
TCTCGCCTCG GCGGCGACCT TTTAGGAAAC GGCTCCTTCC ATTTGGCGTT TTCGTCTTTG
GCTTGCCTCT GTCTGCGCTA CACGAATGAT CGGAATTGTT GTACAGAAGC GGGGGAAGAA
AACGGCACCG GTCCCGTTCT CGTGTTGGGG AACGTGTACC CCAAAGCCTA TACCGAGCGC
GTTTGTGCGA CGCTCGCCGG TACGCTCGAC GATGAATGTG CCCCTCCACC AACTTTGTCG
CCTGCAGTGT TACTTTTGGA AACACCCGTT ACCGCAGCGG CGGCACCTAC TGCGGGTCGA
CCAGAAGCCT CTCAACCACC GACTGTAAGT CCCAACTTTC GTCCGTCATC CAACCCGACT
GCACCACCAA GGGAATCGAT TTTACCTACA CCAACGAGTC TATTCAATAG TGGAGGGGTT
GAGGACGGAG GCAACAATGG CTCCCTGGTT GATGCGAACA CAGACGACCC TAATCGAGGA
CTGACCATAG CCTTGCCCAT TGTGCTCCTT TTGACACTCA TCGGCACGCT TGCCTTTTGG
GCAAACCGGC GGCGTTCACG TGCGCGCGAT CGTGAGTTCA GCTCTCTGGC GTTGGGTTAT
TGGAACAAGG GACGCGGTAC GGGTGGCGAA CAACCATCGC CGTCCGACAA TTTCGACAAT
CGTACGACCT GTACAAAATC AATCGAAACG CCAGGATGCG TGTGGGTCTC TGAGTTAAAA
CTCCCATTGT CCTTGGAAAT GCATACGCCA CAGCGTTCTG TTAACCTACA GAATAGTTTT
GAAATGAATC ACAGGGGCAT CGATGCAAAA ACCCAGGTTT CTTTAGGTTC CCTTGGCGTA
TACGAATCAG GAAGCGAAAC CAGTACCGGT GTGGTTGTGG TCGGGGAGTT GGCGCATCCC
AAAATCCGGC GAGATCGGGA ATTTTCATCT CCCTTGGAAA GCGAGGGGTC GGAGATTGAC
GAGGAATTTT TATCGGAACA GGACGTTGAC GAAATTTCCT CGCTGGCGAC AACGTCAGAC
GATTACGATT GTGATTCAAA TTACACTCTG TATCGAGCAT GTGATGCCGT TCCCGAGAAG
AACCAGAACG ATTCGTATAC GCCAGGGGAT AAGGCTAAGC AGACGCCGCC CCCTCATGGT
GTGAGTAGGC AGCGGCGTCC TTCCGCAATC CCAACCGTTG GTATTGCGAC GAGCCCGATT
GTACTAGTCA ACACAACGCT GCATGCGGCA GCAACGGAAT ACCCAGACCA AGGTCCAACA
CCTGCGTTCC AATACGCGGA AGAGGGAAAT GACAGTGACT CGCAAGCAGA TTTTTCGTTC
CTCGATTACA GTCTGAAGCA AGCCGCGTGG CTAGACTTTC AGCCCGGTGC GGCACCAGAA
GAAGGTCTGC CGTAA
 
Protein sequence
MKDRAMPSFR LPLLMLLSLY FPMDILSKTD LSSCMEAMQL ADQNRDGLIS RSEYVNLMTT 
LSPYESCPDS RLGGDLLGNG SFHLAFSSLA CLCLRYTNDR NCCTEAGEEN GTGPVLVLGN
VYPKAYTERV CATLAGTLDD ECAPPPTLSP AVLLLETPVT AAAAPTAGRP EASQPPTVSP
NFRPSSNPTA PPRESILPTP TSLFNSGGVE DGGNNGSLVD ANTDDPNRGL TIALPIVLLL
TLIGTLAFWA NRRRSRARDR EFSSLALGYW NKGRGTGGEQ PSPSDNFDNR TTCTKSIETP
GCVWVSELKL PLSLEMHTPQ RSVNLQNSFE MNHRGIDAKT QVSLGSLGVY ESGSETSTGV
VVVGELAHPK IRRDREFSSP LESEGSEIDE EFLSEQDVDE ISSLATTSDD YDCDSNYTLY
RACDAVPEKN QNDSYTPGDK AKQTPPPHGV SRQRRPSAIP TVGIATSPIV LVNTTLHAAA
TEYPDQGPTP AFQYAEEGND SDSQADFSFL DYSLKQAAWL DFQPGAAPEE GLP