Gene PHATRDRAFT_50095 details


Gene Information

Locus tag: PHATRDRAFT_50095
Symbol:
ID: 7198691
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 433960
End bp: 438052
Gene Length: 4093 bp
Protein Length: 982 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184953
Protein GI: 219129557
COG category:
COG ID:
TIGRFAM ID:
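
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record. The function name `span_length` is hypothetical and not part of any database API. The second calculation assumes the gene span ends with a stop codon (the gene sequence in this record ends in TAG), so the coding portion is three bases per residue plus one stop codon. Because the gene is marked as spliced, the remainder is presumably intronic sequence.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(433960, 438052)

# Assumption: 982 residues plus one stop codon, three bases each.
protein_length_aa = 982
coding_nt = 3 * (protein_length_aa + 1)

# Bases inside the gene span that are not accounted for by the coding sequence.
non_coding_nt = gene_length - coding_nt

print(gene_length)    # 4093
print(coding_nt)      # 2949
print(non_coding_nt)  # 1144
```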


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAACA ATGGCACATA CAAGCAAAAA GATTTGGATC AGACTGCTAG AAGCAATACT 
CAAGCAACGC TTTGGATGAC AGACTTCTTT GTTGGGCTTG TTGTTCGCAA ATTGCACCAC
TTGCACAGTA TGTTTGGAAC GTGTCCATTC TTAGCAGTTT CACAAAAGCA CAGGTTGTTG
CGGCTCCAGC ATTGCGGTCA GCTGCAAAAG TGGCTATGTG ATTATAATAG AAAAAGGCTT
ACATTCATAG TGGTTGTAAA TTAAAAATTC TTACTTTACA GTCAATACTG TTTCCTTAGA
ATATAGCAAT CACAGCGTGT ATGGCGGCGT TGTGTGGGGT GCGGGCAAGC GCAGTCCGGC
AGGTGGCAAG TTTGCGAATT TCGAAGCCGT CTTCCCACTA TTCCATCAGC TGCAGAGTGA
TGTGAATGTT GACAGTGCAA CCCAGGCTGA ATCCGCCAGC GGTTGACTTG GCGCGGCGTG
CTGGTTGGCG AGTGCTGTTT TTGTTTGCGG ACGACACCTT CCAGAACAAG CCGGGAGGCT
TCTTTCCCCC GTCTTGGCGG TTCGGAGTTT GACTGGCTGG CTCAATGCGG TATTGGGATA
GTGCCTTTTG TACGGATCCT CGGAGGAGTG TGAAATTGTT GGATATGGAG CAGACTTAGA
CGTCGTTATC CCGGACAATG CGTACAAGTT CCTGAAGTTG GCGGCGGAGC GACTATTGAG
GGGGGGGGGG GACAGAGGAA TGAATCCAGT GAAATCGACG ATTTTAATTT GGGCTTTGCA
AACAGGCACG CAACCCAAGA AAACAGTTCC ACTACCGTGC GGCTACGACA ATTCCTGTAG
GCTACGTTTT CCTCTAGAAG TTGTGTAGAT TGGGACCGGT GGACTCTTTT GATCCCAAGC
GTTCCTACTG CAGATCCTCG TGCTTTTACG GTACTCGCGA AGAATGCTTG CAAAGCCGTG
CTGGGCGGCG TTGGTGCTGG ATAGTGGTGA CGGCTTTGCA GCAAAACCGC ATTGCGGTGG
GAGAACGTCT CTGTATGGTC AGTGCTGGTC TGACAGGGAT TTAGGATGAG TTTGTTAGGT
GGAAAGGTAA ACTTATCGAT GATCAAAGAG AATTACTTAT TGAAAAGACA AACTATGTTA
AAGATCCCGG AGCAGTTACC TACTCAAAAG ACAAACCAAC TTGAAGATCC CGGAGCAGTC
ACCTACGCAA AAGACAAACC AACTTGAAGA TCCCGGAGCA GTTACCTACG CAAAAGACAA
ACCAACTTGA AGATCCCGGA GCAATTACTT AGAAATAGTT TTTGGATAAA TTGTATACAG
CTAGGTATAC CCCTGCTTCT CTGGGATCTT CCTGCCCTTA GGCACCAGCG AGGGGACACT
GGAGGGATCC CGTGATGGAG GAACCGAGGG CAGTCAAAAT GGCACCAGCG AGGGGACACT
GGAGGGATCC CGTGATGGAG GAACCGAGGG CAGTTGCAAT GGTACATCCG TTGGACTGGC
CGATGGAAGT CTTGATGGGC TTTCCGATGG AGACAATGAC TGCATGGAGG ATGCTAGGGG
CGACAAGATC ACAGCGATGT ATGGGATGGT GGACGCAGTC TTGGTATCAG CATCGCCACG
CGTCCAGTCC GCAGAGCTAT TAATCAGCGA TAGGTCCGAG TTAGTAGCGT TCTCGATGGA
ACCGACTTCA ACCTTAATAC CGGCTTTTAC CAGGAACATT TCCACGAGGC CGATTCATTT
CCTTTATGCT CCCGTCAATG CTCTTTTTGA TGCTTCTGTT GACGCATCCG TCAATACAAT
GCCACTCGAA TTCTGTATTT GGAGACAGGT ATGTGTCTCT AAACCTTATT CCATCTTTAG
TTTTTACAGG TAAGATAGTC TAGCCTTAGA TTTTCATATG ATCTCAAGCA ATAGCGACAA
ACGTGGTCTT TAAGTATTTG AATTAATACA GCGACAGGGC GCGTAAAATG TTCCCTACTT
AGGCAAGCAC TCGCGACCCA GCTTTTTTAC TTAGGATTAG TGATTTTGCG GCCTCTTCGA
GATAGAAAAA CCCCACGTCA AGCGTCTGTC ATATATACGT TAGACTAAGA CTGGTCTGCT
CAGGCAGATA ATGGTATAAT TTGTGCTTTA CTCTGTTTCG TAACTACACT AGTGATCTTG
TGCTATTCCA GTGCGATTCT AGTCGGTTAT CGCAGTTTTC TAGAGACCCG GATTTGTGAA
TTTCCTCTCA AAAGAGGAAA TTCACTAACT GTAAAATGAC TTTTCGCAGT ACGCCTGGAC
ATTTCTTGTC GGTTTTTTGG CATCTTTGAA ATGTGCGACA TCGTTTTACT GTTGGGACCT
TGTGTTTGGA ATGCTCATTC GGTCTCTGTA GTCGGCTCAC AGTCAGTTCG AAGGCATGAT
TCGGCGAAAA GGATTGCTCT CGTCGGAGCA AACTCTGACG ACGACGGAAG TAGAATCTGA
TGGACGCAAT CAGATCGTTT CCGTGGCTGG CAAACATGTG AGACTTCAAA GCAGCACCTG
TGTACGAAGC TATGCGCTTC AGCGTCGACG CTATATATTT GCTCTGATTT GTTGCTGTGG
GGCTACCGCG TGGGGAGTCC ACCGATACTG GATTGCTGCT ACCGCTTCTC GTGGAAAAGA
AATAGAAGCG GAGGATGGCA TGTTCCACAA GTGGGATAAA GTTGTACTAC CGCTGACTGA
TCGAGTCGCC GCATTACGGC GGAACACCGC GGATGAATTG GACGACACCG ATTGCATTTT
TCGCGACTCT CCAATTCGTC GAAAAGTGTT TGTGTATCCA GACTATGGAG ATACCGCAAA
CGGTTGGACA GCGGACGTTT TGTCATCGGC AGGGCAAAAG TGGCAAACGA CCTTGCCGCC
TTGGCCTTGG CTCGATCTGC GACGACAATC GCAGGCAAAT CGAACTAGTC ATTACGACAT
AGAAGGCCAA CACGTACAGT ACGCCACAGA GCTACTTGTA AGAGAGGTGA TGATCAATCC
CAAGTCCTGT CTACGAACGT ACAATCCCGA CGAAGCCACA CTTTTCTACG TACCTTACTT
GCCCTCGGTA GAGCATCACA AAGGCAGCAA GTACATCAAT GATATGGCGT TATCTCCGTA
TGGGAATGCA ATACTCGATA TTCTCGACAA GGATAATTAC ACGGCTTGGG AAAACACGTT
TGGATTGACG GCGAAGTACT GGAAACGTCA TGGCGGGGCT GATCATATTC TTGTCTTCTC
CGAACCTATG CATGGACTCT GGCATCCTCG TCAACGACGC GGGAACTACC ATTTTATTCA
TTCGCAGAAG CAGCTGCATC CACCAATCGT CATTTCAGTC GAATTAAGTA CCACATTCGT
AAAAATGTAC CCCAAGTGTG CCGCCAAAAA TATTCTAATG CCGTACCCCA ACACGGATGG
ACGATGGTTC AACGGCAAGC ATCACTCGGA AGCGGTGAAA GCCTCTACGG CTTGGAATGC
CTCTCTGAAA GTTTCAATTG CCGCCTTGCC AGAAGAACAA TTATTGGGCC AGGAGCCTGC
GCGACCCATC GCTCAATTCT ACGGTGCAGG AAACCACGGA ACCTGCAAAC AATTGCGTCA
AGCAATGGCT TCCGACTATT CGCAATGTGC ACTGTCCAGT AAGCTTTTCA AGCAAAACGT
CAAAATATCG TCATACGTCA TAGGTATGAA TTTGGCAAGC TTTTGTCCGT GCCCAGGAGG
CGATTCGCCG AGCGCTAAAC GGATGTTCGA CGCAGTCTTG GCCGGATGTA TTCCAATCAT
CTTGTCGCAA GATTTCGTTT GGCCGTTTAC AAACGAGTTT GATCCAAACC TTGAGCTTGA
TCCGACAGTG TTTTCTCTGC GTTACTCAGC AAAAGACTAC GAAGACCCGT TGCTGGACGT
CACGACGTGC AGTCCACTTA ATTCCTCTAA ACCAGGTTTG CAAAGTAACT TGGAGCAGAT
TTCCGCTCGG GAAATAGGGC GTCTTCGGAA TGGACTTCGG CAAGCTCGGG ATCTTTACAG
CTGGTATCAA GTCCGACCCG ACCTTCCCGA CAATCCGTTG TGGGAAAATA TTTTACCGCC
CATTTCTTGG TAG
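
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before it is measured or analysed. The following is a minimal sketch of how a GC percentage could be computed from such text. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the method the database used to arrive at its reported GC content is not stated in this record, so the result may differ slightly from the listed value.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from block-formatted sequence text and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGGCAACA ATGGCACATA CAAGCAAAAA GATTTGGATC AGACTGCTAG AAGCAATACT"

print(len(clean_sequence(first_line)))  # 60 bases (six blocks of 10)
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence text instead of `first_line` gives the figure for the whole gene span.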
 
Protein sequence
MGNNGTYKQK DLDQTARSNT QATLWMTDFF VGLVVRKLHH LHKYSNHSVY GGVVWGAGKR 
SPAGGKFANF EAVFPLFHQL QSDVNVDSAT QAESASEQAG RLLSPVLAVR SLTGWLNAEC
EIVGYGADLD VVIPDNAYKF LKLAAERLLR GGGDRGMNPV KSTILIWALQ TGTQPKKTKL
CRLGPVDSFD PKRSYCRSSC FYGTREECLQ SRAGRRWCWI VVTALQQNRI AVGERLCMVY
PCFSGIFLPL GTSEGTLEGS RDGGTEGSQN GTSEGTLEGS RDGGTEGSCN GTSVGLADGS
LDGLSDGDND CMEDARGDKI TAMYGMVDAV LVSASPRVQS AELLISDRSE LVAFSMEPTS
TLIPAFTRNI STRPIHFLYA PVNALFDASV DASVNTMPLE FCIWRQFLQS AHSQFEGMIR
RKGLLSSEQT LTTTEVESDG RNQIVSVAGK HVRLQSSTCV RSYALQRRRY IFALICCCGA
TAWGVHRYWI AATASRGKEI EAEDGMFHKW DKVVLPLTDR VAALRRNTAD ELDDTDCIFR
DSPIRRKVFV YPDYGDTANG WTADVLSSAG QKWQTTLPPW PWLDLRRQSQ ANRTSHYDIE
GQHVQYATEL LVREVMINPK SCLRTYNPDE ATLFYVPYLP SVEHHKGSKY INDMALSPYG
NAILDILDKD NYTAWENTFG LTAKYWKRHG GADHILVFSE PMHGLWHPRQ RRGNYHFIHS
QKQLHPPIVI SVELSTTFVK MYPKCAAKNI LMPYPNTDGR WFNGKHHSEA VKASTAWNAS
LKVSIAALPE EQLLGQEPAR PIAQFYGAGN HGTCKQLRQA MASDYSQCAL SSKLFKQNVK
ISSYVIGMNL ASFCPCPGGD SPSAKRMFDA VLAGCIPIIL SQDFVWPFTN EFDPNLELDP
TVFSLRYSAK DYEDPLLDVT TCSPLNSSKP GLQSNLEQIS AREIGRLRNG LRQARDLYSW
YQVRPDLPDN PLWENILPPI SW