Gene PHATRDRAFT_18027 details


Gene Information

Locus tag: PHATRDRAFT_18027
Symbol:
ID: 7197076
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 76883
End bp: 79296
Gene Length: 2414 bp
Protein Length: 706 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002177552
Protein GI: 219111601
COG category:
COG ID:
TIGRFAM ID:
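
The Gene Length value is consistent with the Start bp and End bp coordinates when both ends are counted. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates (the helper name `gene_length` is hypothetical and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both the first and the last base are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(76883, 79296))  # 2414
```

Because the gene is spliced, this genomic span includes introns and is not expected to equal three times the protein length.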


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCGAGGTACA TCGAGGAGAA CCAACTACAT CCCCCGACAT TTCCCCCCTT TTTCCCAAAC 
ACACCAACAC TGACAAATCG CCGCATATGA ATCCCCCGGT GGCTTTGGAA TCCTGCGAAC
CCGCGTCTCG CGCCGCGCTC CGTCGGGCGC GTCGGGTCTG TGTCAAGGCC GGTACATCCG
TCGTAGCGAA TGAAGACGGA CGGCCTTCGT TGACGCGTCT CGGCGCCATG ACGGAACAAA
TCGCCGACCT CGTCCAATCG GGCATTCAAG TCATTCTCGT ATCCAGTGGA TCTGTGGGAA
TGGGGAAGCG ACTCTTGCGC AAACAACGGA ACCTGCAAAT GAGCTTTCGG GACATTCACA
ACAACGATCA CGTCAATATT ATCGGTACCA ACAACGGAAT GATGCCGGAC GATGTCTCCC
TTTTGGCCAA GTCCGGCGTG CCCCGGACGG CGTCCAGCTC GTTCGTGTCG CTGCTTGACG
TCAACGAACG CCCCCACACG TTGGCCGAAA AGAAAAAGTA CTACGACTCG GCCTGTGCGG
CCGCGGGACA GTTCGAAATG ATGAATCTCT ACTGCGGGTT GTTCGCATCG TACGATATTA
CCGCCTCGCA AATTCTCGTT ACCCAGACAG ACTTTGTCGA CGAATCGCGT CAACGGAATT
TGCAGTATTC CATTGAGCGA CTGCTGGGGT TGGGAATCGT CCCCATTATC AACGAGAATG
ATGCCGTCTC GGCCAATATG GGATACACTG CGGACGACGT GTTCTCCGAC AATGATTCAC
TCGCGGCCCT CTGCGCCCGC CACTTTGGCG CCGAAGTATT GCTGCTGTTG ACGGATGTAC
CCGGGGTCTT TGATCGACCA CCAACGGAAC CAGACGCCAC TCTGCTGCGA TTGTACCAAT
CGCAACCCGT CGCTATTGGG GAAAAATCCA GTCAAGGTCG CGGCGGCATG GCCTCCAAAA
TCGACGCCGC CTTGTCCGCC GTCCAACCCG GATCAACCTG TTGTGCCTGT GTCGTGGCGG
CAGGGAACGA TTTGAACGTC ATTCGTTCGG TTCTCGCCAA AACACCACCG ACCACAGCCG
TGTCAATGAA AGACATCAAA GGTACCATGT TTTGTACACC GGGAAGTGCG TTGGAAGCAC
AGGCCGTGGC CGATTTCGTC TCGGACACCC ACCAAGACGC GAGTGTTGCC GAACAAACAC
GAATCTTGGC GACGGCAGCC CGAACACAAG CGCGTAAACT CCAAGCCTTG CCGTATGGGG
CACGACAAAC GATTCTCAAC GCCGTCGCGG ATGCCTTGCT CACCCATCAG GAGGCCTTGA
TGGAAGCCAA CTTGCTCGAT TTGCAAGCAG CCGAGCGGGA TGGCGTTAGT GAGGTGTTGA
AGAAGCGTCT GGGTCTGACG CTCCAAAAGT TTGACACGCT GGCGGCTGGT ATACGCCAAA
TTTCGGCCAA CAAGGACCCA CTCGGTGTCA TACACAGCAG GCGTGAATTG GCGGACAATC
TTGTCTTGTC GCAAGTGACG GTTCCGATTG GCGTATTGCT TATTATTTTT GAATCGCGTC
CAGACAGCAT GCCGCAAATT TCTGCGCTGG CTCTGGCCTC AGGCAATGGA CTGTTACTGA
AGGGAGGAAA GGAAGCAACG CATTCGAACG CAGCCATACA CAAGGTTATC GGAGACGCGA
TTGAAGAAAG CAGTGGAGGA GAAATTACCA GAGATATCAT TGCATTGGTA ACAAGTCGAG
GACAGGTGGC TGATTTACTG AGTCTAGACG ACGTGATCGA CTTGGTCATC CCTCGAGGGA
GCAATGACTT GGTATCGTAC ATCAAGTCCC ACACGAAGAT TCCGGTCCTC GGACACGCCG
ACGGCGTATG TCACGTCTAT GTGGATGAGT CCGCCGCTGC GGATGCCGCA AGCAAACTGT
GCGTGGATGC CAAAACGGAC TATCCATCGG CTTGCAACGC CATGGAGACA CTTTTGTTGC
ACGCAGCAAC GCTTTCCAAC GGGGTAGCCG CTGCGACGCT CATGGCCCTG CGAGCATCCG
GAGTGCAGTG TCTAGGAGGA CCGGCCGCAA TGAAATCTGG GTTGTGCGAT CGGGCTGCCC
CAGAACTCAA ACATGAATAC GGGGACTTGA CCTGCTTGGT CGAGGTCGTT CCGAACCTAG
AGGCCGCAAT TGATTGGATC CACAAGTACG GTAGTGGTCA TACCGAAGCG ATTGTCTGCG
GTGAGGAGAG TGACGTTGGT GAGGAATTCT TACGAAAGGT TGATGCAGCT TGTGTCTTTC
GCAACGCATC GACACGATTC GCCGATGGCT TTCGATTTGG CTTGGGCGCT GAAGTGGGTA
TATCGACCGG TCGTATTCAT GCACGTGGCC CCGTAGGCGT GGAAGGTTTG CTGACGACGA
AATGGCAACT GCGA
 
Protein sequence
MNPPVALESC EPASRAALRR ARRVCVKAGT SVVANEDGRP SLTRLGAMTE QIADLVQSGI 
QVILVSSGSV GMGKRLLRKQ RNLQMSFRDI HNNDHYYDSA CAAAGQFEMM NLYCGLFASY
DITASQILVT QTDFVDESRQ RNLQYSIERL LGLGIVPIIN ENDAVSANMG YTADDVFSDN
DSLAALCARH FGAEVLLLLT DVPGVFDRPP TEPDATLLRL YQSQPVAIGE KSSQGRGGMA
SKIDAALSAV QPGSTCCACV VAAGNDLNVI RSVLAKTPPT TAAVADFVSD THQDASVAEQ
TRILATAART QARKLQALPY GARQTILNAV ADALLTHQEA LMEANLLDLQ AAERDGVSEV
LKKRLGLTLQ KFDTLAAGIR QISANKDPLG VIHSRRELAD NLVLSQVTVP IGVLLIIFES
RPDSMPQISA LALASGNGLL LKGGKEATHS NAAIHKVIGD AIEESSGGEI TRDIIALVTS
RGQVADLLSL DDVIDLVIPR GSNDLVSYIK SHTKIPVLGH ADGVCHVYVD ESAAADAASK
LCVDAKTDYP SACNAMETLL LHAATLSNGV AAATLMALRA SGVQCLGGPA AMKSGLCDRA
APELKHEYGD LTCLVEVVPN LEAAIDWIHK YGSGHTEAIV CGEESDVGEE FLRKVDAACV
FRNASTRFAD GFRFGLGAEV GISTGRIHAR GPVGVEGLLT TKWQLR
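
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be measured or analysed. The following Python sketch shows one way to do that and to compute a GC fraction. The function names `normalize_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration, not part of the source database.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = (
    "GCGAGGTACA TCGAGGAGAA CCAACTACAT "
    "CCCCCGACAT TTCCCCCCTT TTTCCCAAAC"
)
seq = normalize_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full gene sequence and the full protein sequence, `len(normalize_sequence(...))` can be compared against the Gene Length and Protein Length fields as a quick consistency check.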