Gene PHATRDRAFT_21794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21794 
Symbol 
ID7202837 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp366548 
End bp367867 
Gene Length1320 bp 
Protein Length346 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182056 
Protein GI219123489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCGC CACGCAAGGA GATTTATACC TACGCGGCGC CTTGGACTGT ATTTTCAATG 
GCTTGGAGTC GAAGGTATGT AGGCATGCAT GTAGACGGAA CTGTGTTTCT AGCTACCACG
ACTTCAATAC CGTATAAGTC GGAGAATGAT CAGCGAGAGA TAGTTGTGTC CCGAATTTCT
TCAGGATTTG ATTGCCACTA CCATCTTCTT GCATCACACA AGATCTAACA CCTGGCATAC
TATATACTCT TTGGCTGTCA TAGACAGGAC AAAACGTCCC AATTCCGTTT GGCGATTGGT
AGCTACGTAG AGCAGTATTC CAACGCGGTG CAGATTGTGA AAAAGGTCCC CCAGCACGTA
GATAGTGACT TTGCCGGCGG TGGGTCCGCT TCGCTCTATC AGGCGGGGTC GTTCGACCAC
CCATATCCAT GTACCAAAAT TTTGTGGAGT CCGGATCAAT CACTCGCGGC GCCAGACCTG
TTGGCTACGA CCGGGGACTA TTTGCGGGTA TGGAACATAC GGGACGACGG CAGTGGACAA
GGCACGGTGC AATGCAAGAA GGAGTGCTTG CTCAACAACA ACAAAACGTC TGAGTACTGC
GCTCCGCTTA CTAGTTTCGA CTGGAACGAA GCTGATCCGA ACATTGTAGG GACGTCCTCC
ATTGATACCA CCTGCACAAT TTGGGATATT GAAACCCAAA CGGCGCGCAC CCAATTAATT
GCGCATGATA GAGAAGTCTT CGATTTGGCC TTTGCCCGAG GAAAGGACGT GTTCGCCTCG
GTCGGAGCGG ACGGGAGTGT TCGCATGTTT GATTTACGGA GCTTGGAGCA TTCTACTATT
ATTTATGAGT CGCCCAATTT GGATCCTTTA TTACGGTTGG AATGGAACAA GCAAGATCCG
AATTACCTGG CCACCTTTAT GGTGGATAGT CGAAGGACGG TCATTCTTGA CATCCGTGTC
CCTAGCTTGC CGGTTGCAGA ACTTGGCGGT CATTTAGGAT GTGTAAACGC TACGGCTTGG
GCGCCCCATT CTTCCTGTCA TATTTGCACG GCGGGAGACG ACAGTCAGGC CCTGATTTGG
GATTTAAGTG CCATGTCCAA AAGGCCTGTT GAAGAACCAA TTTTGGCTTA CAATGCGTCC
GGAGAAATCA ATAACTTGCA ATGGAGCGCA TCACAACCCG ATTGGGTGAG CATCGCGTTT
CACGACAAAT TACAGATTCT CCGCGTCTAG GAAAAAGCAC TAGAGACTAC TGGAGAGCAA
TAGACCGAAG TACGGCCCTG ATTCGACAAG ATGCCAATGA ACACGCCATT GAAGTATTCG
 
Protein sequence
MVPPRKEIYT YAAPWTVFSM AWSRRQDKTS QFRLAIGSYV EQYSNAVQIV KKVPQHVDSD 
FAGGGSASLY QAGSFDHPYP CTKILWSPDQ SLAAPDLLAT TGDYLRVWNI RDDGSGQGTV
QCKKECLLNN NKTSEYCAPL TSFDWNEADP NIVGTSSIDT TCTIWDIETQ TARTQLIAHD
REVFDLAFAR GKDVFASVGA DGSVRMFDLR SLEHSTIIYE SPNLDPLLRL EWNKQDPNYL
ATFMVDSRRT VILDIRVPSL PVAELGGHLG CVNATAWAPH SSCHICTAGD DSQALIWDLS
AMSKRPVEEP ILAYNASGEI NNLQWSASQP DWVSIAFHDK LQILRV