Gene PHATRDRAFT_21897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21897 
Symbol 
ID7202930 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp784764 
End bp786364 
Gene Length1601 bp 
Protein Length351 aa 
Translation table 
GC content50% 
IMG OID 
ProductG protein beta subunit 
Protein accessionXP_002182135 
Protein GI219123650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.173476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCACAGAG TCTCCTGACT AACTGTAAAA AAGGGAGAGC GATCTACTCT TGCAACTCAC 
CAGTCATCAA TACCACGCGT TCTTAGTTCC AATCTGTGTG TCTCGGCTTA CCCCAAGCTC
TAATTTTTGT CTACATTTGG TCGAGAGCGG AAAATTGAGA TCCCAATATC TGCCTGTCCG
CCTCCTGCAA GCTCGCTTAC TTTCGTGCTA GTACGAACCA CCATAAAGAA GTAAAGTCAT
GTCTACTTCC GAAATTCAAC AGGATACTGC TAGAGAGGAG GTACGTTGCT CATCTTCGAT
TCCTTTCATT CTGTTGCTGA CCGTTCACGA TATCCGGAAA TGTGTTGGGA AAGATTCCCT
ACCTGGAAGA TACACTTTCC AACACTAGCA TACTTGTTTT TTGGCATCCG CCCCTGATCT
TTTCATGCTG GGAAAACAAG AGGCCATTCA ACGTCTCTTT TGCTTCAAAA CGGAAACGGT
GTCATTCCAA ACCCGCCTCG GTCTCTCACT CGGCTTTTGT CCTTTTCGAT TTACCCGACA
CGCGCAACAA ATAGGTACCT TCTCTGACGC AACAGATTGA AAAGGTACAA AAGTCCAAAC
GGGAACATTC CGGCTCGGCG GCGCAGGGGT CGCCGGTTCG TGCTCCCTCT GCCGCCAAAC
TGCGTCGAAC GCTCAAGGGA CATTTCGGTA GAATTGCTGC ATTGCACTGG GGCGGCGATT
CCAAAACGGT CGTTACGGCG GGACAAGACG GAAATTTGAT TCTTTGGAAC GCGATTACCA
GCAACAAGTT GCAGTCAATT GGTCTCAAGT CTTCCTACGT CATGGCGGTT GGTATCGAGC
AAACTAGAGG CAATTTGGTA GCCTGCGGAG GACTCGATAA TCTTTGTACG ATTTTCCCCC
GCAATAATGT CGGTAAGGCT GCCGAAATGG CCTCGCATGA CGGTTTTCTT TCCTGCTGCC
GTTTCTTGAG TGAGCAAGAA ATCATCACGT CGTCGGGTGA CTCGACTTGT ATTTTGTGGG
ATATCAACAC GCACAAACCC GTTTCACGCT TCGAGGAGCA CACGGCAGAT GCCATGTTCT
TGTCGCTCCG ACCAAGCGAT CGCAATGTCT TTGTCTCCTG TTCAGTGGAT CAAACTTGCA
AGGTGTGGGA CACTCGAGCC CCTACCAGTT CGACTTTGAC GTTCACTGGA CACACAGGTG
ACGTCAATGG AGTAGAATTC CTACCATCGG ACAACAATTG TTTCGCCTCT TGTAGTGAAG
ATAACACCGT CCGTATCTTT GATATTAGGG CCAGCGATGA ACTCGCAAAA TTCCAAGGGC
CAGCGAGCTT GGGGTCCTCG GCGGTTAACG GCAGTGGAGG TTTCAGTGAG TCTCCATCGG
ATGGATTGAC ATCTTTGGCC GTGAGCAAAT CGGGCCGACT GGTTTTCTGC GGTGACTCGG
AGGGCAACTT CTCGTGCTTT GACATTTTGT CGGAACGATC TGGACCAGCT TACACGAATA
CAGGTGCGCA CGATCGATAC ATCTCGTGCA TCGGCATCAG TCCCCACGAG GACGCGATTT
GCACCGGAAG TTGGGACACT CAAGCCAAAG TCTGGGCTTA G
 
Protein sequence
MSTSEIQQDT AREEVPSLTQ QIEKVQKSKR EHSGSAAQGS PVRAPSAAKL RRTLKGHFGR 
IAALHWGGDS KTVVTAGQDG NLILWNAITS NKLQSIGLKS SYVMAVGIEQ TRGNLVACGG
LDNLCTIFPR NNVGKAAEMA SHDGFLSCCR FLSEQEIITS SGDSTCILWD INTHKPVSRF
EEHTADAMFL SLRPSDRNVF VSCSVDQTCK VWDTRAPTSS TLTFTGHTGD VNGVEFLPSD
NNCFASCSED NTVRIFDIRA SDELAKFQGP ASLGESPSDG LTSLAVSKSG RLVFCGDSEG
NFSCFDILSE RSGPAYTNTG AHDRYISCIG ISPHEDAICT GSWDTQAKVW A