Gene PHATRDRAFT_47962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47962 
Symbol 
ID7203145 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp562747 
End bp564027 
Gene Length1281 bp 
Protein Length270 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182253 
Protein GI219123898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.581205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACCATCGTA CGCGTCGTAC CCACACACAT TCATTCATTC ATTCATCTAT TGTCACCCAA 
TCTACCGAGA AACGTTGTGA GATCCATTCC GTACGATCTA CAAGCTCGAC CGACCTACAA
CCATCGCACG CAATTCACTG TCAGTGACCT CGCAGTCAGA GTCCCACTTT GCTCGTTCGC
TACACATACT CGATCGACGC CCAGCCACAA AACGGAACGT TTCTGTTGGT CCTTGGCCCA
TTCGATTCTA CAGATATCTC CACTGGTTTG ACAAAGTGTG AATTGCCCAA CATGAGAACA
GTATTTACAC TTTTTCTTTC CGCAACCTTG GTACGTAACT AGTTCAGTCC CACGACACCC
CCTTCACCAC AAGGACCATC CATACAAATC GCCTAACGTT CGTCTCTACT CTCTGCTACT
TTCATTCGGA CAGCTCGGCC GTTGGCTAGC GGCTGCTTCG GCACCCGTCC CGAACGTCCA
GGTCACGCTC CGCGGCAAAA AGTACGACGT CACCGACGTG CGCACGGTCC AGGACTTGCA
GGATCGCATC GAGGAGGTTT CGGGGATACT GGCGCCGCAG CAGGGACGGG TACTCTTTGA
CGGCAAACGA TTGGAGTCGA CCGATGTATT GGCCGATGTG GGTGTCGCGG ACGGCGCCCA
ACTCAATATA GTGCCTTCCA GTAAGGCCGC GGGGAAAGTC AAAAAGACCG CGACCACCAC
CGAATCCAAA ACCGATTCCG CCGCCATGAT GGAGGATTAC CTGAGACAGG CCGGGCTGGA
TGGGGACAAG CTGGATGAAC TCATGAAGGG CATGTCGGGA TCGGATGGGA AAGTACCTTC
CATGGAAGAG AGTTTGGGAA TGATGAACGA AATGATGAAT AGCCCCATCT TTCAGGAATA
CATGAGCGAT CCCGCGAAGC TCGAAGAGTC CCGGCAGATG ATTCTCAACA ATCCGATGCT
TAAATCGATG ATGGCCGGCA TGCCGGGAAT GGAAGACATC CTCAACGATC CCGAGGCTTG
GCGAGAAGCC ATGCAAGCAG CAGCCAGCCT CTACAAGAAT ATGGATAAGA ACCAACTGAC
ACAAGCAATG ATGGGAATGG GTGGTATGGG CGGTGGTATG CCAGATTTTG GTGGAAACAT
GTTTGATGGC ACTCTGGACA ATTCAGCCGC CGCAGCGGCA CTGGACGAGC TGGACGAAGA
CGACTAAATC TTTTTCCGAT AAATACTACA ACATTCACAC ACTCATACAT ATACACATAC
ATTTATTCCT GCATTGTCTC C
 
Protein sequence
MRTVFTLFLS ATLLGRWLAA ASAPVPNVQV TLRGKKYDVT DVRTVQDLQD RIEEVSGILA 
PQQGRVLFDG KRLESTDVLA DVGVADGAQL NIVPSSKAAG KVKKTATTTE SKTDSAAMME
DYLRQAGLDG DKLDELMKGM SGSDGKVPSM EESLGMMNEM MNSPIFQEYM SDPAKLEESR
QMILNNPMLK SMMAGMPGME DILNDPEAWR EAMQAAASLY KNMDKNQLTQ AMMGMGGMGG
GMPDFGGNMF DGTLDNSAAA AALDELDEDD