Gene PHATRDRAFT_42794 details


Gene Information

Locus tag: PHATRDRAFT_42794
Symbol:
ID: 7196158
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1132664
End bp: 1134667
Gene Length: 2004 bp
Protein Length: 490 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002176728
Protein GI: 219109951
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGAATGGAAC ACGCAGCGGG GTCCATTCTA GAATCTACCA ACCACAAGTG GTGGGTGTTT 
CTTGTTGGTC GTCGTCGTCC CTCCTCTTTC TCCATTGGAA CGACTCTAGT CGTCGTTGGG
TATCACACAC ACATCCACAC AGACACATAT CGTTCGATTG AAGCGAGTCA TCGGATTGCT
TTGTTCGACA GGTATGAGTG GCACCAACCA TTCCGCTCCG CACGGGTCGC CGAACCACAC
CAGTACTACT GCTACCACTA CCACCACGAC GACGACAGCT CCACCCACGT ACTACACTGC
GTACGGAGGA GGTGCCGGGA ATCCACCCCC ATCCTTTGCT GACGGATTCC CCCACAGCCA
CACGGGACCG CCACCTTCCC GAGTCTACCA CGGCACGGGA TCGCCCGAAA CAGGATCACC
CGCGCCGCAC CAGCCACCAC CATCACAACC ACCATCGTAT CACCACCACC ACCGTTCGCC
TTGGAAAGGA TATCCCTGGG GAGGATCATC TGCGGAATCC GTACCGGTCA ACGCATCTTC
CGCTACTACT ACCAATAGTA CTACCAATAC TACCACGGGG GCTCCACCGC CGTACGCCAT
TGGAAGTGGT GGAAGTGGAA ACCACACCCA CCACAACGCG ACTACGGCGG GATCGACCGG
ATACCGTCAC TCGTACGTGG GACCACCACT ACCACCCGCC CACGCAAGTT ACTGGGGACC
ACCACGGTCC ACCAGTGATC TCCTAGCCGG AGGAGCATCC GTCGCCAACG ATGGACGTGA
AGGCGCCACG TCGCCCTCGC AAATTGAAAC GGATCATCTC GAATTCGTTC AAGCCGTGGG
TTGTACCTGC AAGAAAACAC GCTGTTTGAA ACTATACTGT CAATGTTTCG GAGTCAAGAT
CTATTGTGGT CCCAACTGCC GTTGTTTGGA CTGTCACAAT GTTCCCGCAC AAGAAGATGC
CCGGCAGAAT GCCATGAAGG TTATACTCTC ACGCAATCCC CACGCCTTTG ACACCAAGTT
CCAAAAAACA CCCGTCGACG GCGCTACGGT GGAAACGCCT TCCAAGCTAT TGACGCACAA
GTTGGGATGC AAGTGCCGCA AATCGGCTTG CATGAAAAAG GTACCTAAAC CCAACAACAC
TGCGGGTCGA TGACCTTGCG TCTCCTCCGT CACACTCTAG TCACTCACCC CTTGCTCGCT
TTTGTCGTGG TTGTTTGTTT TTGTGTATGT GTGTATTTGG TTTGTTTGTA TTGTTTCTTA
CAGTATTGCG AGTGTTACGC CGGTCACGTG TACTGCAACA CGCACTGCCG TTGCACCGGT
TGCAAGAATC GGGATGGCTT ACTTCCGGGA CCGGGTGGTC CCGGAGGTCC GTACGGGGCG
ACGGTCCACC ACCACGATCC CCGCTTCGCC TCGCCGGCAC GGGCCACCGC CCCGGTCTTT
GCGCCGCCAC TGCCGCACCC GACGCACGTT ATGCAACCCG TCGGTAGTCG CTCCAACGCA
ACCAACGCGG GGGGTAAACG GGGTGAGCCC TTTGTCGCGG CCGCACAAAA TTTGGCCTTT
CTGAAACGGG GATCGCCCGA AGATGCCACC ACCACCGGGC CGGTTAAAAA GGCCCGTGGT
CCGCCCTCTT CGGAAGGAAT GAACAGTCTC ATGATTGCGG CGCAAGCCAT GACGGAATTT
GGACAAGGAT CGTCGCCCTC GAAAGCCCGC TTGTCGGTCA AGGAGGCGCA GGAACTGGCC
AAAAGAGCCG TGGAAACACC GACGCCCCGC AAACATTCGG TCTACAAGCA AGAAACCGAT
ACGGTATAAA GGTAGTTCGT CGCAGAAGAA GAACAAAAAG ACAAACAAAT GAAAAGGCAC
GCCAATGACC CAACCAAGGG TTTAAGCTAG GATTAATCCG ATTGTGAAAG TGAAACGTGT
GATTGTCTGT GGTGGATGCA CTTGTCCCCA AACAACCAGT CTCACCATAC CAATCGATGG
TTCTAGGTCT TACTCGTTTA AAGT
 
Protein sequence
MSGTNHSAPH GSPNHTSTTA TTTTTTTTAP PTYYTAYGGG AGNPPPSFAD GFPHSHTGPP 
PSRVYHGTGS PETGSPAPHQ PPPSQPPSYH HHHRSPWKGY PWGGSSAESV PVNASSATTT
NSTTNTTTGA PPPYAIGSGG SGNHTHHNAT TAGSTGYRHS YVGPPLPPAH ASYWGPPRST
SDLLAGGASV ANDGREGATS PSQIETDHLE FVQAVGCTCK KTRCLKLYCQ CFGVKIYCGP
NCRCLDCHNV PAQEDARQNA MKVILSRNPH AFDTKFQKTP VDGATVETPS KLLTHKLGCK
CRKSACMKKY CECYAGHVYC NTHCRCTGCK NRDGLLPGPG GPGGPYGATV HHHDPRFASP
ARATAPVFAP PLPHPTHVMQ PVGSRSNATN AGGKRGEPFV AAAQNLAFLK RGSPEDATTT
GPVKKARGPP SSEGMNSLMI AAQAMTEFGQ GSSPSKARLS VKEAQELAKR AVETPTPRKH
SVYKQETDTV
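
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that cleanup, along with a GC percentage and a coordinate-span calculation. The function names (`clean_sequence`, `gc_percent`, `span_length`) are hypothetical helpers and not part of any database tool, and the span formula assumes 1-based inclusive coordinates, which is consistent with the Start bp, End bp, and Gene Length values listed in this record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First printed line of the gene sequence from this record.
first_line = "CGAATGGAAC ACGCAGCGGG GTCCATTCTA GAATCTACCA ACCACAAGTG GTGGGTGTTT"
dna = clean_sequence(first_line)

print(len(dna))                        # 60 bases on one printed line
print(span_length(1132664, 1134667))   # 2004, matching the listed Gene Length
print(round(gc_percent(dna), 1))       # GC percentage of this first line only
```

Passing the full gene sequence block to `clean_sequence` in place of `first_line` would allow the listed Gene Length and GC content to be checked against the printed sequence.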