Gene PHATRDRAFT_49750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49750 
Symbol 
ID7198338 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp156430 
End bp157969 
Gene Length1540 bp 
Protein Length419 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184497 
Protein GI219128601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACTCGAACG TTAGATTGTC GTAAACGCCC TAGTAAGTGG TGCAATACTA CTGCTAAAAC 
AACACGCTTC CGCTGCGCAG ACACAGCAAG CAATACCCAT GGTCAGCACT AGATCAGTCA
GGGTACAACC CCCCTCAGAG GGAGGTCCCG TCGGAGCCGA CGATCCGTCC GTAGGAGATC
CACCTCCTTC AGACGACGAT GGCGAAGACG ACGACGACGC AGGAACGGAA CCCAGCAACG
GCTCCAGTCT TCCAGCTCAA GTGGTTTGCG TCCATCAAAC TCCACGCTTT TCCATCTCAC
CCAGTCTCTC GCAGGGACGT CGGGCGATAT TGGACTACAG CATTCCGAAG ATCGCGAAGC
TTTATGTTTC GGCAACGAAG CCGCTCTCGA AAACTGAATT CGACATCAAG GCAGGTGCTC
TCACACCTTT ACTTTCGAGT CTGGCACTCA GGGCTCGAGA GCACGGATGG CATGGACACG
GAAGTAACGG AATTCTCAAA ATTCCAAATA ACATCACTAT TCCGAACGGC GCCAGCAAGT
CGCTCATCAA GGAATACGGT CAAATCTCCC TTCCCCACAT CCGCAACTAT GTCGGTACAT
TTGCGAACAC CAAAACTCGT GAAGTACAAG ATGACGAGGC ACTCTACCAG TGCTTGAAAG
TATCGCTCAC GCACGAAGCA ATGGCGAAAA TCAATCTCTA CGAAACTGAA TGGACGGTTG
CAGGTGAGCC ATCTGGAGTC GCAATGCTCA AGGTAATCAT TCGACACGCA TACGTTGATA
CAAATGCGAC GACAATGCAT GTATGACGAA ACTTAGCAAG CTTGACTCAT ACATGGAGTC
TCTCGCGGAA CACAATGTTA CTCTTTTCAA TGAGTATGTT TATGAGCAGC TCCACGCTCT
GACAGCTCGC GGCGAACAAA CTCTTGATTT ACTTCCGAAT CTTTTCAAAG GATACGAAGC
GGCCAAGGAT ACGCAATTTT TGGAATACAT CCGCAAGAAG AAAGCCGAAT TTGAGGAGGG
AACAGTCTTT TTGGAGCCGG AAATTTTAAT GTCGCAAGCA TCTATCAAAT ATCGAACTTT
GGTGGAGAAA GGAGAGTGGG ATGCCCCATC CGAAAGCGAA GCGAAGATTC TAGCTCTAAC
TACCCAAGTC AAGGAACTTC AGGCAAAGAA GTCAGAGAAA CCTAAGTCTA AAGCTAAGGG
TGACTCAAAA AGGAAGAAGA AGAAAGGGAA GAAATCCGAT AAACCAAAAA CGGACAAGTA
TGCTTCACTA AAGAAGCCAA GCGCGTCAGA ACCTCACACA AAGACTTTTG ACGGAGACAA
GATTAAGTTC TGTACTAACC ATCAAGCTTG GGGCACGCAT TTGGCAAGCG AATGCAAGGG
ATACGGACTC GAAAAGGATT CCAATGGAAA ACCAATTCCA AAAGGATCTG AACCTAATGC
CACAGACAAG AAAGGCCCGC CCTCAAAGTC ACACGCCGCA ATCATGCGGA TGAGCAAGGC
CCTAACAACC GAAATCGAGA AGGCCGAGAC CGAAGAATGA
 
Protein sequence
MVSTRSVRVQ PPSEGGPVGA DDPSVGDPPP SDDDGEDDDD AGTEPSNGSS LPAQVVCVHQ 
TPRFSISPSL SQGRRAILDY SIPKIAKLYV SATKPLSKTE FDIKAGALTP LLSSLALRAR
EHGWHGHGSN GILKIPNNIT IPNGASKSLI KEYGQISLPH IRNYVGTFAN TKTREVQDDE
ALYQCLKVSL THEAMAKINL YETEWTVAAR GEQTLDLLPN LFKGYEAAKD TQFLEYIRKK
KAEFEEGTVF LEPEILMSQA SIKYRTLVEK GEWDAPSESE AKILALTTQV KELQAKKSEK
PKSKAKGDSK RKKKKGKKSD KPKTDKYASL KKPSASEPHT KTFDGDKIKF CTNHQAWGTH
LASECKGYGL EKDSNGKPIP KGSEPNATDK KGPPSKSHAA IMRMSKALTT EIEKAETEE