Gene PHATRDRAFT_49802 details

Gene Information

Locus tag: PHATRDRAFT_49802
Symbol:
ID: 7198464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 366213
End bp: 368143
Gene Length: 1931 bp
Protein Length: 570 aa
Translation table:
GC content: 57%
IMG OID:
Product: predicted protein
Protein accession: XP_002184531
Protein GI: 219128672
COG category:
COG ID:
TIGRFAM ID:
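
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length). The helper names `span_length` and `min_coding_length` are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 366213
END_BP = 368143
PROTEIN_LENGTH_AA = 570


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def min_coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
coding_length = min_coding_length(PROTEIN_LENGTH_AA)

print(gene_length)                  # 1931
print(coding_length)                # 1713
print(gene_length - coding_length)  # 218
```

Under these assumptions the span matches the listed 1931 bp, and it is 218 bp longer than the 1713 bp needed for a 570 aa protein and its stop codon. Because the record is marked as unspliced, this suggests the listed span includes some bases outside the coding region.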


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAACAAGTCT ACTCACACTC ACATACACAT GTAGTATACA TACACACACA CACACCCATT 
CCCTCACATA ACAATCATAT CCTCTTACCA ACCAACCAAC CTTGCTGCCG ATACCATCCA
CGGAAGACAC ACACACACAC ACTCTCACTA CCCTCAAAGT CACTTTTAGG TCACAGCCAC
TCCTCACACG TATCCAAGGA GTCGGTTTCG ACTTCCCCAT GGGACTTGGT TCCAACTTAT
CGTCGTCTCC GCTCGGTAGG AGCGTTCACC ATTCTTCGAG TGATTCGACG TTGCGGAGTC
CCCCACGGAC GACTCGGACT ACCGTCCGGC ATTCCACGGC GTGTGCGCGT TCCTCTCCCA
CCAGCGTTGC TCTGGTTCCG ACGACGTATC CGTCTCCTAC TGCTCATCAT CAACAACAAC
AACATCACTA CTCCGTCAGT GCGAGTCATT GCCACCGCAT GCCGCATCCC CAATCACCAC
CACCGCGTTT CTGGCTCCAG TCATTCGTTC CATCCTCCTT CTTCTCTTCC TTCTCTTCTT
TTTCTCTCTT TTCCGCACGT CCCATGAGCC GACGAAGACT GCGGAGATCG CGACCCAACG
CTCTGCTATT GTACAGTCTC GTAGCCTGGG GGACGCTACT CGTCTCGTGT TGGACCATCT
TTGTTATACT CTCACCGCCA CAATCCGCAT CCGTACCTCC CAATCCGCTA CCACACCATT
CCGTCGCACT CCTGCGGCAG GCCTCCACCG AACTGCGGAC ATCGTCAACG GGACAACGGC
GTCAAGTCAC CACAGTCCAA GTCCCCGACC TGTCCTCCTG GCGCAACACA CACGGCGTAG
TACACGTCGT ACAAACACGC TTTATGCAGT ACCAACCCAA CTTGTTGGAT CTGGGTCATG
CCCGTTTGGA AATATTCCAA GCCCTTACCC TGCCTTCCGT CCGCGCCCAG TCCTCGCAAG
AGTTCCTGTG GCTCATTCGC ACCGACCCGG CCTTGCACCC AACCTTGCGT ACCGCACTCT
GTCACGTCCT GCGGGACGTC CCCAACGTCA TTCTGGTCGC CTCCAACGAA AACCCCGAAG
GCTTCCGCGC GGACGACGCC GTCGCGGACG TCAGCGACGA TTCCGTCTGG GTCGGCCACG
CCGACACGGT CCGGGCCTAC CACGCCGCGG CACAAACCCA CGTTCTGTTG GAAACACGAC
TCGACGCCGA CGACGGCCTC GAAACACACG TCCTCGAGAA TTTGCAACGC CAGGCCGCCA
CCGCGCTCGT GCACGCACCG GCCGTGGGAT GGCGCGTCTG GTGCGCCTCG AGTCACCTGG
AATACCAACA CTACAACGTG TGGGACGCTG GGGACGTCCG GGGAGCCATT GTGGGGATCA
AAACATCCTA CTGCGTCACT CCCGGCTTGA CTTGGGGCTA CGCGGTTGGT GTCGTCCCGC
ACAAGGTCGA ATCGAAACAC GATCGCATAC ACAAACGTGT CCCGGCCTGT ACGGAGGCAT
CGGCCGAGAA CGCCCGTGTT ACGGGCTGTC TGACCCGCAT ACAGAACGGA CGGCATCCGG
CCGCGGTGCG CGCGCGGTCG CCCACCAGTG CCGGCATGGC CAATCTAATA CTCGACGCCA
CCATGACACA AAGCGGCAAC GCTGCCGCCA CGATGCACAA ACACAACCTA CAGAAATTGC
AAAAATCGCG TTGGAAAACG TTGCAGGACG ATTTGTGGCT CACACTGCCC CTAGTGTTTG
GGATTGTACC TGCACGGGTG TGGCAAGCCC GGGAATATCT AGAAGCTCAC ATGGTTAATA
TTGTGCGCGA TAATCTGGCC GGACAGTGTA CCAAGGGACA CAGTTGCAAG GAATTGAGTA
AACAGGCACT ACAAGTTCTG TTGGATATGT ACGAGGCCGA CCAGGCGGAA CCGGAACCCG
AACAGCTGTA A
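
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in this layout. The sketch below assumes the sequence is pasted as whitespace-separated 10-base groups, as shown above. The function names `clean_sequence` and `gc_content` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace so the 10-base groups form one string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "AAACAAGTCT ACTCACACTC ACATACACAT GTAGTATACA TACACACACA CACACCCATT"
print(f"{gc_content(first_line):.1f}%")
```

The sample covers only the first 60 bases, so its value is not expected to match the whole-gene figure. Passing the full gene sequence to `gc_content` gives the number to compare against the listed GC content.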
 
Protein sequence
MGLGSNLSSS PLGRSVHHSS SDSTLRSPPR TTRTTVRHST ACARSSPTSV ALVPTTYPSP 
TAHHQQQQHH YSVSASHCHR MPHPQSPPPR FWLQSFVPSS FFSSFSSFSL FSARPMSRRR
LRRSRPNALL LYSLVAWGTL LVSCWTIFVI LSPPQSASVP PNPLPHHSVA LLRQASTELR
TSSTGQRRQV TTVQVPDLSS WRNTHGVVHV VQTRFMQYQP NLLDLGHARL EIFQALTLPS
VRAQSSQEFL WLIRTDPALH PTLRTALCHV LRDVPNVILV ASNENPEGFR ADDAVADVSD
DSVWVGHADT VRAYHAAAQT HVLLETRLDA DDGLETHVLE NLQRQAATAL VHAPAVGWRV
WCASSHLEYQ HYNVWDAGDV RGAIVGIKTS YCVTPGLTWG YAVGVVPHKV ESKHDRIHKR
VPACTEASAE NARVTGCLTR IQNGRHPAAV RARSPTSAGM ANLILDATMT QSGNAAATMH
KHNLQKLQKS RWKTLQDDLW LTLPLVFGIV PARVWQAREY LEAHMVNIVR DNLAGQCTKG
HSCKELSKQA LQVLLDMYEA DQAEPEPEQL