Gene PHATRDRAFT_49984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49984 
Symbol 
ID7198693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp35405 
End bp37141 
Gene Length1737 bp 
Protein Length572 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184879 
Protein GI219129403 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGCA AAACGAAAGA CTTCACTATT GCCAACAATA GTCTTCCCGA ATTTGGAAAT 
CCCCCTGACA AGCACAGCAG AAAGAAGAAA AGCAAAATCC GGAAGAAGGA TCGACCATTG
TGGTCGGCGC CGCGCACGGA CGCGCCACTT ACTGCTACAG ACGGGGAAAC CAACTCCGTT
AACGAAGCTC TGGGCTATTC TCAATTTCAT TCTCACGATG CCGAGCACGA ATCTGACGAC
GGTTGGTCTG ATACTTCCGG TAAAAGTAGT GAGCCAGACC TGGGGAAAGG ACGGCAGGGT
CTAACAAAGA ACGCCGCCAC GATCCAAACC AACGTCGCAG AATCTCGTCA AGCCGATCCT
CCGATGATTT GCAATCTCCC TGTACCGGTA TCCTTTAGTT CCGACGATGA AGACTCTAGT
GGAGAGGATA GTGACCATGG ATTTGCGCAT TCCGCGCTCG GAGCATCTCC TTACACTGTC
AAACAGCGGT CAAATGAAGA CTTGACGCGA GACATCAATC AGCGACCTCC TTCCCCTCCC
GCATCTCCGA AAAAGAATAC GTCCGAACAT TTCATTGGGT ACCGTTCTAT GGACACAAAC
CGCAAGGCAA TTCCCCATTA TTTGACACGC ACAAGTAGCC GTCCGGACCC TAGCGATAGT
CCAAATTTTC ACAGCAGCAA TGCCCGTCAA CATCAGCACG AGCCGCAGTA CAATCCAAAC
GAGCAGAGCT CTGTTGGTAG TCCTCGTATA AATGGGATTG AATTTGCCGG GAACTACGGA
GATGGACGAG TTTCCGGCAA TGAGAAGGGT ACGCGAGGGT ACGATCCTGA CTACGAGAGT
GATCAGACCG GCCTACAGTA CATTCCAAGA GATCCGTACC AGGATTCCGA TTATGATCTC
CAAAACGATC CTCGATACTA TCCCGAAACG GCTGAAAGCG GTAGTGAATA CATTCCCGCA
ATCTATAACG CCGAATTTCC CGATGCCTCG AAGACAGACC CCGAGACCGG GTATTTGCCA
TCACAGTCGC AACTACTCCG AGAGGCCAAT GATTCGATTT CGTCCAGCCA AGTTCAGAAG
CGTGACCGGC GCACAATGAC ATGGCTCATT GTGTGTCTGG GATGCGCTCT CGTGGCACTG
GCTGCTCTAA CAGGAGGAAT AGTCGGAGCT TTGGTGTCCA AAGAGGATGC CGATGTGGTT
GAACTGTCGG AGCCAACGGA AAACAGTTCC CCAACCACGC CCGCTGCCAA CATAACAAAG
GCACCAACAA TCCCAACTCT AGTACCCACT TCTTCGCCAG CCGATATTGA AGAGAGAACA
GAGGGACCAA CTCCATCTCC TCAAACATTC TCACCCACAT CATTGGCCAC AACGATCACA
AGTCTTCTAC CGACACCTTC GCCAACGAGA ATGGAACAGA GTACAGAATT TCCTACTCAA
GCTCCGCAAA CAATTTCACC CACGTCCTTG GCACCAACGA TCCCAAATCC AGTTCCGACA
CCTTCACCAA CGGACACTAA AGAAAGTGAA GGAGAACCAA CGCAATCTCC TCAAACAATT
TTTCCCCCTA CAACCGCTTC AAGCGAAGAA GGTACTAGCA ATGCGCCAGC GGCCGCCATT
ACAAATACCC CACAAGTTGC GACAACGCCT GCGCCTGTTA CCGGCGGTGG TGGTTTCGGA
ACTAGCGGTG GATTTGGCAC CGGTGGATGG TGGCGGTAGG GTGGAAACTG GCAAGTG
 
Protein sequence
MSRKTKDFTI ANNSLPEFGN PPDKHSRKKK SKIRKKDRPL WSAPRTDAPL TATDGETNSV 
NEALGYSQFH SHDAEHESDD GWSDTSGKSS EPDLGKGRQG LTKNAATIQT NVAESRQADP
PMICNLPVPV SFSSDDEDSS GEDSDHGFAH SALGASPYTV KQRSNEDLTR DINQRPPSPP
ASPKKNTSEH FIGYRSMDTN RKAIPHYLTR TSSRPDPSDS PNFHSSNARQ HQHEPQYNPN
EQSSVGSPRI NGIEFAGNYG DGRVSGNEKG TRGYDPDYES DQTGLQYIPR DPYQDSDYDL
QNDPRYYPET AESGSEYIPA IYNAEFPDAS KTDPETGYLP SQSQLLREAN DSISSSQVQK
RDRRTMTWLI VCLGCALVAL AALTGGIVGA LVSKEDADVV ELSEPTENSS PTTPAANITK
APTIPTLVPT SSPADIEERT EGPTPSPQTF SPTSLATTIT SLLPTPSPTR MEQSTEFPTQ
APQTISPTSL APTIPNPVPT PSPTDTKESE GEPTQSPQTI FPPTTASSEE GTSNAPAAAI
TNTPQVATTP APVTGGGGFG TSGGFGTGGW WR