Gene PHATRDRAFT_49964 details

Gene Information

Locus tag: PHATRDRAFT_49964
Symbol:
ID: 7198553
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 448646
End bp: 450346
Gene Length: 1701 bp
Protein Length: 387 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184805
Protein GI: 219129246
COG category:
COG ID:
TIGRFAM ID:
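
The reported gene length matches the listed coordinates if they are read as 1-based and inclusive (End bp minus Start bp, plus one). The coordinate convention is an assumption here, since the record does not state it. The function name below is illustrative and not part of any database API. A minimal Python sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
span = gene_length(448646, 450346)
print(span)  # 1701

# Coding bases implied by the 387 aa protein (stop codon excluded).
coding_bp = 387 * 3
print(coding_bp)  # 1161
```

The 1161 coding bases are fewer than the 1701 bp span. The difference would be intron and/or untranslated sequence, which is consistent with the record marking the gene as spliced.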


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCATACGAC TTCCTCCTCC CCACCCAACT CAACTACTTA CCCATCCACC AACTGACGGT 
GACCTCCTTT TAGAGTGTAC TGTGAATACG AGCGTCTACA TCTCACGGCA AGGGAGAAAA
CAGCTTTCTC TGAACAGTGT GTAACGAAGT CCTCCAATCA CCAAACAACC CTATCAAAAA
TGAAGTTCTC CTTTGCTGCA GTCAGTGCGG CCTTGACAAT TGCTTCCGTC GATGCCTTTG
TGCCACAGAC TCACAGTCAC ACCAATGCTC GGCCTGGGTC AAATCCCGAT TCGGTCCGGA
CTGTGTCTTC CGCACTGCGA GCCGAGGAGT CACACGCATC CTCGTTTTGG TGGGGACCCG
TGGCAACGGC TTTTGCGGGA GTTTCTTTGG TCGCACAGAT CGCCGTGGCG GCTCCGGAAC
CTACTTGGAC GACTGGCGCC GCAGGTGTGT GTTTGTGTGT GTCTGTAGTT AGATGTTAAT
TGATTGTGAG GAGTGATGCG TACGACTCTG CGGTTCAGAC GCAGAGATCT TGAGAAGACT
GGAGCAAGTG CTTCGAAAAT CTCCCCGTCG GTATCCTTGC CTGCTGCTTC CAAGTGAGGG
AGGTTGCGAG CCTCATCGTC GCTTTCGTAA TTCATTTCTC ACCCTCCTTT GCCGTTGTCC
GAACTTTGCT TGACCTTGCC AACGCAGACG ATCTACTCCC CCAAACCTCA TCGGTACTGA
TCGCGGATCG AATAGATTCC ATGGACTTTT CCATGCCTTC ATATTCGGAT GCCATCAAGT
CCGCAACTCT GGATACTTCC GCCTCGTCCT CATCCAAGCC AGCCCCTCCG TCCTTTAATC
CCTTTGAAGA TTCGTCCAAA GACGACGCTG CCTCGGCTGC AGCTAAAGCG GAAGAAAAGG
CGGCTGCCGA GGCCAAAAAG GCCGAGGACA AAGCACGTGC GGAATTAGAG GCCGCCACCA
AAAAGGCCGA AAAGGAGGCG AGGCGTCAAG CGGAGCTCGA GAAGCAAAAA GCTGCCGCCG
AACGTACCAA GGCTGCTCAA GAAGAGAAAG CTGCGGCGCC AGCGAGCAGT AGTGTGGAGC
TGCCGTCGGT CCCTTCGGTG GATCTCAAAC TCCCCGATAT CTCCATTCCA GACTTCAAGG
CTCCCGACAT TACCATTCCC GATTTCAAAG CACCTGACAT CAAGATGCCC GACTTGCCCA
AATTTTCCAT GCCCAAAATG GCAACGGATG GCTACGATTT CCCCGATATC AAAGCACCGA
AGGTCGATAT TCCCAATGTG GATATGCCCA AGATTGCTAT GCCCGCCCTT CCATCTTTTG
GTGGTGGTGG TGGTGCTTCT TCGTCAGACA ATCCTTCCTC TCCGCTGGAA TCCCAAGATG
TCCGAGACGA ACGCGCGCGC TCTGCTAAAG CCGATTTTGC CGACGCCGAC AATACCGCCA
GAGAGATTGA AGCCAAGGCG CTGGAATTGC GGGCCGTTGC CAACGACAAG AAGCAAGCCT
TTAAAGACGC CAAGGATGAA GCCTGTGCAA CACGACCTGG CGGCAAAATC TTGTGTTTGC
GTAACCCCAT GAAAGCTGGA TTCTAATACG AAAATTATTA CCTATGTGGT GCTGGAACAA
ATTATCATCA CAGTCAGTAC TATTTTTATC ATTTTCCAAT GTACAAGACT ATTGCGTAAC
ATTTGGGCCA CCAGGCGACA C
 
Protein sequence
MKFSFAAVSA ALTIASVDAF VPQTHSHTNA RPGSNPDSVR TVSSALRAEE SHASSFWWGP 
VATAFAGVSL VAQIAVAAPE PTWTTGAADD LLPQTSSVLI ADRIDSMDFS MPSYSDAIKS
ATLDTSASSS SKPAPPSFNP FEDSSKDDAA SAAAKAEEKA AAEAKKAEDK ARAELEAATK
KAEKEARRQA ELEKQKAAAE RTKAAQEEKA AAPASSSVEL PSVPSVDLKL PDISIPDFKA
PDITIPDFKA PDIKMPDLPK FSMPKMATDG YDFPDIKAPK VDIPNVDMPK IAMPALPSFG
GGGGASSSDN PSSPLESQDV RDERARSAKA DFADADNTAR EIEAKALELR AVANDKKQAF
KDAKDEACAT RPGGKILCLR NPMKAGF
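
Both sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before computing a length or GC fraction. A minimal Python sketch, assuming the sequence text has been pasted into a string; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any library:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last line of the protein sequence above, as printed in blocks of 10.
tail = clean_sequence("KDAKDEACAT RPGGKILCLR NPMKAGF")
print(len(tail))  # 27
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the record.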