Gene PHATRDRAFT_46158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46158 
Symbol 
ID7201246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp440199 
End bp441688 
Gene Length1490 bp 
Protein Length442 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180636 
Protein GI219119766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGTCCTGTC GGAAACAACC TTGGCCATGT AAAACGGAGA AAGAATCACC CCGTGACCAC 
CAACGGAAGT ACGGGAAGGT AGTCTTTTCC ACGCGTACCT CCATTGAATC AAAAGACACC
ACAACAGATT AGCAAGCCCG CGAGGATTAT CTCGTCTCAC GATGACCGTG ATACAGGCAC
CAAAACGAAG CGGTCGGAAA GCGAGACCCA CGGTACTGCT AACGGGCTGC TTAGCGGTCA
GCTTTCTGGG CGCCGCACTG ATAGTCTTCG TAAGTCGCTA CGATGCTCCG CCTTCGTCGT
TGCCGTCGGC CGACAAAAGT GCGAATTCTC TTAGAGATGC ACCCAAGCCG TCCATGATGA
GCACCACAGC AAGGGAACAA CCGGCTGTGT CGTCGGTTAC CCGTCCCGTC GTAGAAGGTG
TCCCTACCGG GTCCTATCCC TACAAGGATA CCGCTGCCTG TGCGGGTTTG ACCAACCAAG
GTCACTCAGA ACACCAGTGT TCCATTCATT ACCCGCTCGT AATAGATAAA CAAGGCAACC
TACGGTCGGA CGACAGCAAT TCACTTTCTT CGCAATCGGA TTTCTATATC GTTACACGCA
AAGGCGATAA AGATGGTGTT TACCCCATGC CGTTGCCCAA TCAGGATCGG TTGGTTATGC
TGAATCCGGT CACGGTCTCT ACCGATCGAG AAGAGTCAGC GTCCGACAGC GTGTTTGTCA
TGCTGGCGGA TGGGCACGGT GAGTTTGGCC ACGATTGTGC AGATGTGGCT AGCAAAGAGT
TGCCGTCGCG TTTTCTGAAC GCTGTGACCG ATTCACTTCC AACGGCCGAC GACACCGAAA
CGGCAATCCG CGATGCTTTG CGAAACGCGT TTTTGCAAAC AGACGCGGGT CCACTGGAAC
CTTTTGCTGA GGCAGGGACC ACCGCCATTG CAATGTTCAA ACACAAATCC AAAGTATATT
TGGCTTCGAC GGGTGATTCC ACTGCCTTGG TCGGCAGGTA CAGCAAGGAT CACATTGTGT
CTATAGAAAA ACAAGCCGTC CATCATAAAC CGTCCGATTC AGATGAGCGC GTGCGAATCG
AAGCGGCTGG TGGGACGGTG ATTGTGCCTG CAGATCCATC CCTTACTAGC CGAGTAGTGA
TTGGAATGTC TGCTCTGGCC ATGTCTCGAT CCTTGGGTGA TACCGAAGGC AAACGAGCCG
GCTATTTGAC GGCGGAACCT AGTGTTCAAG TTCTGGATCT ATCAGAATAC GACGAAGCGG
ATACGTTTTT TCTCCTGGCG GCGACGGATG GAGTCGTGGA TTTCCTCGAC TTGAAGGAGA
TTGTCCAAGC AATCGGCCAT GCAATGTACG GCCCACCCGG GAAGGAAACT TCCAATCTAC
CCAATACCGT GAAGCGTATC ATGGAGAACG CGTCACAGCA ATGGTTTGCT CGAACAAGCG
GTACTTATCG GGATGATATG AGTTTGGCAG TGACCAAGAT TCGACGATAG
 
Protein sequence
MTVIQAPKRS GRKARPTVLL TGCLAVSFLG AALIVFVSRY DAPPSSLPSA DKSANSLRDA 
PKPSMMSTTA REQPAVSSVT RPVVEGVPTG SYPYKDTAAC AGLTNQGHSE HQCSIHYPLV
IDKQGNLRSD DSNSLSSQSD FYIVTRKGDK DGVYPMPLPN QDRLVMLNPV TVSTDREESA
SDSVFVMLAD GHGEFGHDCA DVASKELPSR FLNAVTDSLP TADDTETAIR DALRNAFLQT
DAGPLEPFAE AGTTAIAMFK HKSKVYLAST GDSTALVGRY SKDHIVSIEK QAVHHKPSDS
DERVRIEAAG GTVIVPADPS LTSRVVIGMS ALAMSRSLGD TEGKRAGYLT AEPSVQVLDL
SEYDEADTFF LLAATDGVVD FLDLKEIVQA IGHAMYGPPG KETSNLPNTV KRIMENASQQ
WFARTSGTYR DDMSLAVTKI RR