Gene PHATRDRAFT_50353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50353 
Symbol 
ID7199136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp54423 
End bp56393 
Gene Length1971 bp 
Protein Length553 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185272 
Protein GI219130229 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTGCCGAC GCAAAGTATT AGTCAATAGG TAGTCAACAC TTCTCAGTTC GCTTTCGTCA 
GCGACGTGGG GTTCGCTTTT GTAGCCGATC TCATAGCTAC GCCGATGAGA GTAGCATAAA
CCACCTTATT GCTTGCTACC TCCTGAGCGC AGCCAATATG CTGCTGGGTT CAAGCACATA
AAATTCTTTG ATCTCCGTCC CCCTTTCGAA ACCAAAGCAA AATGAAACAA TTTGGAATAT
TCTCGCTTGC TTGCGCATTG GCTTTCTCTT CGTTGGCACC GGCATATGCG TATTCCGTTG
CCCCTTCAGT ACCAAGCATT ATTCAAGGCG GAATGGGAAT CCGGATTTCC CAATGGAAGC
TTGCGCGTGA GGTAGCCCTG AAAGGTGAGC TAGGTGTGGT TTCAGGCACT GCGATGGATA
ATGTCATGGT GCGGGAGCTT CAGAAAGGTG ATGCGGAAGG ATCCTTTTTT CGCGCACTCA
AGACTTTTCC TGATCAAGAT ATGGCAAACC GGATTGTCGC GCGATTTTAT ATAGAGGGCG
GAAAAGACCC TTCGGAGCCT TACAAATCAA TTCCAATGTG GACGCTCACC CCTAACCAAC
TACTCTTGGA AGCCAACGTT CTTGCAAACT ACTGTGAAGT TTGGTTGGCC AAGCACAACG
ACGATGGATC AATTATTGAT GGGCTCGTCG GAATGAATCT ACTGACAAAG GTCCAACTTC
CGACGATCGC ATCCCTCTAC GGTGCTATGA TGGCTGGCGT CGACTACATC ATAATGGGAG
CTGGAATACC GATCTCGGTT CCTGGATTTC TCGACAATCT ATCCGAATGC AAAGACTGTG
AGCAGAAGAT TGATGTGGAC GGAGTTGCAG AGAAAGAAGC CCCGGTTTAC AAGTTTTCAC
CGATTGCGTT CTGGGAAGCA GCGGGGAAGC CAGAATTGGC TGCTCCGTTA AAGCGCCCAT
CGTTCCTCCC TATAGTTTCT TCTACAATAC TCGCTCAGTC CCTTCTGAAA AAAGCCTCCG
GAAAAGGGCC GACGAGAGGA ATTCAAGGAT TCGTGGTGGA GCTCAGTTCC GCAGGAGGCC
ACAACGCCCC TCCCCGTGGT TTTAAATTTG ATCCTGTCTT CAGTACACAC GCAGGCGGTT
TAAATGAGCG TGGCGAACCT GTATACGGCC CCAAGGATGA AGTCGACCTG GCTAAGTTTT
GCAAGGCTTG CCAGGGTTTG CCGTTTTGGC TAGCCGGTTC TTACGCTCGA CCTGAACGAT
TTGCCGAGGT CCGAGCATTG GGAGGTGCAG GTGTCCAGTG TGGTACAATT TTTGCGCTTG
CTGAAGAATC TGGCCTCGAC GATTGGATCA AACAGGACAT TCTTCGCAAA CTGTCAGAAA
CCCGTTTGGA TGTGCTGACA GATCCCGCTG CCTCTCCCAC AGGATTTCCG TTCAAAGTAC
TCGATTTACC CCAGAGTCTT TCCCAAAGAG AAGTCTACGA AGCTCGTCCA CGTGCATGCA
ACCTGGGCTA CTTGCGACAA CCTTACAAAC GGCCTGACGG CAAAATTGGT TATCGTTGCC
CTGCGGAGCC GGAGGTAGCA TTTGCTAGAA AAGGTGGGGA TGCCAAGGCC ACTGTCGGTC
GTAAATGCCT CTGCAACGCC CTCTGTTCGA ATGCAGGGTT TCCGCAAGTT GGAGAGGTCA
AAGCCGTCAA CGGAGAAAAA ATGAAGTACG TTGAGCTACC CCTGATCACG ACTGGAGACG
ATATTAGTAG TTGCCGAGAC TTCATCAAGG AAGATGCTGA TGGTCATTTA GGCTTTCCTG
CTGGCGAGAT TGTGGATTAT CTGCTCTCTG AATGGAAAAG GAAGCCGGTC GGATCCGCAG
CCGAGGGATC GATGTCAATT TAAAATTGGA CAGCAGGAAT CTTAAATGTT TTATTTGACT
AGCATCAGAA TGTGACACTG ACAGTAAATT AAATTTTGAA CAAAGCGGTT G
 
Protein sequence
MKQFGIFSLA CALAFSSLAP AYAYSVAPSV PSIIQGGMGI RISQWKLARE VALKGELGVV 
SGTAMDNVMV RELQKGDAEG SFFRALKTFP DQDMANRIVA RFYIEGGKDP SEPYKSIPMW
TLTPNQLLLE ANVLANYCEV WLAKHNDDGS IIDGLVGMNL LTKVQLPTIA SLYGAMMAGV
DYIIMGAGIP ISVPGFLDNL SECKDCEQKI DVDGVAEKEA PVYKFSPIAF WEAAGKPELA
APLKRPSFLP IVSSTILAQS LLKKASGKGP TRGIQGFVVE LSSAGGHNAP PRGFKFDPVF
STHAGGLNER GEPVYGPKDE VDLAKFCKAC QGLPFWLAGS YARPERFAEV RALGGAGVQC
GTIFALAEES GLDDWIKQDI LRKLSETRLD VLTDPAASPT GFPFKVLDLP QSLSQREVYE
ARPRACNLGY LRQPYKRPDG KIGYRCPAEP EVAFARKGGD AKATVGRKCL CNALCSNAGF
PQVGEVKAVN GEKMKYVELP LITTGDDISS CRDFIKEDAD GHLGFPAGEI VDYLLSEWKR
KPVGSAAEGS MSI