Gene PHATRDRAFT_37756 details

Gene Information

Locus tag: PHATRDRAFT_37756
Symbol:
ID: 7202294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 880331
End bp: 881722
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181821
Protein GI: 219122997
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0114774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATTA TTCGGAAAAC CCTGCAGCTC ACGATCTGGC TATCCGCGTG TCTCGTCACC 
TCCTATTCCT ACAGCAATAG CAACGTGCCG GTAGGCAAAT CCAGTGCCAG GAACACTGAG
ACTTCCCCCG TCCCTATATC TTCGCATACC TTTTTGTTCC GATCCCACCC CATTGCATAT
GAGACCGCTG TGGTCAGATT TCCCACCAAG ATAGCCGTGC CGCCGCAATC ACCAACCACA
CCGTATCGAG ACGTCTCGCC GGTGCTGCTC CTGAACGGCT TTGGGGTCGG GTCCTTCCAC
CAACACCGAC TCATCCAAGC CCTGCAACAA CAGTCCGACC AATCTACAGT AACTGACAAA
AACAGCAATA GAGATGAACC CGCCAGTCTT GCTACTATTA TTTACACGCT TGATTATCTC
GGACAAGGTC GCTCCTGGCC CGTGGATTCC AACGATGGAC AAAGTGAAGC GGAATTGGGA
TTGCGCTACT GTGGACAAAC ATGGGTGGAC CAGATTGTAG CATTTTTGGA GACAATCGTT
TTGCCTGCTC GTGAATCCTG TTTCTCGTCC ACGAGACACT ATACTGCTCC TCCGGAACGA
GTCCATTTGG TAGGCAATTC TGTCGGCGGA CACTTGGCCG TATTTGTGGC TGCCTTGCGA
CCCGACTTGG TAGCCTCCGT CACCCTGCTC AACGCCACTC CTGTTTGGGG ACTCAATTTG
CCCGGCTGGA CCGGTCATTT GCCGGCTCCT TTTCTGCCCA AGACCATTGG TCGATTTCTG
TTCGATCAGA TTCGCAATCT CAACACAATC GAACAATATT TGGCGGCGGC GTACGTCCAT
CGGGAGGCGT TTGACGCCAC GCTCATGCAA CAAATCCGAG CCTGCACTGA AAGTCAAGGG
GGACACGCGG CCTTTGCCTC GATTCTTTGG TCTCCTCCCG TGACCTTACC GACGAAACCA
AATGATGCTC CAAGCAATAC CAAAAACGAC TACAAAAAGA TCAACGCTTT CGACGAAGCC
CTTTCCCGGC TCGAGTGTGA CGTTTTGCTA TGCTTTGGAG CCGACGATCC TTGGTGCAAA
CCGGCCTTTG CAGCGCGTAT GCTCCGAGCT CTGGGACAGC GTCCAACGGG TAAGGTCCAG
CGATACGTGG AACTCTCCAG CGTTGGTCAC TGTCCCAATC ACGAGGCGCC AAACGCCGTA
GCATACGTTT TGCTACCCTG GTTGCTTTCG TCAAATGCAC AACGCCAACA AATTGCATTG
GTGCCAGCGC CACTCTCAGA AGACAAACGA ACGTCAGTAC GAGAAACCTG GGGGGTCACG
GAATTGACCG AACGCCAAGC CGACGACATT TCTTTATCAT TAGTGGATCG ACTAGCCGTA
CTATTTGTAT AG
 
Protein sequence
MKIIRKTLQL TIWLSACLVT SYSYSNSNVP VGKSSARNTE TSPVPISSHT FLFRSHPIAY 
ETAVVRFPTK IAVPPQSPTT PYRDVSPVLL LNGFGVGSFH QHRLIQALQQ QSDQSTVTDK
NSNRDEPASL ATIIYTLDYL GQGRSWPVDS NDGQSEAELG LRYCGQTWVD QIVAFLETIV
LPARESCFSS TRHYTAPPER VHLVGNSVGG HLAVFVAALR PDLVASVTLL NATPVWGLNL
PGWTGHLPAP FLPKTIGRFL FDQIRNLNTI EQYLAAAYVH REAFDATLMQ QIRACTESQG
GHAAFASILW SPPVTLPTKP NDAPSNTKND YKKINAFDEA LSRLECDVLL CFGADDPWCK
PAFAARMLRA LGQRPTGKVQ RYVELSSVGH CPNHEAPNAV AYVLLPWLLS SNAQRQQIAL
VPAPLSEDKR TSVRETWGVT ELTERQADDI SLSLVDRLAV LFV
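
The lengths listed under Gene Information can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene is an unspliced CDS whose final codon is a stop codon (the gene sequence above ends in TAG). The function names are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS that ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters (such as the spaces between
    10-base blocks in the record) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


length = gene_length_bp(880331, 881722)
print(length)                              # 1392
print(expected_protein_length_aa(length))  # 463
```

Both results agree with the Gene Length (1392 bp) and Protein Length (463 aa) fields. Passing the gene sequence text to `gc_percent` gives a value to compare with the listed GC content.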