Gene PHATRDRAFT_49960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49960 
Symbol 
ID7198650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp435324 
End bp436999 
Gene Length1676 bp 
Protein Length398 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184704 
Protein GI219129035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GATTTGCCGG CATTTACAGT TATCCATGTG CTTCAATTCT TATGTTGTCT ACGAATCAAT 
TTACCTTCCC GTATGACAAC GACACATGGC TTCTCAACGA CATCTCCGCA CTAATTGCTG
GTTCGCAGTA AGCGCGGTGA AAAACGTCTG AGCGAGCTAT CTCACGTGTT GATTCTAGAG
TAGAGAGCAG AAAATGCCAT AGAGCTTGAC TAGGTAATGG CGCCCGCATG GTTTCGCTCC
CACGGTTCTT CATGTACGAA TACTGAAAAG GAGCGAAACT GCTCTGAAGC TAGAGAGCGC
CTGACCCGTC ATACGTTTCG AACGGTAGTA TATCGTCGGA CTGGACCCTC TCCTCTGACA
CCGACGCCTA ACGCGTAGTA CACCAGGGCA TTGTCTCTCG TCTCGTGCCT CCTGGAGTTG
GTATACTACA GCATAGTACA CCAGCATCAT GTCCAACACA GCACGTACAC AGCGGAAAGG
TACTTTCGGT AACGGCTCCA AACGAAGGGC ATCATGGACC GTAACACCGC TTGTCGTGGG
AATCGTCGTC ATGATAGGTT TCGGCCTCTT CAACTACGAC GCAGCATTCG ATGCGAGCCG
TGGAGTTGAA AACACACTGC TGCATCTGGC CAGGGTAGAC ACGGCTAGCG CAATCGCCTC
CTTCACGTCC ACCTCGTCCT CTCCTTACCC TGGCTGGGAT CCAGCCAATC CCCTCGGGAT
TCCACAGGGC CAGGCACCCA ACCTACCGTC AATACGTGAC TCCACGTCCA ACGCTAAGCG
TGCCAATTAC GGCGGCCAAG GTGACAAGCC CCACCTCGGC GGCTTTACCG AATTCGACAT
CCAAGGCCTC TCTCCTCATG TATGGAAGCA CATGATTCAA GACTATGGCG TACACTCAGT
CCTCGACGTT GGTTGTGGAC GCGGAATAAG CACGTCGTGG TTCCACATGC ACGGCGTGCA
GGTTCTGTGT GTTGAGGGAT CCTACGATGC CGTACAAAAT TCTGTCCTAC CCGATCCAGC
GAATCAGGTC GTCGAACACG ATTTTTCCCG CGGTCCCTGG TGGCCCCGCG ACACCTTCGA
CGCTGTTTGG GCGGTCGAAT TTCTCGAACA CGTCAACGTA CAATACCACT TCAACTACAT
CGCTGCCATG CGCAAGGCCG CTCTCTTGTT CGTTTCCAGT TCCCGTTGGG GCGGCTGGCA
TCACGTCGAA GTGCATCAAG ACGACTGGTG GATTCGCAAG TACGAACTCT ACGGCTTCCG
CTACGACGAC ACCTTGACCC AACAGGTCCG TGAGTGGGCT ACCGAGGAAA GTCGGGACGT
CAACGCCACG ACGGCGCCCA ATGGAAAACC CTGGAACGCC CAACACGTAT GGTTGTCCAT
GAAGGTATTT GTCAATCCTG CTGTTGCCGC CCTACCCGCC CACGCCCATT TGTTTCCCGA
AAACGGCTGC TTCCAGGGAC GAGGGGAGGG TGGCCGCATT CATAATCGGG AGTGCGGCAC
CGGCAAGGAC GCTGCCCTGG AAACCAAACT CGATCCGGCA ATGTATCCGC TCACCCTGGA
TCCCAGCATG GATACGGCGT GGACGACACA TATTCAAAGA CACCTCGCAA AGTCCCAGAG
TATTCAATCG TCGCAAGCGC AATAGTCGCT AACTCAATAA CGATCGAACC GATTTG
 
Protein sequence
MSNTARTQRK GTFGNGSKRR ASWTVTPLVV GIVVMIGFGL FNYDAAFDAS RGVENTLLHL 
ARVDTASAIA SFTSTSSSPY PGWDPANPLG IPQGQAPNLP SIRDSTSNAK RANYGGQGDK
PHLGGFTEFD IQGLSPHVWK HMIQDYGVHS VLDVGCGRGI STSWFHMHGV QVLCVEGSYD
AVQNSVLPDP ANQVVEHDFS RGPWWPRDTF DAVWAVEFLE HVNVQYHFNY IAAMRKAALL
FVSSSRWGGW HHVEVHQDDW WIRKYELYGF RYDDTLTQQV REWATEESRD VNATTAPNGK
PWNAQHVWLS MKVFVNPAVA ALPAHAHLFP ENGCFQGRGE GGRIHNRECG TGKDAALETK
LDPAMYPLTL DPSMDTAWTT HIQRHLAKSQ SIQSSQAQ