Gene PHATRDRAFT_46971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46971 
Symbol 
ID7202079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp44406 
End bp46496 
Gene Length2091 bp 
Protein Length696 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181293 
Protein GI219121896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTCT CTTCTCTCTG GAAAGTCTTG GACAAGGCCG GTTGCGCCAC CCCAGTGGGT 
ATAGCAGAGT TTACTGTTTA TGACCCACAG CACTACGCTC GTAGTCAAGA AGCACCCAAT
GAGGCGAGTC CGTGGAATCA CAACCTGGGC GTTCGTCCAT CTGCAATACG AGAGCAACAC
AAGGCAACCA CATTGGCGGT GGATTTGTCA ATTTGGATTT GTGAGTCCCT TACGTCCCGT
GCGATGACGG AAAATCACGC CAATCCGGCT CTTCATCTGG TCTTTTCACG AACAATGAAA
TTGCTTTCCC TGGGAATAAA GCTTATTTTC GTGCTTGAAG GCAAGCGACG CGTGCAAACG
GCGGGAAAGC GAGACAATTT TCGTAATCGG CGCAGTGGCA CTACCTTTTG GAAAGCAGGT
GAGCAATGTC ATGACTTGTT GACACGGCTT GGCATTCCAG TTTTCCGGGC CAAAGCCGAA
GGTGAAGCTC TTTGTGCTTT ACTTTCACAG CGCAATATTG TTGATGGTGT AATTTCCAAT
GATGGGGATT GTCTACTATT CGGAGCCCGA GTTGTATACA CGAAATTCTC GGTCGAAAAC
CTTGTTGAAG GTAGTGTTAT GCGGTATGAC CTCGGCAATC TTCGAGCCCT GATTGACCAC
GCCGGCGACA AAGAAGCCTC AGACCAGCTT ACTGGATCGC TTTCTCTCAG TCGTTTTGAC
CTCCTCTCTT TTGCATTGCT CACCGGCAGC GACTTAGCAG GGAACGGACT ACCGAAGGTT
GGGCACAAAA AGGCCATTCG TTTCATTCGA AAGTGCCAAA TCGACAACCC CCTTACGACT
GAGATGGCCT CCATTGATGA GGTGAAATCA TGGGCAGTTG CCGCTCATGT TCGACCAACC
AACCTGCCAC ATCAAACGAA AGCGAACGAA AAATGCTGTA GTCGCTGCTG TCATATCGGA
ACCAAGCACA GCCACGAGAA ACTAGGGTGT GAAGCTTGTG GTACTGCTCC TGGAGAACCA
TGCTATGCAT TCTCTACGGA AGATCGCTTC CGAAAGTCTC TCCGCGAAAA AGCTCTAAAA
GTCATTCCGA TTTTTGAGCC TTCCCAGGTT GTTGAGGCCT ATCTGCGACC TAACGACAAT
CAGCTCCCCG CTCAGCTGGC TGGCAAGACC TCAAATCAAA TAAAGATGGA CCCACCGGAC
CTCAACGCTC TCTTGCAACT CCCTCTAATA CTCAAGGGTC GTAGTCAGGA AGAGAGTCGC
GAGTACTTGG TTCGTTCAGT TGGCCGCCTC TTATCTCGTG CTGAACTATT TCGAAAGCAC
GAGGAAAGCG ATGAAAAAGA ACTGATGACT TCTTATCGCC GAGCGCGAGA AAGACCCATT
CCCCACAAAA TTACGAACTC AATGACGCAG AACGGAGTGC CTTCTTACGA GGTAAAATGG
TTGGTTAACG CTACCGTGAC TGATAATTTC GGTGAAGGAA TCGATGGCTA CGAATATTCC
ACGGTTGAAC CTTGTGACCT GATAGCGAAC CGTTACCCCG TCCTAATAAA GGAGTTTCAG
CAAGCGGAGA AAGAGCGTAT GAAGCAAGGA GACGGAGAAA AGCTTCGTCG TTTAGACTTT
CTACAATCGC ACCTGTTCCT TCACAGAGAC CCGCAACGTC CTTCAGAAAC CGGCGACGCA
AAGAAGGTGA AACGATCGAA GCACCGAGCA GGATTCTTTG AGACCAAAGA AGCAAGCCTT
CCAAAACTCT ACGCATCAAG CAGACGAAAT AATCGATCCA AGCGCAAGAC GGGAGATGAC
GTTGGTCATT TGTTACGTTA TATTGTCCGT TCTAGCGACA TCACACAAGA TCCATTGGGA
TTCAGTCCGA TTGACAACTC CAAATATGGA TTACTTCCTG GAACGAGCTT GCGCGGTCAC
TGTACTCCAT CGCCGCGAAA GAACGTATAC AAATCAAAAG CTTCATTGCA CTCACCGGAA
GGAAGGGTGT TTTGCAATAT GGGTGGTGTA TTCATTGAGA TTACGCCGAT TATATCCAAC
CGTCGTACCT TCCCACCCCG TCACATATTC ATTCGCCGGA GTTCAGCCTA G
 
Protein sequence
MTVSSLWKVL DKAGCATPVG IAEFTVYDPQ HYARSQEAPN EASPWNHNLG VRPSAIREQH 
KATTLAVDLS IWICESLTSR AMTENHANPA LHLVFSRTMK LLSLGIKLIF VLEGKRRVQT
AGKRDNFRNR RSGTTFWKAG EQCHDLLTRL GIPVFRAKAE GEALCALLSQ RNIVDGVISN
DGDCLLFGAR VVYTKFSVEN LVEGSVMRYD LGNLRALIDH AGDKEASDQL TGSLSLSRFD
LLSFALLTGS DLAGNGLPKV GHKKAIRFIR KCQIDNPLTT EMASIDEVKS WAVAAHVRPT
NLPHQTKANE KCCSRCCHIG TKHSHEKLGC EACGTAPGEP CYAFSTEDRF RKSLREKALK
VIPIFEPSQV VEAYLRPNDN QLPAQLAGKT SNQIKMDPPD LNALLQLPLI LKGRSQEESR
EYLVRSVGRL LSRAELFRKH EESDEKELMT SYRRARERPI PHKITNSMTQ NGVPSYEVKW
LVNATVTDNF GEGIDGYEYS TVEPCDLIAN RYPVLIKEFQ QAEKERMKQG DGEKLRRLDF
LQSHLFLHRD PQRPSETGDA KKVKRSKHRA GFFETKEASL PKLYASSRRN NRSKRKTGDD
VGHLLRYIVR SSDITQDPLG FSPIDNSKYG LLPGTSLRGH CTPSPRKNVY KSKASLHSPE
GRVFCNMGGV FIEITPIISN RRTFPPRHIF IRRSSA