Gene PHATRDRAFT_47512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47512 
Symbol 
ID7202288 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp854246 
End bp856632 
Gene Length2387 bp 
Protein Length733 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181818 
Protein GI219122991 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGG TCGATCCCGG GGAACTAGCC TGGCGTCGAC GTTGGGCCAA AGCTTTCCCC 
ACACTTGTGC CGCACTTGAT GGATGTCGGG CCGTCGCCAG CGACCTTGCC TGTACCGATT
ACGAAATTGC CGTATCCGGA AGAATTTACA GCTACAACCG ACAATATGGA TCATTTGTGG
TGGCAGCGTC TGGCTTGGCG AACCTTTTGG GACGAGTACA CGGACGACCG ATCGCTCCCG
GACGTGGCGT CCGACGATTG TATCCCCAAT ATTAACAGCA ACAGCAACAA CACTGTAAGC
AACGCCAACA TCAACAACAT CACAGGCCTT CCTCCCGAAG ATCCCCCTTC CGTACTCCGC
AACATTGCGA CCTGGAATCA GAGTTTGGGT TACGGAGAAA TCACCGCCGA AGCGACCTGG
ACGTTGGTGC GTACCATCCA ACCGTACCTG CCATCATCCC AATCTCTGAC AGCCGTGGAT
TTGGGTAGTG GGAACGGTAA GGTCTTACTG GCGGCAGCCT TTGCCTATCC CTTTCGTAAA
CTCGTAGGCC TGGAACTCCT GCCCGGTCTC CACGAAGACG CCCTGCGGTA CCAAGCCTAC
TGGCAACAAC GGTGGAAAGG TACAGCCGCA TCCAATACAA CAATCCCGAC AACACTAGGA
GCAGCAGCAA TATCGACAAC CACCACAGTA TCGAATCTCG ATCCGTCCAC CCTAGCCACA
AAATCCAGCG AGACCGGCGC TCTTTCTCCG GCCGTGTTGG AATTCTACTG CGACGACTTT
ACCAATAGTC CAAACGTCGA TTGGATCCAT CAAGCCGACG TGGTCTTTTG CCACGCCACC
GTGTTTCAAA CAAAACTCTT GGAGCGTCTG CAAGTCTGTT GCGAACAAAC TCGTGAAGGG
ACCTTGTTTT GTATGGTCAC CAAACCCTTG CAGTGCAACG CACGCATCGA AACGTTGGCC
GAAATACGAC TCGATATGAG TTGGGGACGA GCGGCCGTCT ACGTGCAACG CCGTAGAATC
CATCATCGAC TACCACCAAC CGAGACAGCG ACAGATTTGT CTCTTAATGT CACCACGACG
ACTGTGTCAA CCACGTCTCC GAACGATCAA TCGGCCCCAC AGGCCACTGA ACTAAGAGCC
AACGCCGATT CCTCCGACCG TGGCACTAAT TGTACCAGAC ACAGCATGGC TTCCCAAAAA
CGACCAATGT AAGTATCGTC TCGTGCATTT ATTGGAACTT TTGTTGTACA CGAGAGACAT
CCTTTCTCAA ACGGATTCGT TCGGCGATCC AATGTAGGCC TACAAGTTCT ACCGAGACGG
CTGTAGCACA AAAGAAGCGT CTAGTGGCCG ATTCCGAAGC AACGCTTGCA GAATCTGCCC
TGTCGTTGAA CGAGGTACCG GAAACAATGG AGTCACCGCC TCATTTGGTA AGCTTCGAAG
CACCTTGTCC AAGAGGCGAG CCGAGAGAGG AAAAAGTGTG TTCGACAGTC GCGAGACCAA
TCGGAGATGA GTACCGAATT TTGCATCTGG ACGATTGCCC CGATCTTATA CACCAAGTTG
GCGGTGCCGT CCTTACGTGC CCACCAGATC TCGAATCCAT GCTGTGGGAG ATGGGCGACA
GCGTTCCGGA TCCTACCAAA CTGACGGGAC AGCAACAGTA CAACCGAGTA TCGGGCAAAT
CTTCCATGAG TGTACGGTAC CGTATGTACG ATGGCAGCAA TTTGCAGCAA TCGTTCGCCG
CCAGTAGCAA TACCCGAACG GAAAAACAGT TCCTCGCCCG CTGCGGCCCA TCGCTCGATC
TATTTGATGC GGTCCTGCGG CAATTTGTGC TGGAGCAAAA ATTTTGCAAG TCCTGGCACG
AGTCATCAAC CATGAAGTAT CGCTTTTCGG TAATGTTCAC CGACAGTCAG GCCGTTCCAC
AAAACGCTCA CATTGATTAC CAATGGGATG ATTTGGACGG ATCAGATCCC ATGCCGTATC
TCGGATTTCT ACCGTTGACG AAAGCGGGTA TGTTTTTACA ACTGTGGACG GGTAATCCGG
ATGACGGTGT GATCAAAATG GGGAACATCG TGTTTGTACC GTGGGGAAAG TTGCTCTTGG
TTCCGGGCAA CACGGTCCAC GGCGGGGGCT TCCGTACGGG TAATCACGGC AATTTACGGG
CGCACTTTTA CATTCACTTT GGCGTCCTCA AGGTCAACGC CAACAATCAC TACAAGAATC
GGTACGGGTA CGATCTTTCC TTGACGCATC TGCACAATCC ATCCAACGAT TTTAGCAAGT
TTTGGAACTA GATGAATGGA ATTGCTGTCG CAATCGCTAC GCAAACAGAT TGAATAGATT
TTCCCTGGGG AGAAATTTGT AAAATATAAG CTTATGCTCA TTTTTTG
 
Protein sequence
MATVDPGELA WRRRWAKAFP TLVPHLMDVG PSPATLPVPI TKLPYPEEFT ATTDNMDHLW 
WQRLAWRTFW DEYTDDRSLP DVASDDCIPN INSNSNNTVS NANINNITGL PPEDPPSVLR
NIATWNQSLG YGEITAEATW TLVRTIQPYL PSSQSLTAVD LGSGNGKVLL AAAFAYPFRK
LVGLELLPGL HEDALRYQAY WQQRWKGTAA SNTTIPTTLG AAAISTTTTV SNLDPSTLAT
KSSETGALSP AVLEFYCDDF TNSPNVDWIH QADVVFCHAT VFQTKLLERL QVCCEQTREG
TLFCMVTKPL QCNARIETLA EIRLDMSWGR AAVYVQRRRI HHRLPPTETA TDLSLNVTTT
TVSTTSPNDQ SAPQATELRA NADSSDRGTN CTRHSMASQK RPMPTSSTET AVAQKKRLVA
DSEATLAESA LSLNEVPETM ESPPHLVSFE APCPRGEPRE EKVCSTVARP IGDEYRILHL
DDCPDLIHQV GGAVLTCPPD LESMLWEMGD SVPDPTKLTG QQQYNRVSGK SSMSVRYRMY
DGSNLQQSFA ASSNTRTEKQ FLARCGPSLD LFDAVLRQFV LEQKFCKSWH ESSTMKYRFS
VMFTDSQAVP QNAHIDYQWD DLDGSDPMPY LGFLPLTKAG MFLQLWTGNP DDGVIKMGNI
VFVPWGKLLL VPGNTVHGGG FRTGNHGNLR AHFYIHFGVL KVNANNHYKN RYGYDLSLTH
LHNPSNDFSK FWN