Gene PHATRDRAFT_48186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48186 
Symbol 
ID7203319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp445998 
End bp448976 
Gene Length2979 bp 
Protein Length992 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182536 
Protein GI219124492 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCCA TTGTGTTGCT ATCTGCCACT GCCCTGATGG GCTGTGACGC CTTCCAGGCT 
TGGAGCATTA CCGCTGCACA GAATTATCTT GCTCGCTCTT TCTTCGATGC GCATCGGATC
GATACTGAGT TTCGAACAAG ATATCCAAGA CACAACGTTT TGTCCCGCTT ATCCGGGTCC
ACTTCTGGCA TTGACCCTGC TCCCCAGGTC TTTGCTTCTG GCTTTTCCAC GAAAACCGAT
CTCGTCGAAG CTTTACAGGA GGCAGTGGAA ATGGCTGTTA GAGCTTTGCC TCCGGCAGCA
GCAGAAAACC CGATCATCGA TTTGTGTACA GTATCGGTGT CGTCTTTATA TGACGGAGGT
TCAAGCCCAC CGACAACGGT GGTAATCCCG ACAATTGTAG AAACTGCCCG GTCAAAGTAC
GGGATCATTC AACACTTGAT TGGTAGTTCA GTGGCGGGGT GCATTGCGAG CGTCGCAACG
ACTGAAGTTG ATAATGCTTT GACTGCTTGT CAACCTGTGG AGCTCGATGG AACTCCAGCT
GTTTCAATTT CTCTGGCGAT CCTTCCAGAT GTACAGCTAC GCACCTTTTT TTGCCAATCG
GCTTATGTTC CTGACGATAT TGGACGCATC AGTCCTGCAG AATGGAAGCG GGCCGTTGGA
CTCAGTGGCT TTTTGGAATC AAACAAGAAA GATGGTTCTG ATACAGAACT TAGTCAGCAG
GACTCGGTTG TCATGCTTTT ACCGAGTCCG GCCTTCTCGA CAGAGCTCGA TGACTTCTTG
CTCGGTCTCT CCCTCTACTT ACCCCGAGCC CAAACTTTCG GCGGCATAGC AAGTACGGTC
TCATCTCTCT CGCGCGCCAA GCTATACCGA TATTCAGCTG CCAGTCACTT ATCCGAGTGT
TTGTCCGATG GTTGCGTTGG TGTTGCAATG ACAGGGGATA TCCAGATTCA AAGCATGGCT
GCGCACGGCG CCAAACCTGT CGGTGGTATT TATCAAATAC TCAAAGGCCA AGATTCAACA
ATCGGCGTTA TTGTCCTGGA CGAGACTGCA ACTCAGGCCC TGAAGGACGA AGAAGACAAC
GTCGATAACG ATAGCGACAA TGATTCAGAA GAGTCAGAGC CATTGGACAA GAAAGCTGCT
TTAGCACAAG CGTACGCCAA GGCCCAGATT CCGAAACCTG TTTTAGCCGA AGCCAATTTT
TTGATGAGAA CACTGTCAGA TGAAGATCAA GCTTTTATGC GTCGACAGCT TTTAATTGGG
ATAGATAAGG GTGGTAGTAT CGGTAGGTCT GCGAGCGAAC TTGCGAGACT ATCGGAAGGC
GAAGGACACA GGTTCACCGT GCATAAAGTA GCCACTGCGG GCATGAAGGA TGGAAGTGTC
ACGTTCTCGT TGGGAAGTAT TGATGTTAAG ACTGGTACGC GTATGAGATT CTTTGTGCGC
GATTCGGAAT TTGCCAAGAA GGAAGTCGAA GCTTTATGGT TCGGATACAA GAAGCGATTG
TTAAACCAGC AGTTTGGGAA AGGCGAGCAT ACGACAGATT CCACTTTCAC ACGGTCGGGA
TGCTTTGTAA TTCCAACTCT TGATCGAGGG AACAAGTTCT TTCAAGGCAA ACCTGGGTAT
GAAAGTGGAA CCGTTGCTCG CATTCTACCT ACCCTTCCTA CAATAAGTGG ATTTTTTTCG
AACGGCATCA TTGGAATTAC GGAGGGCGAC GGTGATACTA GCACGGGTGT GCAGGGCAGC
GCGACAGGTT ACACATTGAT TGGCAGTAAA ACGGATCGAC CAATATTTTC ACCAGCAGCT
GCAGCCGCTG CTCATACTGC AGCACAAGAA GAGAAGGAGG CCCAGGAAGC GGAAGCTGAA
GCTCAGGCTC TTGTTGCGGA AGCTAACAGT AAGGTCGGGG AAAGTTATAC CCAAGGAAGC
AATGGCGTCG TGAAGACAGC ACCTCGTTCC GAAGATGGAG AACTCATAAT CAAACGCCGA
GAGGTTCACT CCGGGCGAGC CATGACAGTT TCAGCCGTCG AATGGAGTGT AGCGGAAAAG
GCGGCAATTC CAACGAGCAC ACTGGAAGGA TTTATGTGGG ACAAAGAAAC AGAAGTTGAC
CGCTTCCGAG AGCGGGTACC TCTGGTCAAT CTGGTATCTC AGTGCAGATT ATCACAAATG
GATCCAAAAG CTCCTAAACC TCGAGGCTTC GTTGTACCAA TCCAGCAAAT GGTTTCGGAA
GGAAAATTTG TTGTTATTCC AGAATGCAAG CGAATGGAAC CCACGATCGG GAGTTTGCGA
CGTCGCTATG ATTTGAGTAA GCTTGCTCGC GATTTCACTT TTGACGGCGC TGTAGCCATT
AGTGTGAATT GCGATGCAGT CCTCTTTGGC GGGTCTCTGG GCGACGTCAC TGCAGCACGT
GAAGCTGCTG GTAGCGCCGT GATTGATAGC ATATCAGAGG AGGGAGTCGT CGTCCCCCCA
ATTCTCGCAT CCGACTTGAT CCTTTATCCG TATCAGCTTT ATAAGCTGCG TTTGGCTGGC
GCCGATGCAA TTAACCTCTT GGTTGGAGCT CTAGAGAAGA AAGATCTGTC ATACCTTACC
AAGATAGCGT CTAGTCTTCA GCTTCAGTCA TTTGCCACCG TAACTTCCGA AGTGCAATTA
CTGGAAGTGG CAAGTCTGCA GGAAGGGACC ATTGACGGAA TAATCGTATC CAATCGTGAG
CTTGAAGACT TTTCCTTCGA TATGACTGGG GAGCAAGCAT TGTACCTGTT GAAAAGCAAT
GCTCTGGCGA AAGTCCGCGC AAAACATGGT GAAGACCTTC TTATCTTGGC TGAAGGAAGA
GTCGGTATAA TCGATCGTCC TCAGGCAGAC AGCACAAGAA GTGCTAAGCT TTATATTACC
GAATTAAGGG AAGCTGGCGC AGTGGGTGCG ATAATGGGTG GTGCATTGGC AGTGGACGGA
GGGGGGTATC AGCAGGTAGC GAAAATGGCG CAACTGTAG
 
Protein sequence
MKSIVLLSAT ALMGCDAFQA WSITAAQNYL ARSFFDAHRI DTEFRTRYPR HNVLSRLSGS 
TSGIDPAPQV FASGFSTKTD LVEALQEAVE MAVRALPPAA AENPIIDLCT VSVSSLYDGG
SSPPTTVVIP TIVETARSKY GIIQHLIGSS VAGCIASVAT TEVDNALTAC QPVELDGTPA
VSISLAILPD VQLRTFFCQS AYVPDDIGRI SPAEWKRAVG LSGFLESNKK DGSDTELSQQ
DSVVMLLPSP AFSTELDDFL LGLSLYLPRA QTFGGIASTV SSLSRAKLYR YSAASHLSEC
LSDGCVGVAM TGDIQIQSMA AHGAKPVGGI YQILKGQDST IGVIVLDETA TQALKDEEDN
VDNDSDNDSE ESEPLDKKAA LAQAYAKAQI PKPVLAEANF LMRTLSDEDQ AFMRRQLLIG
IDKGGSIGRS ASELARLSEG EGHRFTVHKV ATAGMKDGSV TFSLGSIDVK TGTRMRFFVR
DSEFAKKEVE ALWFGYKKRL LNQQFGKGEH TTDSTFTRSG CFVIPTLDRG NKFFQGKPGY
ESGTVARILP TLPTISGFFS NGIIGITEGD GDTSTGVQGS ATGYTLIGSK TDRPIFSPAA
AAAAHTAAQE EKEAQEAEAE AQALVAEANS KVGESYTQGS NGVVKTAPRS EDGELIIKRR
EVHSGRAMTV SAVEWSVAEK AAIPTSTLEG FMWDKETEVD RFRERVPLVN LVSQCRLSQM
DPKAPKPRGF VVPIQQMVSE GKFVVIPECK RMEPTIGSLR RRYDLSKLAR DFTFDGAVAI
SVNCDAVLFG GSLGDVTAAR EAAGSAVIDS ISEEGVVVPP ILASDLILYP YQLYKLRLAG
ADAINLLVGA LEKKDLSYLT KIASSLQLQS FATVTSEVQL LEVASLQEGT IDGIIVSNRE
LEDFSFDMTG EQALYLLKSN ALAKVRAKHG EDLLILAEGR VGIIDRPQAD STRSAKLYIT
ELREAGAVGA IMGGALAVDG GGYQQVAKMA QL