Gene PHATRDRAFT_42902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42902 
Symbol 
ID7196474 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1469998 
End bp1474979 
Gene Length4982 bp 
Protein Length1343 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177298 
Protein GI219111093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAA AAGGATCAAA TGATCCGCCT CTCGAAAAGA AACGGCGACT CTCTGTGAGA 
GATCTTCCCC AATTTTGCGA CACAGCCGGG ACCGTATCGG TCGCCAGTTT TGGAGCCCGT
CGTTTGCCCG AGCTCCGCCA TCTGTGGGAG AAAACCTTTC GGCGAGCTTC TACTCCCGAT
CCCAGTAATG TGGAATCATT GAAAAGTACA GGTTGTAAGA CTTCGCGTCG TCATTTGCGA
CGAAGGGCCA CGAGCCATTT GGGCCGCAAG CATCATCGTT TTCCAGCTGG ACAGCAAGAC
GCCAACGCGC AGGATAAGAA GTCGGAACAT TCGATTATCG AAGAGCCAAA GCTAACTCGC
CGTAAACGAC GGCGCAACAC GGAACATTTG CACTCCTCTC ACGAAGAATG GAAGGTTCCT
AAGGGAGCAT TGGGTATACC ATCGACAGAG GACAAGGAAT CGAATGGCAA CAATCATCGT
CCTGATAACT GGATGCCAAC CCATCTTTGG CACGTGAAGC GATTTCACAT GGAATCTATG
TGGGGCTGGA AAGTGCCCAT GCTGCATAGT AACCGAGGCG CGAAGGCAGC CTTGCGGCTA
GCTCGCGAGG GGAAAACGAT CCTGCATGAC GCCACTTGGA CGTCTCAACC AGTTTGGCTG
CGTATCGCGA CAGAGGTTGT CCCCGATATG ATAGACCAAA TACGGATCAT AATTCCTGAC
TTTGCTTTGG ACGAGACAAA ATTGCTTGAA GCGTTTGCAT ATCAACCAAG TGCGTTTCCT
AAAGGAGGCA TCGGCCCGGT TGCTTGGCTT TGCTCCACCT CACCTCTTTT TGGTAGTGCA
AAAGACGACA CACAGAGCTG CTATATCTAC TTTTTTTTAC ACCCTTCCAT TCAGCAAACT
CTGCTACACA TTTTGCGACA AATACTGGAG CCTTCGACTT TGACTGGGCC ACTCTGCTTT
GTTCACGGGG GTCTATCGTG GTTAAGACTG CGTGGGAAGG CTGCCCTCGA AAGTCTCCTA
ATGGCGATGC TTTCTATAGC AAAAGACGAA AAAAGAAGTA CGGACATTGG CGCCATCATT
GCTGATTTAG AAACGGGTAT TGATTGCCAA CTCTCGCACG TCCAGTGCAG AGCAGCTACT
AATCACGAAA CCCCTAACAG CCAAGAAGTC AAGCTCATCT TACGCCGTCC GGTACAAGTT
GACTTCCATG GAAATTTTGG ATCTTGCGGT GTGGATGTTG TGTGCGAGCC TAGTTTGGGT
CAGAAATTGC TGGTCGCTTT AGTTTTACAT GGTATGGCCT GTCCGATTGG TGTAACGGAG
CTAGCGCACT TGGAGCTGGA ATGCGAACCT CCACGGCCGT TGTTTCCCCG GGACTATCCT
GACACCTCAG TCGGGATGTC ATACTGGAAA GCAGCCCTGC CAGAATGGAA AATTTTACGG
GCTTACTACG AGGGTGGCTT GGGAAGAATC CGACCAGATC GTACGTCTCA GCAGGTGTTG
ATCTCGTGGA ATACGATACT ACCTACCGAC TTTCAATTGT TAGTCTCGGA GGATGAAACC
ACCGTTGTCA CAGTGCGAGG TTCCTTTGGT CAACCCTTTC AGCAAGCCAT TAATGGAATA
GGGTATATCC ATAGGAACGG GGACAAGGCG TCAAATAGTG GAGCAGCTGC AGAAAGTCTT
GTCATTCGGC GGAGCCGTCG CAAGTCCGGC AATCCATCGT TACCCGTTCA AGCGCCACCC
ATTTCTCGGG ATCAAGTATC GCAGCATAGA AACCTTTGCA AGACACTATT GCTGTCCTTG
TCGCTACCGG CTGTCCTTCT TGCTCATATT TGCTTGCACG AAAAGGGACA ATTGGACACG
GGTACACTTT TGTACGCCAC GGACGGTTCC GTCAATGAAC CATTGGGTGT CGTAACGGCG
GCCATCTTTT CGTCTGGTCG GGGGCGGTAC CACGGGATTG CGGTGGTTGG TGCGGCGCGC
CTTTTAGAAG CCTTGCAGCA CGCCCACGAC AGTTGCGCGG GTAGAATTGT AGCGCCGCGG
GTTGGGCCCA AACGTATTGA ATTGGAAATT GGCGTCGCTA GGTCAAACAC CTACAACGAA
AAGCATTCGA CAGCAAAGTT GACAATGTTT GGCACACTGT CGTTGCTGCT TTGATGGCAC
ATCCGGTTCG GTTGTCGAAG TTGGGGCGGT GGGCCGCGTC TGGTGCGTAC GCAAGACAAC
GCAGAGGTGG CGGTTAGATG GCCTAGTTTT TGCTTGCAAT GAGATTTGAA CTGAGGTCAA
AGGTCCTGTA AACGTTTGAC ACACCATAAC TTTGTTTTGC ATACGCCAGC CACTTCCTCA
AGAGGTAGGT GTAGTTTCTG TCCGCCATGG CAGAACTGGA TATTGACTCT CTAAAAAAGC
ATAATTGTAA ACAACGAAAG AGTTGAAGTA CGACAACGAT GACGTAGAAC TGACGTCAAC
AGGAGCTGTA CGTCAGTTCT ACTATAGCTA GCAGTAGCCA ACGAAAACGG CAAGAGATTT
TGTTGACAGT GACACGCAAA TTTTCTCCGG TCCGGACAAT ATCGTCCCCG CCCATCACTT
TGACAGTGAG CAGAATATCT ACACTGTTTA CTGGCACTGT TGAAGCTTGT GCTTACTCGG
TGGCTTGTTG GACTACGACG ATGATAAAGG CATCGCATTT CTACTCAATT GCGGCAGTCT
TCTTTGTTGG CGCACTGCGA CTGACATCCG CCTTCACTGT CGCCAACCCG ACGAGTGCGG
CGCTCACTCC GGACGCGGGC CTCCAGGATG TCAAGCCCAT CCCAATCACA GTTTTGGCCG
GTTTTCTGGG CTCGGGTAAA ACCACACTTC TTCAGAACTT GCTCGAGAAC AATGAGGGCT
TGCGTATCGC TGTCGTGGTC AACGACGTCG CATCGGTCAA TATCGACAGC AAACTGGTCG
CCAATCAGAA TTTGGCCTCG GGCATGGTGG AACTACAAAA CGGCTGTGCC TGCTGCTCAC
GATCGGAAGA GTTGCTTGCC AGTGTGCAAG AACTCGTCAC CTTGAGCGAC ACACGAGGAG
AAGGCGAAAG TTTCCACCAC ATCGTCGTGG AAATGAGCGG TGTGGGGGAT CCCCGCAGCG
TCCGGGCCAA ATTCCAGGAA GCAGTCCTGT ACGATATGCG TAAGTGAGGA TGACGAAACC
AGCAAGCAAA GATGGGGAAT GGGTTGTTTT GTGACCTGTC TCCGTATGGT TAAGATTTGT
AAAATACTCG GGCTTTCGGA AATGTTCGTG CTCTAATTGC AATTTCATTA TCTTTTCATA
GCTTTGATGC AACGTGCACA GCTCGATACC ATGGTCACGG TGGTAGATTG CAGTTCCTTT
TTGACCAACT TGAACTCGGA CAAGGTTGCC ACGCCAGAGG ACACTCCCGA GTTGTACTAT
CGCGACGAGG ATGAGGCCAA AGCTGATCGT AAGTGGATGG AAGATGACGA CTTACCTCCA
GGTTTGTTAG AGGCGATCGA GGCCGGAGAT CGGGCATCGG CTAACGCAGT TGCCGATTTG
CTCGTTTCGC AAACGGAAAT TGCTGATATT GTCCTCCTGA ACAAAGTCGA TCTCGTGGAT
GAGAGCAGTC GTGATATGAA ACAGATCGAA AACATTGTTA CAGCTTTAAA TCCTCGGGCG
ACTCTGCTCA AATCCGCGTT CGGAAAAGTT TCTCTACAAC AAATTTTGGG GGTGGCTCAG
GGTATGGGAG TAGCGGAAGC AGGTATTATC GATGACCATA AAGATGCGGT CAATGCCGCG
TTAGAAATGG CGCACGATCC TGATTGCGAA GATCCGAATT GTGTAGATCC CTCGCACTCC
GAGGTTGTAG CCGTAGTGGA CTGCGCCAAG CCCGACTGCA CCGACAGTAG TCATGAACAT
TCGCATACAC ACGCTTGTGA CGATCCGGCC TGTGACGATC CTGCTCATGC CAAAATGACC
GTATGTGGAG AACCTGGTTG CACCGACAGC CACGAACACT CGCACACTCA CGCTTGCGAT
GATCCAAGCT GTGACGACCC GGCCCATGCA AATACTGTCG CCGAAAGTGT CTGTAACGAT
CCTGGCTGCA CTGAAAGCCA CGAACATTCG CACGCACACG CGTGTGACGA CCCAAGTTGT
GATGACCCTT CGCATGGGGT TGATGTGGGC ACGCATGCCG GAATCGGAAC GTACGTTTAC
ACATCGCGCC GTCCCTTCCA CCCTACGCGT TTGCTTTCCT TTCTGCGAAA TCTTCCTGCA
ACGCGTGGTC TACCACCACT GGAAGCAGGC GAACCGGATT TGGCTGTTTC GGCCACCGCG
AAGTCGGCAA TGAAGAAAAT TCTTCGAAGT AAGGGGTTTG TTTGGTGTGC GGATTCCTTT
GAAGTTGCCC GCTACTGGAG TCACGCTGGT ATTTCGTTCG AATTAACCAA CTTGGGCAAA
TGGTGGGCAA CACTACCACG TGAGCAGTGG CCGCAGGAAG CTATTCGTGC CATTTTGGCC
GATTATGACG ATGCCAACCA CGACGACCAG AGCGCCAGTA CAGGAACTGT CGGTGACCGA
CGTCAGGAAG TTGTTTTGAT CGGACCCGGT ATGGGTGGAC CCACGGCACA AAAAGAAGTT
TCGTCAATTT TGGATAAGTG TCTTTTGCGT GATGATGAAT TAGATTTCTT CAACGAAAAG
AAATTGGACG AAGGGGCATT GCAGAAAGCC TTTCCCAATC CAATCCAAGC GGGCATCATG
ATATTTTAAC TCAGTGGGGA AGTGCACATA AGGCGTAGTA AGTTGGTAGT ATGTATCAGT
GGACAACAGC GATTTTTTGT GAGGTTATGG GGGCCGTGCC CTTGCTATTT GAAACAATAC
ATGGCTGCTT CTCGGAGTTC CTTAATACTA TCGTCTTCAT TCTCATTCAA AATTGTGAAC
TCGAAAACTC CACATGATAT TCCGTATTCG ATTTCTTGGA CGATATCCCG AATCCTTCCT
CG
 
Protein sequence
MNSKGSNDPP LEKKRRLSVR DLPQFCDTAG TVSVASFGAR RLPELRHLWE KTFRRASTPD 
PSNVESLKST GCKTSRRHLR RRATSHLGRK HHRFPAGQQD ANAQDKKSEH SIIEEPKLTR
RKRRRNTEHL HSSHEEWKVP KGALGIPSTE DKESNGNNHR PDNWMPTHLW HVKRFHMESM
WGWKVPMLHS NRGAKAALRL AREGKTILHD ATWTSQPVWL RIATEVVPDM IDQIRIIIPD
FALDETKLLE AFAYQPSAFP KGGIGPVAWL CSTSPLFGSA KDDTQSCYIY FFLHPSIQQT
LLHILRQILE PSTLTGPLCF VHGGLSWLRL RGKAALESLL MAMLSIAKDE KRSTDIGAII
ADLETGIDCQ LSHVQCRAAT NHETPNSQEV KLILRRPVQV DFHGNFGSCG VDVVCEPSLG
QKLLVALVLH GMACPIGVTE LAHLELECEP PRPLFPRDYP DTSVGMSYWK AALPEWKILR
AYYEGGLGRI RPDLSEDETT VVTVRGSFGQ PFQQAINGIG YIHRNGDKAS NSGAAAESLV
IRRSRRKSGN PSLPVQAPPI SRDQVSQHRN LCKTLLLSLS LPAVLLAHIC LHEKGQLDTG
TLLYATDGSV NEPLGVVTAA IFSSGRGRYH GIAVVGAARL LEALQHAHDS CAGRIVAPRV
GPKRIELEIG VASKVDNVWH TVVAALMAHP VRLSKLGRWA ASATSSRVFF VGALRLTSAF
TVANPTSAAL TPDAGLQDVK PIPITVLAGF LGSGKTTLLQ NLLENNEGLR IAVVVNDVAS
VNIDSKLVAN QNLASGMVEL QNGCACCSRS EELLASVQEL VTLSDTRGEG ESFHHIVVEM
SGVGDPRSVR AKFQEAVLYD MPLMQRAQLD TMVTVVDCSS FLTNLNSDKV ATPEDTPELY
YRDEDEAKAD RKWMEDDDLP PGLLEAIEAG DRASANAVAD LLVSQTEIAD IVLLNKVDLV
DESSRDMKQI ENIVTALNPR ATLLKSAFGK VSLQQILGVA QGMGVAEAGI IDDHKDAVNA
ALEMAHDPDC EDPNCVDPSH SEVVAVVDCA KPDCTDSSHE HSHTHACDDP ACDDPAHAKM
TVCGEPGCTD SHEHSHTHAC DDPSCDDPAH ANTVAESVCN DPGCTESHEH SHAHACDDPS
CDDPSHGVDV GTHAGIGTYV YTSRRPFHPT RLLSFLRNLP ATRGLPPLEA GEPDLAVSAT
AKSAMKKILR SKGFVWCADS FEVARYWSHA GISFELTNLG KWWATLPREQ WPQEAIRAIL
ADYDDANHDD QSASTGTVGD RRQEVVLIGP GMGGPTAQKE VSSILDKCLL RDDELDFFNE
KKLDEGALQK AFPNPIQAGI MIF