Gene PHATRDRAFT_20424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20424 
Symbol 
ID7201414 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp102984 
End bp106378 
Gene Length3395 bp 
Protein Length704 aa 
Translation table 
GC content48% 
IMG OID 
Producturea transporter 
Protein accessionXP_002180571 
Protein GI219119631 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.333459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCGCATTAC AACAGAGTGC CGATATCCAA TTGAGAATTC GAGCCGCACG ATGAGCACGA 
ACACCGTCAG CGAGATCACC TATGATCTCC CTCCCTTTGA CAAGTATCGG GGACAAGACT
CCTTCTTCGG AGGCGAGCCT CCGCTTTCGG AGGGTGTCGG ATACCTTGTT GTCCTCGGGT
TTGGATTTCT CTTCTCGATC ATCACTACGA TCCTTGTTTA CCTGAACAAA TATTTTGGTG
CCCGGGGTGA AGTCACTTCT GAGCATTTCA AGTAAGTTGT CTGCATATTC CTAATTGTTG
TCACAATCGT CGCAATCCCG CTCTGGTCTC TGTTCCTCCG TTATCGGATT TCCAAAGGTG
GTACTTAATC GTCAATTTCG CTTTCTAACG CCCTTCGTTC TGTCATTATA CAGCACTGCT
GGTCGCATGA TTAAGACCGG TCTCACGGCC TCGGTTATTG TTTCCCAGTG GACATGGGTA
AGTGATACCG AGCTGTGGAT CATGTGATTG TTGGTTTTGT TACTTCGTTC CATCCCCCCT
CACAAATGAA GCTCCCTTTT ACCCCTCTTT TAGGCTGCGA GTAAGTGATG AAATTTTTTT
TCATCATTCT AAAAGCTTTT GAATTCCGAT TTCTCAAATA GATTTTCTTT CATCACAGCA
CTACTCCAGT CCTCCAACGT TGCCTGGGCA TACGGTGTAT CGGGACCTTT CTGGTAAGTC
TGTGACCTTG TGACATTTTT GAGAGGTTCA TTCCCTTGAT GGCCGCCCTA CGACATGGAC
GACAACAATC ACCACAATCT GAGTGATGGA TTGGATAACT CGTCAAGAGG TATATCCTCA
TTTGTTACCA TGGTCGTGTT TCATACCTCA CTGAAAATGA TTCTAACGAC GGATCGGTGA
GCGAGAGCTC GGTATGACGT CTTATTATCT TCAAGCTGAT GTATGTTACA CAGTCCTTCC
ATATGTAACT GCTTTTTGAT TGGCTGAAAT TTTTAAGTGG CTTCCGGCAC TAAAATCTTT
CTCCATCAAG TTCCGTTTAG ATCTTCGGCA TTTTACGCTG TCTGTTTTCG CCCATCGGAA
ATGCCCTTGT TGCGACATCG ATCTCATGTC GCCCTTTTCT CATATTCTTC TTCGACTGCA
TCAAAATAGG TATGCGTCTG GTGCCACCAT TCAGGTTTTG CTCTTCGGAG TCTTGGCCAT
CAACCTCAAG AAGGTGGCTC CGTCTGCACA TACCGTCGCT GAAATTGTCA ACGCACGGTG
GGGAAAGACC GCCCATCTGA CGTTTCTCTT CTTCTGTTTC GCCGCCAATA TCATTGTGAC
TTCGATGCTC TTGCTCGGTG GAGCCGCCAC CGTCGAAGCT CTAACTGGTA TGGACTACCG
TCTTGCGTCT TTTTTGATTC CTTGGGGAGT CATTTTGTAC ACCGCTAGTG GAGGGCTCCA
GGCGACTTTC TTGGCGAGCT ACATCCACAC AGTGATCATC TACGCCGTGT TGATCACGAT
GATCTTCCTG GTGTACATCA AGTTCTACTC GAGTGACCAG ATCTACGATT TCCTTGACCA
GACCGTCTCT TTCACTACTG AAGAGTGTGA AGCTATTTTC TCTGACGATT CTGGAACTTT
CTTCGAGGCC GGGAAGTATG CGTGCGGTCC CGTTTCTGGC AACGAGAAGG GATCGTATTT
GACCATGATT TCTGGAGATG GTCTCATGTT TGGAATCATC AACATCGTCG GTAACTTTGG
TACCGTGTTT GTGGACCAAT CTTACTGGCA ATCTGCTATT GCTGCCAAGC CCAGCAGTGC
CGCCCGTGGT TACTTGCTTG GAGGGTACGT TAAAATGTTC ATTGACTTTC CATTCTAATG
TAGAATATTT CTTATCATGT TGGCTCTTTT TTACAGAGTG TGCTGGTTTG CCATTCCGTT
CTCTCTCGCC ACCTCTCTTG GCCTCGCCTC TACGGCTCTT ATGCTCCCAA TTACTTCTAC
TGAAGCCGGA AACGGTCTGG TCCCTCCTGC TGTCGCTTTC GAATTGCTTG GCGATGCTGG
TGCCATTCTC ATTCTAATTA TGCTCTTCAT GGCGATTGTG TCCACCGGAT CCGCCGAATC
AATCGCCGTT TCTTCCTTGG TCGCTTACGA CATCTATCGT CAGTACATCA ACCCCGAAGC
TACCGGAGAT CAGATTCTGT TCGTTTCCCG GGTCGTGATT GTTGTATTTG GACTTTTTAT
GGGATGCTTT GCCATTCTTT TGTTCGAAAT TGGACTGAGC TTGGGCTGGG TATACCTTTT
CATGGGAGTT GTCATTGGAT CGGCCGTCGT TCCCTTGTGG AACATGATGA CTTGGAAGAA
GGCGTCTGGC ACCGGTGCCG TCATTGCAGC ATGGACCGGT CTTGTGTTGG CCGTTACTGG
ATGGCTTGTT GCCGCTAAGG TCCAGAGTGA TACTATTTCT GTGGACGCCC TCGGAACTAA
CGAGGTCATG TTGAGTGGCA ACTTGATTGC CATTCTCTCC TCAGGTGCCA TTCACTATGT
CTACTCCATG TTCATCGATC CCCAGGACTA CGACTTTTCT GAACTCGACA AGCATATAAC
TCTCGTCGAA CAGGATACGC GTGGACTCAC GGATGAAGAG AAGGACCCTG TTGCGCTTCG
TCGTGCTGAA CGCTGGATTA CCCGCCGTGG ATATGCCTTG ACGTTGGTGC TTATATTTAT
TTGGCCCATC CTTTCTGTTC CTGCTGGTGT CTTCACCAAG AGCTACTTTG CCTTCTGGGT
TTTGGTGGCT ATCGCTTGGG GTTTCGGTGC CGCTTTGGTC ATCACGATTC TCCCTTTGAC
GGAGAGCGCC GAAGACATCA GCATGGTTCT TTCCGGAGTT TTCTACGCTG TTACAGGCCG
TGAACCCAGA CGTGCTGAAG ACCCGGCCGA GGCTGTAGCT GCGGAGAAAG AAATTTCGGA
AGAGATGGAC AAGGCTGATG CAGAAGTCGC TGCCGAGATG GAAGCCTAGG CGCTGTTTTC
TTACTTTTTG TAGCAAGGAA TCTTTCTCTC AACCTTGATT TGTGTTTTGT CTTTCACTTC
TGCAATTTAT GCTGATTTTG GTCTCAATTA TCTATTTTGC TCTACACACC TATTTTTTGC
CCGGGCGGCC GTAAAGATCA GTACTTTAGT AACAGTTGCA ATTTTAGCAA TTTGTGGTGT
GTATTTCTCA TCATAACCGT AGCCGCATCA GTACCACACT GATACTGATT CGAAGACTCC
CGAGGTTGCA TCAGTTTCAT CCTTGCGTTC ACTCGGAAGT GATGCAGCGG ATGTAAAAGA
CATCGCTGAG GTAAACAGTT CTTACGTTGC TGTTTGTACA GAGCAACGAC AATTCTTGAG
CTACACGTAG ATCACAGATA ACGAACATTA CTCGG
 
Protein sequence
MSTNTVSEIT YDLPPFDKYR GQDSFFGGEP PLSEGVGYLV VLGFGFLFSI ITTILVYLNK 
YFGARGEVTS EHFNTAGRMI KTGLTASVIV SQWTWAATLL QSSNVAWAYG VSGPFWYASG
ATIQVLLFGV LAINLKKVAP SAHTVAEIVN ARWGKTAHLT FLFFCFAANI IVTSMLLLGG
AATVEALTGM DYRLASFLIP WGVILYTASG GLQATFLASY IHTVIIYAVL ITMIFLVYIK
FYSSDQIYDF LDQTVSFTTE ECEAIFSDDS GTFFEAGKYA CGPVSGNEKG SYLTMISGDG
LMFGIINIVG NFGTVFVDQS YWQSAIAAKP SSAARGYLLG GVCWFAIPFS LATSLGLAST
ALMLPITSTE AGNGLVPPAV AFELLGDAGA ILILIMLFMA IVSTGSAESI AVSSLVAYDI
YRQYINPEAT GDQILFVSRV VIVVFGLFMG CFAILLFEIG LSLGWVYLFM GVVIGSAVVP
LWNMMTWKKA SGTGAVIAAW TGLVLAVTGW LVAAKVQSDT ISVDALGTNE VMLSGNLIAI
LSSGAIHYVY SMFIDPQDYD FSELDKHITL VEQDTRGLTD EEKDPVALRR AERWITRRGY
ALTLVLIFIW PILSVPAGVF TKSYFAFWVL VAIAWGFGAA LVITILPLTE SAEDISMVLS
GVFYAVTGRE PRRAEDPAEA VAAEKEISEE MDKADAEVAA EMEA