Gene PHATRDRAFT_31433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31433 
Symbol 
ID7196649 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp87697 
End bp89869 
Gene Length2173 bp 
Protein Length641 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177013 
Protein GI219110523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00201267 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGACG TTGAATCCGG AGACGGTACC GGGAACTACG GCAACGAAAT CGCGGACGAC 
AGTAACAATA ATAACCCTCG GGTACTGGAT CGCGACGACA GCGAGCTCGA CAAGCATGAT
GATCCCTTTG CTCCACGCGA AGGCAAAACC CTCACCTGGA CTAACATTCA CATGACCTTG
GTACGTGCGA ACGACGATGT AAAGGTTTTG TTGTGTATCG TTCTGCCGTA GCGCATCGTT
GTTCGATCGT CCTCGCTCGC ATTCTCATCG TCATTGTCGT CGCCGCTATC CCCCACATAT
AGGCTGGAAA GGGTGAGGAG TCCGAACGCA AACTTCTCGA TAACGTGTGG GGCGAGGTAC
CGGAGAAGCA AACGACGGCG GTCATGGGTC CTTCCGGAGC CGGCAAAACG TCGCTTTTGA
ATATTCTCGC CGGTCGCGCC AGTTCGCACG GACGCGTCAA GATCGAAAGC GACGTTCGTC
TCAACAACTA TTCCGTCGAT CCGACCAACA TCAAGGTCCG CAAGCTGATT GCCTTTGTTG
CGCAGGACGA TTCCTTGCAG GTCACTTCGA CGCCCCGGGA GGCCATTCGC TTTTCCGCCA
AGTTGCGTCT ACCCAGAGCT ACGACAGATC ACCAGCTCGA CAAACTCACC GACCGCATGA
TCACCGAACT AGGACTCACG GCCTGTGCCG ATTCCATTGT CGGAGGGGAA CTCATCAAGG
GAATTTCCGG AGGAGAACGT AAGCGTACTT CGGTCGGGGT CGAACTCGTC GTCAAGCCTG
CCTTGGTCTT TCTCGACGAG CCTACCAGTG GTTTGGATTC CTTCAGTGCC GTGCAGTTAT
GTCAGGTTCT CAAAAAGGTA GCCAACGCCG GATCGTCCGT TTTCTTTACG ATCCATCAGC
CTTCTTCGGA AATCTTCAAT TCCTTCGACC ATTTGATCCT CATGAACAAG GGACGCGTCA
TGTACACCGG CTCGGTCCAC GGAGTGCCGG ACTTCTTTGC CTCTCGAGGA CATCCCAATC
CTCCCAACTA CAATCCGGCC GATTTCATCA TGAACGTTGC ACAGTCGGTG CCCGTCAAGC
AACTCAACGA GGATGGATTC TTCCCCACCG ACGAACGCAA AATGGGGGAA GCCTTTGTTC
CGGATGACGG AAAGGATGCT CTCGGGATTA CCGTTACCCG TCGCACTGCT CGTGGTGTTG
ACGTGTACGA CACCAAACCC CCCGGTCTCG TGACGCAGGT CAAGCTGCTC TTTACTCGTG
AAATTAACAA CTTGCGTCGG GATGTTACGG CTCTTGGTGC CCGCTTTGGC CTCACCATCT
TTTTGGGAGT CTTGGTTGGT ATCATCTTTT TGGATGTGGG CAAGACTGAT CCCACTGTCG
CGGTCAATCT GCAGTCCCAC TTTGGTGCCC TCATTATGGT CCTCCTTATG AGCATGTTCG
GGACCGCCCA ACCCGCCCTG TTGTCCTTTC CCGAAGAACG CCCCGTCTTT TTGCGCGAGT
ATTCCACCAA TCACTATTCG GTCATTTCTT ACTTTTTATC GCGATTGACC ATGGAAGCCG
TGGTGACTGG ACTTCAGGTA TTTGTGCAGG CCATTATCAC GTACTTTATG ATCGGCTTTC
AACTGTCCTT TGGTTTGTTT TGGGCCGTTA CGTACTCTCT CGCCATGGCC AGTACGGCGT
TGGCCGTGTT GCTGGGTTGT TCCGTGGAGG ATCCCAAACT AGCACAGGAA ATGTTGCCGA
TTTTGTTTGT GCCGCAGATG CTCTTTGCCG GCTTCTTTGT CGTGCCTGAT TTGATTCGTA
AGTGGTCGGT GTTGAGCTGT GCAAAGATGG CGGTCCCGCC CTGTCTGTAC CGACCGATTT
TTGGAACCAC CGTGTGATGC GTTTCTCACA CGACAAATTC ATTTCTCATA TTTATGCTAC
AGCTGTCTGG TTGCGCTGGG CTCGTTACCT TTGTACCTTG ACCTACGCCA TTCGCATTCT
CTTGGTGGAA GAATTCTACG ATTGCGATCC TGGTAATCCA GAAGCCAACA ATGCTTGCAA
CGACTTGGTC TCGAACATTG ACGCCGACCC GGACGAGACG TGGTGGAATT GGTTGGTGCT
CGTAGCGCTG TTCGGGGTCG CCCGTATTTT TGCTCTCTAT ATTCTCCGTC AAAAGTCCAC
CAAATTCTTT TAA
 
Protein sequence
MEDVESGDGT GNYGNEIADD SNNNNPRVLD RDDSELDKHD DPFAPREGKT LTWTNIHMTL 
AGKGEESERK LLDNVWGEVP EKQTTAVMGP SGAGKTSLLN ILAGRASSHG RVKIESDVRL
NNYSVDPTNI KVRKLIAFVA QDDSLQVTST PREAIRFSAK LRLPRATTDH QLDKLTDRMI
TELGLTACAD SIVGGELIKG ISGGERKRTS VGVELVVKPA LVFLDEPTSG LDSFSAVQLC
QVLKKVANAG SSVFFTIHQP SSEIFNSFDH LILMNKGRVM YTGSVHGVPD FFASRGHPNP
PNYNPADFIM NVAQSVPVKQ LNEDGFFPTD ERKMGEAFVP DDGKDALGIT VTRRTARGVD
VYDTKPPGLV TQVKLLFTRE INNLRRDVTA LGARFGLTIF LGVLVGIIFL DVGKTDPTVA
VNLQSHFGAL IMVLLMSMFG TAQPALLSFP EERPVFLREY STNHYSVISY FLSRLTMEAV
VTGLQVFVQA IITYFMIGFQ LSFGLFWAVT YSLAMASTAL AVLLGCSVED PKLAQEMLPI
LFVPQMLFAG FFVVPDLIPV WLRWARYLCT LTYAIRILLV EEFYDCDPGN PEANNACNDL
VSNIDADPDE TWWNWLVLVA LFGVARIFAL YILRQKSTKF F