Gene PHATRDRAFT_31846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31846 
Symbol 
ID7196149 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1075504 
End bp1077251 
Gene Length1748 bp 
Protein Length552 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177216 
Protein GI219110929 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCCA CGAGAAGGGA ACGTCTGCTT CCGCAGTTAC GAGATGCTCT TGCGTGGACT 
GCGGTGCTAC TACTACTGGG TTTACTACCC TTCGGCGCCG CGTCGACGAC GGACGATTGG
ATGCGTTTGG ATTACGCCGT CTTTCCGCGA CGACACGTCG ACGAAGCGGC GGTGCTGGAA
TGGGGACACT CGGCCATTAT TTCCGCACTC GACGCATCCG TCGCCTGGTT TGGTCCCCAA
ACCTCACAGG CGGCTCTACT TGAAGTCGAA GCCCAACCCG TATTGGCCTC GCCCGTACAC
GGCGTATCCA ATACGGTACA AGAAGCTTTG GATGCCCTCG AAGAACAAGC GAGCGACGTT
GAGACGAACG AACAGGTCGC TAATTGGAGG AAAATCGTCC TGGATAATGC CGACGAAGTC
CAAGGCAACG TCGTCGTCAT GACGAATACG GGAGACTTGT CGGGACCCCA AATGGCCATG
CTCGCCCAAA ACAGTGGCGC GGCCGCCTTG TTGGTCGTCA ACGTTGACGA AGACCGTCCC
GATGATATAT ATCGACTCGC GGCGGATCGG AACACAGTGA CGACGACGAC GAAAACAGCC
TTCGAAACAA CAGCCACCAC CACGCCGCAC TCGCCGCACT TATTGACATT CCCACCGTCA
TGATCTCCAT GAACGCCGCC AACGTACTTA CCACCGCCAC GGTGGATCCC AACGCTTCCT
CGCGGCGACG CGTAGTCAAT CACGGAATGC CCGATCGAGT CCGTTTGTAC GCCGGCGGGG
ATCGACCCTT CTTCGAAGAC GCCCAGGCCG AATCCCCCGC CGTGTATCTC ATTCACAATG
TTTTGACTCG CGAGGAATGC AAAGCCCTCC AAACGCGGGC GTCCACTCGT TTGCAGCCAC
TAGAGGCGAT TGCGAGTACG GCTGGTGGTA CACGCAGTCC ACTCCAGTAC ACTACCGCAT
CTTCGTTGCG CGGAGACAAA AGTGGCGGTC CGTACTATGT CGGAGACGTA TCACGCGTCG
TATTGTGGCA AGGATTGTGG CAAAGTCAGG CCGCCAAGGC CGTGGAAGAA CGCATCGAAC
AAGTTACCGG ATTCCCGTCT ACTCATTACT CCGATTGGAT CGTGGATCGT TACGAAGCTG
GTGCCTACGT CCGTCCCCAC GTGGACAATA TTTTGGCAGC CGACGGAACC GCCCCAATAG
CCGTCCTCAC CGTCTTTTTG AACGATGATG GCGGCGACGC CGCCATTGTC TATCCGTCCG
TACCCACCAA CGCAGCGGCT CAAAAACCAC TGAAAATTCG TCCCCAGCAA GGCCTGGCCG
TCGTCCATCA CGTTACGGAC GATCATCATC GGATCGATAC CAACGCCGTG ACGGGAGTCC
TTCCGGCGTC TACCGAGCAC GGTGATGCCT ATTACCTTGC ACGCAAGTAC ATTTATGCCA
CTCCCGTCAG TACCGCCCGC CGTCTGGTGC TTCCAGCACT GTCGTTGGTA GCAGCCGGTG
GGGGAAATCT GCCCAGCCTG GTTGTTCGAC TGCACGTCGC CATGCTGGAA CAGTTTGGCG
TTCCGCAGGG CAACGCAAAT TTTGACAGGG TCTGTATCTT TGTCCCATTG TTGCTCGTGC
TACTTCTAGT GCAGTACGTG GTCAATCGCC TTATGAACCA ACCCTCCAAG CCACCGAGCA
AGTCAGCCTC CGGAACCGGG TCGAGCTCTA CAAGCGCAAA TAAGAAGCGG GACAAGAAGC
AAAATTAG
 
Protein sequence
MSSTRRERLL PQLRDALAWT AVLLLLGLLP FGAASTTDDW MRLDYAVFPR RHVDEAAVLE 
WGHSAIISAL DASVAWFGPQ TSQAALLEVE AQPVLASPVH GVSNTVQEAL DALEEQASDV
ETNEQVANWR KIVLDNADEV QGNVVVMTNT GDLSGPQMAM LAQNSGAAAF LRNNSHHHAA
LAALIDIPTV MISMNAANVL TTATVDPNAS SRRRVVNHGM PDRVRLYAGG DRPFFEDAQA
ESPAVYLIHN VLTREECKAL QTRASTRLQP LEAIASTAGG TRSPLQYTTA SSLRGDKSGG
PYYVGDVSRV VLWQGLWQSQ AAKAVEERIE QVTGFPSTHY SDWIVDRYEA GAYVRPHVDN
ILAADGTAPI AVLTVFLNDD GGDAAIVYPS VPTNAAAQKP LKIRPQQGLA VVHHVTDDHH
RIDTNAVTGV LPASTEHGDA YYLARKYIYA TPVSTARRLV LPALSLVAAG GGNLPSLVVR
LHVAMLEQFG VPQGNANFDR VCIFVPLLLV LLLVQYVVNR LMNQPSKPPS KSASGTGSSS
TSANKKRDKK QN