Gene PHATRDRAFT_46031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46031 
Symbol 
ID7201521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp39001 
End bp40155 
Gene Length1155 bp 
Protein Length359 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180364 
Protein GI219119198 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00169986 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTAGCCCAAC ATATTACTCC CTTCGAAAAA CAAACCGTGT GTCTAAATTC AATCTATGCG 
ATCTCGGGGC TTCCGATGAA AAGCTTTGTG TCCAAGGCAG TTTTCTTGTT TGCTTCACTC
CGACGACAGC GCGCGGGTGC CTGGACTTCG CCACTTTCGG TGCGCCGACG ATGCCCACGA
GGCCACGGAT CCTATCCTAC GCAGTGTCTG TCACACGCCA GTCTACTGTC GGTAGCGGAA
TGTCTAGAAT TGTACCATAA TTCCACACGT AGCGGGGAAC GTAGCATCCG CTTTGTCGAC
GGTTCTTGGT ATCACAAGGG AAATCGAAAT GGCTTGTTCG AATTCCTGAA CGGACCCCGT
CTACCCGACA GTGTTTACAT GGACATGGAC GACATTAGTT GCCAGACCGA CCTTTTTCCA
ACTCTCAATC CCTCCGATCT GTACCTGATG CAGCCACCGC GAGCCTTGTT GAGTGCGTGG
ATGGACTTTT ACAAAATTCG TCGTACAGAT CAAGTGATTG TGTACGGACG TTCCGGCAGT
GTCTTTTTGC CTCGCACTTG GTTTACCTTG CACGCTGTGT TAAGTCACGT CAGGGTAAGT
ATAATGCAAG GCAGTTTAGA AGACTGGATG CGCGCGGGTG GTCCACTAGA TGAAGGAGTA
TTGGAACAGT CCAGCAGCGT AGTTAGGGCG GCTGATCTCG ACTGGGAACA ACCGACCAGG
TACGACAGCA ACAGCCAGTC ACCGAGCGAA CAGGCCGTGG TCAGCATCGT GGACGCGAAC
TACATGCTTT ATGTAATAGG GGACAACAAG TGCAGTACGA AGATCTTGGA CGCGCGGGGT
TCCAGTTTTG CAGCGGGTCA CATGCCCGGT GCGGTCCACA TTCCGTACAG TAGTTTGTTG
GTAGATCCGA CCAGCGGAAG TCAATACAAA CCGGCTGAAG AAATGCGGAA GATATTTCTT
GCGCAAGGTG TGGATCCCAC AGCGAATACT CCTCTTGTGT GTTCGTGTGG TAGCGGCGTT
TCGGCGTGTA GTCTCTATTT GGCGCTACAC GTTTGTGGAC GTTCTCCGGA GCAAAGCACC
AAGGTGTACG ACGGCAGCTG GAATGAGTGG AAAACGCTGC CTTACACACC GAAAGAACAA
GTGCCGAAAA AGTGA
 
Protein sequence
MKSFVSKAVF LFASLRRQRA GAWTSPLSVR RRCPRGHGSY PTQCLSHASL LSVAECLELY 
HNSTRSGERS IRFVDGSWYH KGNRNGLFEF LNGPRLPDSV YMDMDDISCQ TDLFPTLNPS
DLYLMQPPRA LLSAWMDFYK IRRTDQVIVY GRSGSVFLPR TWFTLHAVLS HVRVSIMQGS
LEDWMRAGGP LDEGVLEQSS SVVRAADLDW EQPTRYDSNS QSPSEQAVVS IVDANYMLYV
IGDNKCSTKI LDARGSSFAA GHMPGAVHIP YSSLLVDPTS GSQYKPAEEM RKIFLAQGVD
PTANTPLVCS CGSGVSACSL YLALHVCGRS PEQSTKVYDG SWNEWKTLPY TPKEQVPKK