Gene PHATRDRAFT_43232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43232 
Symbol 
ID7196953 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2397734 
End bp2399790 
Gene Length2057 bp 
Protein Length568 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177506 
Protein GI219111509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.332386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAC GACTCCAAGC TGTTCTCTTT TTTTGGGCTT CCAAGAGAGG TATGCCACTC 
ACTGTCATCT TCCCATTTCC TGTGTATCCG CTTCCGATCC GCCACCGACC GAACTCTTAC
AAACAAAGAT TCATTGAACC TTTCAAAGGA CCTCGGTGCC GACATGGAAG AGGCTATGGA
AACAGGCGAC ATTTTCCTCG GCACTGTCGT CGCTGTCTTG ACAATTGTCG GGATCAGTGT
AACCATTAAC CTGTTGAAGT TCTTGTGGAG ATCGTTCGCC GCTCAGGTAG GTTCAACGAT
CTATCCCTGA ATAGTCCGGT ACCGACGGCA CTTTCTCACA TCTTCTATTC CAATGAATTT
TGTACAGCCC ACAACGGCCA AGATCCGTCA GGTCTTCCCG GGTGCCCAGA CGAACGATCA
GCTCGTCGAA ACGATCAGAT CCAGTCTCGA AAAGTTCGGA TTCGGAGAAA ACTCCCTCAT
TGCTACCTCC TTGTGTTGCG ACGAAGTCAA CCGTCCCCTC GATAAGGCTC TGTCGGAGAC
CTACGGTAGC TACTTCTCCA TGGGAGGTCT CGCTGGCTTC CCCTTTGGAG GTCTGACCTC
CTTCGGAGCC ATGGCTGCGC ACATCCCCGA CGGGGGCTCT TGCGTTGTGG TGTACGGGCC
TCACGTTGGT GTGGACTCCA AGGGTAACGT GGGTACCGTT GAGCGCCGCG GACGTCAGAA
GGGCGGATCT TGCTGTGGAT CCGGTGTTGC CGCGGCTGGC TTTGTGAAGT CTTGCCTTGC
GGGTGACGCC AAGCCCCCCG GCGCCCCCTC GGACCCCCTG GACGCGCAGC AGACGTTCGT
GAACTCTATG CTCCTTCCCC ATGGAGCCCG TCTGAACTCC GCAGAAGAGC CCATGGTCGA
GCTTCCGTAC GCTTTGTTTG ACGCCCAGGA CGAGTTCATG CGCAAGATCA TCGAGAAAGG
ATCCGGTAAC GTGGCAGGAA ACGGTCGCAT TGCTCTGTTG GGAGGAATCC AGATCAACAC
CCCCGCCGAC CAGCCCGACT ACTTTTTACC ACTGCGCTTT GACGTCCTGT CGAACAAGGG
CGAGACTATT GAGAAGATTA TTGATTCTCC CTCGCGCGTT ACCGCTACAA AGATCTCCAG
TGTGTTCCCC AACGCGGTAC CGAACGAAAA GCTCCTCGCC AAGATCAACA GCACACTGGG
CTGCTATGGG TACGGCAAGA ACTCTCTGGT TGCTACCTCG CTGTGCTGTG ACGAAGTCAA
CCGTCCTTTG GAAGATGACC TCAAGGCCGC ATTCGGCGAA AACTTCAACA TGGGCGGACT
CGCCGGCTTT GCGTTTGGAG GTGTCACCAG TTTCGGTGCC ATGGCAGCGC ATATTCCGGA
CAGTGGCTCG TGTTTGGTGG TATACGGGCC GCACGTAGGT GTCGACTCGA ACGGCAAGGT
GGGAACGGTC GAACGACGTG GACGGGCGAA GGGCGGGTCT TGCTGTGGAT CTGGTGTCGC
CGCGTCAATG TACGTCAGAT CGGTGCGTAA TGGCGGGGAA GAAGCTGCTC CGCCTACGGA
TCCACTCGAC GCGCAACAAA GCTATGTTGG CACTATGCTA CTCCCGTATG GTGAACGCTT
GGAAAATGCG GAAGACCCTA TGGTGGAACT TCCATATGCT CTTTTTGACG CACAGGACGA
GCTAATGCAG AAGATTGTTG CCAAAGGCTG CTCGAACGTT GCTGGCAACG GCAAGATTGC
TCTTTTGGGA GGAATTCAAA TCAATACGCC TAAAGGCATG GCAGATTACT TTTTGCCCCT
TCGTTTCGAT ATTCGCGACA ACCGCGACGT TACCATTGAA GATTTCCTGG TAGAGACTGG
TACCTAGACC TCACATTTTC CTGGCTATGA CGCAGGCCAA TCGCTATGCA TGGAGATGCC
ATTCTCCTCC ATCTTACCGG CATCGCGCTC TCCTTTGACG AAATGTTTTT TACCTCTTCA
CGAGACCTTA CGAGGGTAAC CGTACTCTAT ACGCATTGTG GCAATATAAC TTTAACTAAG
ACGCTTGCTG TTAGTGC
 
Protein sequence
MQQRLQAVLF FWASKRDSLN LSKDLGADME EAMETGDIFL GTVVAVLTIV GISVTINLLK 
FLWRSFAAQP TTAKIRQVFP GAQTNDQLVE TIRSSLEKFG FGENSLIATS LCCDEVNRPL
DKALSETYGS YFSMGGLAGF PFGGLTSFGA MAAHIPDGGS CVVVYGPHVG VDSKGNVGTV
ERRGRQKGGS CCGSGVAAAG FVKSCLAGDA KPPGAPSDPL DAQQTFVNSM LLPHGARLNS
AEEPMVELPY ALFDAQDEFM RKIIEKGSGN VAGNGRIALL GGIQINTPAD QPDYFLPLRF
DVLSNKGETI EKIIDSPSRV TATKISSVFP NAVPNEKLLA KINSTLGCYG YGKNSLVATS
LCCDEVNRPL EDDLKAAFGE NFNMGGLAGF AFGGVTSFGA MAAHIPDSGS CLVVYGPHVG
VDSNGKVGTV ERRGRAKGGS CCGSGVAASM YVRSVRNGGE EAAPPTDPLD AQQSYVGTML
LPYGERLENA EDPMVELPYA LFDAQDELMQ KIVAKGCSNV AGNGKIALLG GIQINTPKGM
ADYFLPLRFD IRDNRDVTIE DFLVETGT