Gene PHATRDRAFT_42854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42854 
Symbol 
ID7196506 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1321814 
End bp1324988 
Gene Length3175 bp 
Protein Length768 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177269 
Protein GI219111035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.652023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTGTTCCCA CAAGGGTGCA AGTGCTTCTT CTTTACATTA GAAACTTTGT CATTGTAAAA 
TTTTTGCACA TGAGTTCTAC TACAACACAG AGGCGAAAAA TTGGCTCGGT CAAGTCAGGA
GATTCGTTGA CGGTACAGGT AGAACTGGAG GAAAATCGCG AGTCCACCTC GGTCCCAATG
GTGACAGCAA CAGTTCGTCG ACGATTCTGT GGTACCGGAC CTTTCGACAG ATATTGGTTG
AACCTGGATT GTTGTGGCCT CTTTTGCGCC CTGATAACTT ATTGTCTGCA CGCCTATGGA
GTGTATGCTG TCTGCTTCGT TCTGATCCCA CCCTGGATGA GCACTACGAG CGAAGATGGC
ATCCGGAGTT TGAGCATAGC AGGCATCGGG AATCGCATCG GGTTTTCTCT GTTGGCAGCA
TTGGCGGTTG CGGCTCATTT CAAAACAATG ACAACCGATC CCGGTACGGT GCCACCCGAT
GCGCAACCGC TTCCCGAAAC CGAGGAGAAA ATAGAAACTG AGGAAGAAAA GCAATTGCAA
AGTTTGATGA TTATGCCAAC TCAAAAGGGA CGTCGACTTT GTCGTCGTTG CAAAGCGTTC
AAGCCGCAGC GGGCACATCA TTGTAGTGTC TGTCGCCGAT GCGTTATTAA AATGGACCAT
CACTGGTACG TGTTTTGGTT GTCGATTTTT TCCCTGCGAC TGACTCTCCT TGTTTGCTAA
CGCGACATTA TATCCATTTT CTTTACGACA GCCCCTGGGT CAAGTATGTT TTGTTATATC
CTCCTTCATT TCTAGCGTTT GGCAGTAAAA AACAAACTAA CCCGATTTGC TTCCATTCAC
AGCAACTGCG TAGGTATTGG CAACCACAAA TACTTTCTAC TATTTGTGTT CTACACCTTT
TTGACCTGCA CGTATTCCAT GGTGTTTGTC ATTACTCGAT TTGCGACGTG TGTGTCACAC
GATACGACGG GCGGACGTCA CAATCGCCAC CATATTGCCT GCTTGGATCA CCCTACCCAG
ATGCTTACAG TTCTCGGTCT TTTGATCGAA GCTTTGCTCT TTGGAATGTT TACCTCCTGC
ATGATGTATG ATCAATCCGA AGTAATCCGA TCCAAATTGA CACACATTGA CCGTCTGAAA
GGTCTCGATA TTGGTGGTTC CTTGGAAGGC ATCACCGAAG TCTTCGGAAT TGGCAGCTGC
AGTCGGGATG TCAACCACAC AGGATTTCGC TGCGATTGGT TGTCCCCCTT TCGTCGAGTT
TGCTACCCAC CCTCAGCGGT GGACGAAGTA ATGGGATTTT GCCGACTCGC GAGAAAGGGC
ACGTCCGAAA CAGAGTTGCC GGCCCGGTCG AACGGTTCCG CGTTGCGTAA GGTGGCGGAC
TTGGTATGAC ACAGCAGAAG TATTGCATAC TTTGGTTTTT TTTCAAGAAT GGTTCCTGAA
TACACAAAAT TAGTTCACCA ATCACCTTCT ATAACCTGAG TTCAGAGAGT GAAGACTCCG
ACGTTTGTCT CTCGTCTAAT CAGAGTGTCG GCCAATAGTT TGAATGCTGA AATATGCGGA
ACGCATAGTT TCTCATATCC AACTATGATA GGTTCATTCG TCGGAATCTA TTGTAGCAGC
GCCGGCCAAT GCCGGTGTCG TTTTGAGACA GGAGGGCGCT CTCAAATATT GTTCAATAAC
AATACTATAG TACTGACTCG CGCCAACGAC TACGCCGGCC GGGGCAGCGC TTAATTTCGC
CCCGTCACAC AACCGGACCG GACACGTTAC AGGGGGCCGT TCGTGGTGGT ACGGATCTGT
TTTTCCAGAA GAGTGAGAGA ATCAAGAATC TATTGACTAT GTGACCATGA AATTGTGCTC
ACTGTCAAAT CCGCATCGTC TTCTCTTAAG ACGGACTTTA CCTATCTAGG TAACAAAACG
GCCACCGCTG CCACCAAATT TTTGATTAGT AGAACGGAGC TGATCGAAAA AAATGAGTGC
TCCTGGTACG TAAACCAAAC TCGCTCGTTT GGTGCGTAGG TAGTGTCCCG CTGCCGCCTC
GACTCACTAC GCGCATCGCA TTGTTTTTTC TGTTATGTTG CAGTCCCAAT GAATATGGAG
GAATGGCAGC GGCGAATTCG GGCGGATAAA GAAAAGGAGC GCCAGCAGAA ATCGCAGTCT
GCCGAAATCC TTCAAAGCTA CCGGGGTGGC GTCAAGGACG AGGATCTGAA ACTTTCGGCC
CTGAGGCAAG AAGAGCGGGA AAAGCATTTG GACGCCGAAA GGTTGATGCA TAGTTACCAA
AATAATGAAC GGATCGAAGT CAGGCAGCGA CCTGTTCGGG TAGATCCGCA GCTTGCGTCA
CCACCGGTAC GAGCGGAAAG CGATCGGTCG GTCAACGTGA CACCAGGATC GGTATCGGCC
ATGGCGGGGA GATTTGCGCA GGTATCAGAC TCTGACAATA GTTTGTCTGT GTCACCGTCA
GCACGGACAA GAAAAACAGT CGTTGTGGAG GCTCTTCCGC CGTTTGGGGT CACTTCCCTA
GTGGAAGAAG GTGCGGCGAA GGAGACAAAT CTCACCAACT CGACGGCTCT ATCGCCAGTG
CTGGATGCAT TCGGAACAGA AAGTCAAGAA TACGACCAAG TCGCTTTAGA GAACGCTGCA
GATCTCGCAG CATTCTCATC CGCGACCGTT CCAGAATCCT TTCCGAGAAT GATTCGATTG
GATGTGCTAA TTTCTTTTGG ACTGGTCACG TCTTCTGAGA ACCCTATTTT AGACGGCTAC
GTTAAGGCGG CCGGCCAGAT CGTCCAGTGG CGTCTGACGG AAAATTCTGA TTTGGGGAGA
AGTGTAACGT ACAATACTGA TGTTCCTGCT TTTATCAAAA AGTCGAATTG GGACGGTACG
TCAAGCTAGG ATGAACGCGC TGTAGATGGT AGGCCTTTAT TCTCACCTGT TCTATAATCC
TCCTTCGAAT AGACTTCTAC GTGGACTCGT CGGGTCGCTC CGATGTCCGG CGCTGCGTAG
CGGTGGCGGC TATCCCACTA TTTCTGACGA ACGGATTTCC TGTGGATACC GTCAAAGATG
ATATTGTTCG GAGTTTGCAA CACTCGATTC ATTCAGGGGA ATTTGTCGAG CTTGCGCAAG
CTTTCCGATA GCAGCACTCT AGATTACTCT TAAAGTATGG GGAAAAATTC ACCGG
 
Protein sequence
MSSTTTQRRK IGSVKSGDSL TVQVELEENR ESTSVPMVTA TVRRRFCGTG PFDRYWLNLD 
CCGLFCALIT YCLHAYGVYA VCFVLIPPWM STTSEDGIRS LSIAGIGNRI GFSLLAALAV
AAHFKTMTTD PGTVPPDAQP LPETEEKIET EEEKQLQSLM IMPTQKGRRL CRRCKAFKPQ
RAHHCSVCRR CVIKMDHHCP WVNNCVGIGN HKYFLLFVFY TFLTCTYSMV FVITRFATCV
SHDTTGGRHN RHHIACLDHP TQMLTVLGLL IEALLFGMFT SCMMYDQSEV IRSKLTHIDR
LKGLDIGGSL EGITEVFGIG SCSRDVNHTG FRCDWLSPFR RVCYPPSAVD EVMGFCRLAR
KGTSETELPA RSNGSALRKV ADLRLISPRH TTGPDTLQGA VRGGTDLFFQ KSNKTATAAT
KFLISRTELI EKNECSWYVN QTRSFVPMNM EEWQRRIRAD KEKERQQKSQ SAEILQSYRG
GVKDEDLKLS ALRQEEREKH LDAERLMHSY QNNERIEVRQ RPVRVDPQLA SPPVRAESDR
SVNVTPGSVS AMAGRFAQVS DSDNSLSVSP SARTRKTVVV EALPPFGVTS LVEEGAAKET
NLTNSTALSP VLDAFGTESQ EYDQVALENA ADLAAFSSAT VPESFPRMIR LDVLISFGLV
TSSENPILDG YVKAAGQIVQ WRLTENSDLG RSVTYNTDVP AFIKKSNWDD FYVDSSGRSD
VRRCVAVAAI PLFLTNGFPV DTVKDDIVRS LQHSIHSGEF VELAQAFR