Gene PHATRDRAFT_54789 details

Gene Information

Locus tag: PHATRDRAFT_54789
Symbol:
ID: 7202729
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 697009
End bp: 701055
Gene Length: 4047 bp
Protein Length: 1205 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002182117
Protein GI: 219123613
COG category:
COG ID:
TIGRFAM ID:
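
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the coding sequence carries exactly one stop codon. The function names are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record.
gene_len = span_length(697009, 701055)
cds_len = min_cds_length(1205)

print(gene_len)            # 4047
print(cds_len)             # 3618
print(gene_len - cds_len)  # 429
```

Under these assumptions the span agrees with the listed Gene Length of 4047 bp. The 429 bp that are not needed to encode 1205 residues are consistent with the gene being marked as spliced, although the record itself does not list exon boundaries.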


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATGGT TACTCGCCTC CGTGAGTAAA GGACGAGATG TTAGTGACTT TTATCCGCAC 
GTCGTCAAGC TGGTCGGAGC TTACAGTCTG GAGGTACGCA AAATGGTCTA TATGTACCTC
GAACAGTACG CGGATCACGA CCCAACAACA CGCGAACTCT CTCTGTTGTC CATCAACGCT
TTCCAACGTG GTTTAGCCGA TACGGAACAA TGGATTCGAG CTCTGGCTTT GCGTGTCTTG
ACCTCGATTC GACTCGCCGA TATTTTGCAA ATTCAAATAT TGGGCGTCCA AAAATGCTCT
CAGGATTCGT CACCCTACGT GCGTAAGTGT GCCGCGAACG CCTTGTCCAA GCTGCATCCG
CGGTGTGCAC CAGATCCGTC CCAGCAGACC CTCTTATTGG AGATTTTACA GTCCATGCTG
GATCGAGACA AGGCTACCAT GGTGCTAACG TCCGCCTTGA TTGCGTTTCA AGAACTGTGT
CCGGAACGGC TGGAACTCTT GCACGGTTCT TTTCGAAAAA CGTGTCATCT CTTGACCGAC
ATGGACGAGT GGGGGCAAGT CGTGACTATT GAGATTCTGG CACGATACTG TCGACGTTTT
TTTAAAGAAC CCCTGGGATG GCGGAACGGG TCTGCGGAGC AGATTGATCG CGAACGTCGA
GTACGGAGGA CCGTTGCTAC CACACGTCCC GTAACAACCT ACAATGCCAA CTCTCAGGCT
ACCAGCGCTA CGTCGGCATC TCCGCTACCG GAGCCTCTAT CCACTAGAGC TGCACAAACA
GGGGTGTCCC TACCAACCCA TTTTAGAGAT CATGTGGACG ACAAGACTTC TTCTACCGCT
CATCCGCCTC GCAAAGTCAA ACGTCGCGTT GTGAAAGAAG GTTTCTATTC CGACGAAGAG
GATGCAAGCA CCGAGGAAGA AGTGTACGTG GATGAACTTA ACAGCCCTTC ATTGCCATTG
GCGGCAGCTA TGCGGCAACG CAACATTTTG GGTCTTGCAG GTCCCGATGG TACGAAAACC
GTTCGACAGT CTTCCAACGT TTTGTTTTCG ACTCAGGAGG ACACCGAGCT GGCCGAAGAT
CATCAACGTC TCCTACATGC CGCTATGCCT TTGCTCAAAA GTCGCAACGC TGGCGTCGTA
CTCGCCACCT GCTCTCTGCA ATATTACTGT GGTATCTCCA GTATTCAAGT ACGTGCCGCT
ATGGGAAGGG CACTTGTCAG GATCCATAGA GATTGCCGCG AAATTCAATA CGTGGTATTG
ACCGCCATTC GCGATTTGGT GAAGCATTGC CCATCAGCGT TTGCCCCATT TTTGCACGAT
TTTTTCGTCA AGGCTCTAGA TCCGCCCTTT ACTCGTCTGA TCAAGCTCGA TATTCTGACT
TCGCTGGCGC TGGAGCCTGC TGCCATTAAA GCCGTGCTGC AAGAAATGCG CTCCTACGTG
CGAGACGGAC ACGTCGAATT CGTGCGGCAT GCAATTCGAG CAGTTGGACG TACCGTCGAA
TTAGCTCGCA TCGTGTATGA TCGACACGGT CAAAAATCTG GCAAAACCAG CGTTCTGGCT
AAAGAACGTG CCGAAACGAA TAGTATCGCA TTGGATTGCT TGCATGGACT ATTGACGTTG
ACGCAAACAT CAGATCACGT TGTCATTGTT GGAGAATGTG TTTGTGTGAT GCAGCGCATT
TTGCAGCTGT TGCAAGCGCC TGAGCCCTAC ACCGGCGAAA TTTCTGTGGT TAAAGATCCT
AATAATGTTC AGCAACGAGC CGTGCAGCGC ATTTTGATAC TGCTGGTGTA TACCCTATCT
TCACGCGTCG AGAACGCACC AGAGGATGAT GAGGACGCTT CTGAACCGAC TGTGTTGGCA
AAAATCGCCG TTTCGCTTTC ATCCGATGCA ACAGCATCCG CCTTATGGGT TGTCGGAAGT
TTGTGTTTTG CGCCTCTAAC GGAATCACCG CTTAGTGAAT CGGTGGGCGT TGGCCTGGTT
AAGGGTTCTG CTCGTTTAGA AGTGGCTCGT CTAATAGCAC GGGCGTTTCT GGAAATGGAA
GCGGTCGAGA AGGAGCAAGC AATTCATTTC GCATCTCGTA TTATGGTCTC TAAGGCCACC
TCTTTGAACG GATCGTCAAC TGAAGAGTTT GCCCTGTGTG AGGCTATCTT GTCGATGGCT
CGTACCGACG TCAACGTCGA TGTTCGAGAT CGTGCCCGAT TCGAGTCCAA CCTTGTTCGA
GCCACCGTCG GCCTTCAACA TGACACAGAC GCAATGGAAG ACCTACCAGT ACTAAAACGA
CAGCTGACGG TCGGAGATGC AAAACGAATG TTGTTGACAT CCAAACCGGC ATGTTCTTCT
CTTCCACTGG AAGATGATTT CAGTACCGTT TCGGGCGAGA ACGGTGGCTT TCGTTTTGGA
ACTCTCAGTA GCTTGGTTGG CCATCGTGCC CGTAAAGCAT ACTTGCCATT GCCCCGCTGG
GCGGATCAAA ACAGTTCTGA TACGTTACGT GTGCCAATTG AAGACAAAAA GACAGATGCT
TTAAAAGATG TTGAAGGTGA GACGAGAACG AAGAACACGA ACGGTGCAAA TGAGTTTTAC
GAGTCCTCAG ACGATGACGA GCAGGACAGC TCTTCGGAAA GCTCCTCGCA GGACAGTAGC
GATGAAGCCG GATCTTCCTC GGACTCATAC AGCGATGAAT CATCTTCTTC TGACGATGAC
GACGAGTCTT CTAGCGATGA TAGTGATGTA GGCATGCAAA GCCTTGGTCA AGATGCCACG
TTGATACCGA TGGAAGTTGA ACAGCGGAAA GTCGCGCACG ATTTGAATTC TCAGAAACTA
CCTCTTCCAG TGGTAGAGAA TGTTGACGGG TCTTCCTCTT CCGAAGAGGA GGCCAGCAGC
ACGTCTAGCG ATGATGAGAC CTCGACCGAT AGTTACAAAC TCAGTCCAAA AGCCAACGGT
GGGACACATG ACGGCACTTT CATACCCCTA GACGCTTCCA GCAAAGCTGC GCCTGCTGCA
ACTTCCACCA TTGCCTCCTC CTTAGCTAGT GACTTTGAAG GCATGACATT GGCACCCGCT
ATCCAAAATC AAAAGCCGCA ACTGGATCCC GACCGCGACA GGGATTCTAG CGTTTGGCAA
GTCTGGGTAC GGCCCGAACA CGCGAATGGA TTGTTGGTGA AGATTCGCTA TCTACGAGGA
CCAACTCGGT CCAAAGAGGC GCAGGTTTTG GTCGGCACGG GAGCCGAGAA ACCTTCCCTG
GTCCTGTTGC AAGTGAGATT TGAAAACAGT AAGGATACAA CAGTTCGGCG ATTGCGCATT
CTCCAACGGG CTTCCGCTTC GGGTACGTCT TCATCCATTG CACCCCGCAA AATGCTTCTT
CCTCCCGAAA TCGACCAACT GAAAAAAGGA CAAACCGTGG ATCACATCGT GGCCATTGAA
TTCGCCAGTG TTTCCGATCG GGAAGGTACA ATGTTGGCAA AACTGGAAGT CAAGTTTAGC
ACTGGCGGCA TACCGGTGGA AATAAAGCCG AGTCTTTGCG ATTTATTGTT GCCCTGTTTT
CGATCGGTGG CAGACTTTGA TCAAGCCGTA GCCCGACTGC AAGGCTTTCA ACGGGTGGAT
ACACGCTTTC CTATGTCCGA CGATTCCCAA GCCCAGCGTG ACACCCTGAT GTCCCGTTTG
ATGCGAACGG CGCCCTGGAC ATTGATCCTC GAAGGTGATG CCGAAGCTAC CAGAGATGAA
ACATGGCCCG GCCAAAAGTT GCGTTTGGCG GGCACGCTGC CAGCATCGTC CGATCCCGTG
TACGTCTTGG TGACAATCAC GGGGTCTGGT ATTACGGGCC CGGGTAGTGC CGGTGGTGGA
TGCCAGGCAC TCTTGTCTGT TTGTTCCGAC AACGCATTGG CCGTGAATAG CATTTTGAAC
ACGTTGAAAA AGACGGTTCA GAACTTGAGC GATACGGAAA CACAGTAGTC TTCTATAGCC
AGATAGGTAG TACACAAAAA AAAACTCTGT CTAGAGCAAC TCTTCGCGAA ACCGTAAACG
TCTTAAGATA ACGCAGCCGT AAAGGTA
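
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch, assuming that the GC content figure is the share of G and C bases over the whole displayed gene sequence (the record does not state how it was computed). The function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demo on the first displayed line only; paste the full block to check the record.
first_line = "ATGAAATGGT TACTCGCCTC CGTGAGTAAA GGACGAGATG TTAGTGACTT TTATCCGCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the listed Gene Length, and the rounded `gc_percent` value can be compared with the listed GC content.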
 
Protein sequence
MKWLLASVSK GRDVSDFYPH VVKLVGAYSL EVRKMVYMYL EQYADHDPTT RELSLLSINA 
FQRGLADTEQ WIRALALRVL TSIRLADILQ IQILGVQKCS QDSSPYVRKC AANALSKLHP
RCAPDPSQQT LLLEILQSML DRDKATMVLT SALIAFQELC PERLELLHGS FRKTCHLLTD
MDEWGQVVTI EILARYCRRF FKEPLGWRNG SAEQIDREQG FYSDEEDAST EEESSNVLFS
TQEDTELAED HQRLLHAAMP LLKSRNAGVV LATCSLQYYC GISSIQVRAA MGRALVRIHR
DCREIQYVVL TAIRDLVKHC PSAFAPFLHD FFVKALDPPF TRLIKLDILT SLALEPAAIK
AVLQEMRSYV RDGHVEFVRH AIRAVGRTVE LARIVYDRHG QKSGKTSVLA KERAETNSIA
LDCLHGLLTL TQTSDHVVIV GECVCVMQRI LQLLQAPEPY TGEISVVKDP NNVQQRAVQR
ILILLVYTLS SRVENAPEDD EDASEPTVLA KIAVSLSSDA TASALWVVGS LCFAPLTESP
LSESVGVGLV KGSARLEVAR LIARAFLEME AVEKEQAIHF ASRIMVSKAT SLNGSSTEEF
ALCEAILSMA RTDVNVDVRD RARFESNLVR ATVGLQHDTD AMEDLPVLKR QLTVGDAKRM
LLTSKPACSS LPLEDDFSTV SGENGGFRFG TLSSLVGHRA RKAYLPLPRW ADQNSSDTLR
VPIEDKKTDA LKDVEGETRT KNTNGANEFY ESSDDDEQDS SSESSSQDSS DEAGSSSDSY
SDESSSSDDD DESSSDDSDV GMQSLGQDAT LIPMEVEQRK VAHDLNSQKL PLPVVENVDG
SSSSEEEASS TSSDDETSTD SYKLSPKANG GTHDGTFIPL DASSKAAPAA TSTIASSLAS
DFEGMTLAPA IQNQKPQLDP DRDRDSSVWQ VWVRPEHANG LLVKIRYLRG PTRSKEAQVL
VGTGAEKPSL VLLQVRFENS KDTTVRRLRI LQRASASGTS SSIAPRKMLL PPEIDQLKKG
QTVDHIVAIE FASVSDREGT MLAKLEVKFS TGGIPVEIKP SLCDLLLPCF RSVADFDQAV
ARLQGFQRVD TRFPMSDDSQ AQRDTLMSRL MRTAPWTLIL EGDAEATRDE TWPGQKLRLA
GTLPASSDPV YVLVTITGSG ITGPGSAGGG CQALLSVCSD NALAVNSILN TLKKTVQNLS
DTETQ