Gene PHATRDRAFT_42776 details

Gene Information

Locus tag: PHATRDRAFT_42776
Symbol:
ID: 7196150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1078101
End bp: 1081031
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002176718
Protein GI: 219109931
COG category:
COG ID:
TIGRFAM ID:
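
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as not spliced). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1078101, 1081031)
print(length)                           # 2931
print(expected_protein_length(length))  # 976
```

Both results match the Gene Length (2931 bp) and Protein Length (976 aa) fields of this record.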


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.161289
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGCAAC CTCCCGGAAG TCACAATGGT GCCGGATCCC GTAACGTTCG TCGCCAGAAC 
CTAGGCCATT CGAAGAACAC ATCCACGTCG CCGGACGATC AAGTCCGCCT TCTCCAGCGA
TTGCCCCCGA ACAAGCGCTG TTGCGATTGT CGCGCCAAGC TCCCGTCCTG CGTCAACTTG
ACCGTTGGTA GTTTCGTTTG CCCCGCCTGT GCCGGCATTC ACCGCGAACT CAATCAACGC
GTCAAGGGCG TGGGACATTC CAGCTTTACC GACAAGGAAG TCGAATTCTT GCAGAGCGTC
GGTAACAACG ATCTCATTAA CGCTATCTAT TTGGCGACCT ACGACGACGC ACAATCCTCA
AGGGGAGGAA GAATCCAGGA ACCGAAGGAT AATACCGATC CGCAACATTT GAAGACTTGG
ATTCGCCGAA AATATGTGGA TCGTGCCTGG TATCGTCCTT CTTCTTCCAC GGCTGCGGCT
CCGCATCCCA CACAGCAGCC TCCACACGCG ACTATTGTGG CGATTCCTCC GACGGCGCCT
GGAACTGCAG GATCCACCGA TTTTTGGAGC AACAACAACA ACCACGCCTC TCCCGCACCG
GCTTGGGACG CATTCGGGAG TAACAACACT ACCGCCACGA CGAATGCGGG ATCATCGCAA
TCAGGATTTG GACCAGCGCA GTCTCCTCCT CCTCAGGCAC CAAACGTGGT AACGAACTTT
GCTAATTTCA ACACGGCGCC ACCACCCACC CAGCACGGAC CTCCCAATCC TGGGCAGCAG
CTGCAACAAC AGCAACAGGC CGTCAACAGC TTTGCCAACT TTGAAGCTTC GGGATCGAAT
GGCAACGGAA TCCAGCAACA GCCCAACTCG GGCTTTGCCA ACTTCAACTC TCCAGCGGCT
ACTATTGTGG GACAACAGCA GTTTTCAACA AACGCTGGTG CCCAACAGCA GCCGCCTTTG
AACACCTTTG CAAATTTGAA CGCTTCAACA TCTACTGTAG CTGCTTCCGG AAGCCAGCAG
CCGCAATTCC GTTCGAACAC TGGTGCTCCC CAGCAACAGC AACCGGTCCC TCAGATAAGC
GCATTCGGAG CCACATCGGC TACCACAGCA CCACAGCCCC AACCAGGATT TGCCAATTTT
AATTCTGGTG CTTATGCAGA CCCTCAGCAA TACTTTTCGA CGAACAGCGG ATCTCAGCAA
CCTCAATCGT CGCCACAACC AAGCGGGATT GCTAATTTGA ACACGGGTCA GGCACCAGCT
AAAATGAATA ACGGAACACA GCTGCCTCAA CCAGGATTCG CCAATTTCAA TGCAACATCG
GCTGGTGTTG CCGCGAGCCA ACAATATTTC CCTCAGAATA ACAATAACAG CTCTCACCAA
GCAAAGCTTC CGTCGGCACA GACGAGCGGA TTTGCCACCA TTCATTCGGG AGGTGTACAT
GGTACAAACA ACGCTCCAAA TCAGCCGCAA CCTGCAGTTC CAAATTTCAA TCCAAATGCG
GCTGGAAGCG TTCCCGGAGG CCAACAGCAA TTTTCGCCGA ACGGGAACTC GCACCAAATA
CAGCAGCCGT CGCCACACTC GAGCGGATTC GCCAACTTTA ATTCAGGAGG TCCACCTGCT
GCCACGGATA ACGCTGCACA GCAACCGCAT TCTGGATTTC CGAATGTCGA CGGAACATCA
GTTGGTAAGG TTCACGGAGG CCAACAGCAA ATTTCGTCGA ACAGTAACTC CCACCGAATT
CAGCAGCCGA GCGGAGTTGC CACCTTTAAC TTGGGATCTG CGCCCGCTTC ACCGAAGAAC
ACAAGACAAC AGCCTGGATT CAATCATTTC AACACTGCAT CGGCCGCTCC TGGGGGGCAG
CAGCAATTTT CGAACGGTAG CTCACCGCGA ATGCAGATGC CTTCGCCACA GCCGAACGGA
TTTGCCAATT CAAGTACAGG AGGTGAAGTA AATAAGGCAA TGCAGCGGGC GCAATTTGCT
TCGCCTGGCT CACAAAGCGT ACAGGACAGC TCCATTCCAG GCAATGATGG ACTGCACCAC
AGGGGAGTCA ATAGTTCGAT CACTCAAGGA GGTATACCTT TCGTGCAAGG ACGTAACACA
GAGCAACCGA TATCAAGTGT CCAGCAACCT ATGCACTTCC AAGGCGGTAG CATCAGTCTT
TCTGACGGTC TTAGAGAAAG TCACATCTCG TCCATTACAC GAGGTATGGG GAACTTGGGT
AATATTGCCT CCCAGGCAGG AAGCGTGCCG CATGCTATGA ATCAAACGTC GATTCATCAA
CAGCCAATCA GTGAATTATC CGAACAGCAA AAGTCACAGC ATAATATTCA TCATATCCAC
TCTTCCCCTG GTCCAGATCG ACAAACACCT ACGGAAGCTG CGAAAAATAC ATCTACGGAC
GGTGACTCCG CCCCAAATGA TGGTAAAACA GCATCCGTAT ACATGGAAAA CCATCCATCC
AAATTCACGG CTGGCCAAAC TGTGTACTAC AAGAGCTCCA CTTACGTGGG AAAAGCGAAG
ATCATGAAGG TGCATTTGGA CGATGATCTT GAACCATTCT ATACTATTCT TGTAGACGGC
AAAGAGAAGC AAACAGATAA TGGGCATTTG TCGGAAAGGA GTCCTTTGGA GGAAAAGGTG
CAGGAATTGA TTGGTTCTTT GACTGAGGAT CAGCTCTTGC AAGTTCATCA GTTTATTATA
AGGTTCCCAT TGACTGTGAC CAGTTCAGAA ACAGATTCTG TTGTTCCTCC TGCTGCTTCT
GCCACCATAA TTACTGGCAG TAGACAGCCA CCGGTCTTAG CTTCGACTTC ATCAATGTAT
CCTCCTGCTT CTTTGACTGC AAACGTCCCC ATTCCTGGTT CGGGGCAACA ACAAGCTGCA
ACAGGAGACG CGCCAATGTC TCCGAAAGGA AATCCATTTG ATTTGTACTA A
 
Protein sequence
MLQPPGSHNG AGSRNVRRQN LGHSKNTSTS PDDQVRLLQR LPPNKRCCDC RAKLPSCVNL 
TVGSFVCPAC AGIHRELNQR VKGVGHSSFT DKEVEFLQSV GNNDLINAIY LATYDDAQSS
RGGRIQEPKD NTDPQHLKTW IRRKYVDRAW YRPSSSTAAA PHPTQQPPHA TIVAIPPTAP
GTAGSTDFWS NNNNHASPAP AWDAFGSNNT TATTNAGSSQ SGFGPAQSPP PQAPNVVTNF
ANFNTAPPPT QHGPPNPGQQ LQQQQQAVNS FANFEASGSN GNGIQQQPNS GFANFNSPAA
TIVGQQQFST NAGAQQQPPL NTFANLNAST STVAASGSQQ PQFRSNTGAP QQQQPVPQIS
AFGATSATTA PQPQPGFANF NSGAYADPQQ YFSTNSGSQQ PQSSPQPSGI ANLNTGQAPA
KMNNGTQLPQ PGFANFNATS AGVAASQQYF PQNNNNSSHQ AKLPSAQTSG FATIHSGGVH
GTNNAPNQPQ PAVPNFNPNA AGSVPGGQQQ FSPNGNSHQI QQPSPHSSGF ANFNSGGPPA
ATDNAAQQPH SGFPNVDGTS VGKVHGGQQQ ISSNSNSHRI QQPSGVATFN LGSAPASPKN
TRQQPGFNHF NTASAAPGGQ QQFSNGSSPR MQMPSPQPNG FANSSTGGEV NKAMQRAQFA
SPGSQSVQDS SIPGNDGLHH RGVNSSITQG GIPFVQGRNT EQPISSVQQP MHFQGGSISL
SDGLRESHIS SITRGMGNLG NIASQAGSVP HAMNQTSIHQ QPISELSEQQ KSQHNIHHIH
SSPGPDRQTP TEAAKNTSTD GDSAPNDGKT ASVYMENHPS KFTAGQTVYY KSSTYVGKAK
IMKVHLDDDL EPFYTILVDG KEKQTDNGHL SERSPLEEKV QELIGSLTED QLLQVHQFII
RFPLTVTSSE TDSVVPPAAS ATIITGSRQP PVLASTSSMY PPASLTANVP IPGSGQQQAA
TGDAPMSPKG NPFDLY
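
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a simple GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the short string used here is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two blocks of the gene sequence, as printed in the record.
sample = join_blocks("ATGTTGCAAC CTCCCGGAAG \n")
print(sample)       # ATGTTGCAACCTCCCGGAAG
print(len(sample))  # 20
print(round(gc_fraction(sample), 2))  # 0.55
```

Applied to the full gene sequence, `len(join_blocks(...))` would be expected to agree with the Gene Length field, and `gc_fraction` with the GC content field after rounding to a whole percent.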