Gene PHATRDRAFT_47943 details


Gene Information

Locus tag: PHATRDRAFT_47943
Symbol:
ID: 7203132
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 512733
End bp: 516915
Gene Length: 4183 bp
Protein Length: 1167 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002182242
Protein GI: 219123876
COG category:
COG ID:
TIGRFAM ID:
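
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which is consistent with the listed values. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
gene_length = feature_length(512733, 516915)
print(gene_length)  # 4183

# Bases needed to encode the 1167 aa protein plus one stop codon.
coding_bases = (1167 + 1) * 3
print(gene_length - coding_bases)  # 679
```

The 679 bases not accounted for by the coding sequence would be non-coding (for example introns or untranslated regions), which fits a gene marked as spliced.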


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACAGTGT GTGTGTCAGC TTTCTGGTCC ATTCACTACT CTCGAAAGGC TAGAATTGAT 
TGTATTGTGT ACAGCGAGTA TCTCCTTTCC TTCTCTATTT GCCTCTGGTC TTCGTTCGGA
AGGAAGAGGA ATTGAAAAAA AGGAATCGTT GTCGTTTCCT TCGTCGAATC GCATCCCTCC
ATTCGATTGG TGGTTTCGGT AGCGTATCCC CCGTTTCCGG TTCTCGACAC ACAGCTACAT
CCACGTACCT ACAGATACCG AGTATTGACA CGAATACTAT TCTCATGGCG TCTCCAAATC
CTGCGTATGC GGGACGTGGA GGTCGTGGCG GACGGGGTGG ACCTCCGCCC TATTACAACA
ACAACAATAG TGGTGTACCG GGTCCCCGTC CTCCCACCGG AGCCTGGCAA CCCGCACCCA
CTCATACGAT GCCGACACAG TCTACCGCCG TGCCGAATCC GTCATCGCCC AATCCTCACC
ATCTCACACC CCAACAACAA CAGCAACAAC AACCACCTCT CCTGAACAAT CCCTACGCGC
CAAACCAAGC AGGACCCCGT GGAAACCAGC CTCCTCCATA CCCACCCCCG GCCTACTACA
ACCAGCACTA CGCAGCTGCC GCACAACAGC AGCAGCAGCA GCCGAATCCT TACGGCGCCA
CGGGTCGGGT TCCGTGGCAG CCCGTACCGA ATCAAGGCTA TCCCTCCACC ATGAGGGGTG
GCGTCTTTTA TCCACCTCCC GCCGCTGCCG CTGGACCCGG TCGTCCTTCC GCTACCAACG
CGGGGACCAT CCCAGCCTCC GTCGTGGTGC CCCGCGAACG CAAAGCTCTC GTCATTATGG
TACGTCCCGA GTGCCATGGA CCAGCGTCCC CCGCTTTGCG CTCGTCTCAC ACCGCTCTAC
CCGATTACTC GTGTTTGTTT CTGCCGTTGG CGACTTTTTT ACACCAAAAC AGGATAAAGA
CGGCAACGTG ATGGATTTCA CCAAAACGGC TCCCAAAAAG ACTACGGCCA CGACTTCGGC
GACTCTTGCC CCGTCGGCGG TCCCGGCCAA AACGGAATCG AGCAGCCAAG ACGCTGGCGC
CAAGTTGCGA CAGGCCGCAC TCGAACGCAT CCAAGCCCAA GACAAGGCCA AAAAGGAAGC
CGAAGAAAAA GCAAAGGAAC AATTGTTACA GGAAGAACAA GCCAAGAAGA AAGCAAGTGA
CGAGACCGAA GTACAAGGTA AAAACACGTC CGAGGCCCAA AGCAAGGTCG CCGAAGAGCC
AACCAAGTCT CTGGCGGATC GTTTGAAGCA AGCACCCAAA CCCGCCACTG TCAAAACGAC
GACTTCTTCT TCCGGTCGTA TCGTCTTTTC CAAGTCGGCG TTGCTCAAGT ACGTACCGTC
CTTCGTGGCC TGGCGGAACA CACTCAGCCT TTCGCCTAGT TTTTTACGTT TTGGATCTTT
CCCTTTTCTC ACCCGCTCGT ATTTCATTTT TTAAATCTCT CTTGCAGATT TAAGAGCACG
GAAAGATGTA TGCAATGTCC CGAATCGCTC CCTGACATGA CGATCGTCAA GGGACCCGCC
CGTGGCAAGA GCGGGGGCAA CCGCGGTGGT GGTGGTGGAG GAGGCGATCG ACGCAACGAC
CACGACAGCG GTGGCAGTAG TTGGAAACGA GGCAACGCAC CCCCACGTCG TCAAAGCGCC
ACCAATAACG ATGGCGGTGG TGGTGGCGGC GGCGGCGGCA GCTACTGGAG CCGCGGACAA
GCTCCTCCGC CACAGCAACA GCAACAAAAC AACGACAAGC ACCGGAACAA TCGGGGTGGA
CGCGGTCCTC CGCCACCTCT GTACGACGGC CCCGTCGCGC CTCTAGTGAA GTCGGAGAAT
CACTGGCGTC CACAAAAGAA CGCGTCAGCT TTTATTATTG CCGAGAAACA GGTCAAGGCT
ATTTTGAACA AGATGACCAA AGAAAAATTC GATAAACTCG CGACACAGAT GCTGGAGATT
CCTTTGACCT CTTCTGATAT GTTGAAAATG ATGATTAACA ACGTGTACGA TAAGGCCATT
GACGAACCTA CTTTCGGAGA TATGTACGCT GATTTGTGTA AGAGACTTTC CAAGATTGCT
ATCGACTTTA TTAAAATTAT CGAGTCGGAC GAAGAGCCCC CGACGGACGA CGATACGACC
ACGGAACCGG CATCTCCAGC TGACGATAAA AGTAGCCACC ATACTGTTTA TCGCTGGTCG
AACGACGTGA GTACTACCGA TAGTGAGATT GTTGGACCGT TTGAATCCGA GGAAGAATGC
ATCGAGGTTG CTCTTGGTCA AGCAAATGAG CCTACTCCGG TTGAGCGAGG CGAAATGTCT
CTGGAACTGG TCAGCGCAAC CATTAGGAAA GGCGTGTTTA TCAAAATAAT GAAGCAAAAA
AAGGACAAAG ATGGCGAGGA ACCAAAGCTC TACACCGTAT ATTTCCCAGT AAAGGAAGCA
AAAGAGTGTG GTCAGCAGCT GTCAAACATT TTCCTGAGCA AAATGGAATG TGTTTCAGAT
GCGACCAAAC TTAATAGCTT CAAGCGCTCG CTACTCAACA AATGTGAAGA AGAGTTCGAC
AAACAGGATA TTTATGTAGA CTGGAAGAAA GAAAAGAATG ATTATGAGGC AAAGAAGGCT
ACACTTACTG CTCAAGAACG GGCTGAAACA GAAGCTGAGC TGGACTTTCG TCGCATTCGA
ATTAAGAAGC AGATGCTCGG AAACGTTAAG TTCATTGGGC AGCTGTACAA GAAGGGTTTA
TTGAAGGAAA AAATTATGCG GTACTGCATT GCCAGTCTGC TAAAGCTGGA AGCAAACGAC
GCCAAAGCCA AGAATCCGTT GTACCGAGAC ACCGGTGATT TTGATATCGA CGAGGAAGAC
CACGAAGCGA TTTGCAGCAT GTTCACGACT ATTGGTCTGA CTATTGACAC TCTTTCGACC
TCTGATTTCA TGAGCGTTTG TTTTGAGAAG ATTTCAAAAC TGAGCAACGA ATCAAGCCTC
CCCGCCCGAT CTCGTTTCAT GTACAAAGAT TTGCTTGAGC TACGTGACAA TAGATGGGTT
CCGCGGCGCA AGGAAGAAAA GGCGAAAACG CTCGAAGAGA TTCGAAAAGA TGTGGAAGAG
GAAGAGCGAC GACAGGCTCA GCAGTCGATG CGCAATAATA GCAACCGTGG AGGTGGCGGT
GGCAGCCGAG ACTTTCGTGG TAGTGGCATC GGCAATGCAT TCGGAGGAAG CAACCGATCG
CGTCCACAGA AGCTATCTAG CGAAACGGAC GCTGACGGGT TCGTCGCTGT CCCCACCAAA
ACTGGCTTTG CGTCCCTGAG AGGGCCAGCA GGGAAACCCA GGCACCAGCC TAGTGAAAGT
ACCAGCACGG TCAGTACGAA ATCGTCAGCT TTTTCTGCGC TTGCTAAGGA TCGAGATCCG
CCTTCATCGA AGAAGTCTCC CGCCCCACTC GATAGCGATA CCTTGGAACG CCGGATTAAG
CGTATACGAA CGGATTTTAT GGGTGATGGA GGGAACGTTG AGGAGCTTCT ACTAAGCTGG
GACGAGATAT GTGGCACACC AAAGGCTGGC GTCACCCTTG TTCAAAAGAA TTCCGATAGA
ATGATGGACT GCAAAGAGGA TGAAAGGCAA GCCATCGTGA GAATTGTAGC GATACTAGCC
GAGAAAGGCA AGCTCACAAA AGAGGACGTT CGCACTGGTC TGCAAGACGC GATTGAATTT
ATCGACAGTT TCATACTTGA TTCACCACGG GCTTACGAGT ACTTGGGTGA CATGCTTGGG
GAAATGCTGA AGTTGAAAGC GATTGATGTC GCATGGCTTT GCAAAGAGAC CGAGAAGACG
AAAGTGGATC AAAATAGCGA GGCACCGATC CGACTAACAC GCGCAACCAT TTCAGCCTTA
AAGGCAGCCG CGGGAGTAGA CTTTGCAAAG AAATGTTTTT CTGGATCCAG CGAGAAGGAT
TTGATTGGTT TGATTGGTTC CGATTCCTGG AAATCTATGG CGGCTGATCA ATTTTCCTAA
CATTCAAGAG TATAAATCTG AAGTTGAGAA TTGCAGTAAC CTGTATATGG CTGTATGATC
TGAACCGTCT TCCGGGCAAG CGAAGAGCTT AACCTTTCTT GTCACAATGT GTGATCAATA
ATACAGTTAG CAATAGTCTA CTTTGAAATT AATAGTGTTA TCG
 
Protein sequence
MASPNPAYAG RGGRGGRGGP PPYYNNNNSG VPGPRPPTGA WQPAPTHTMP TQSTAVPNPS 
SPNPHHLTPQ QQQQQQPPLL NNPYAPNQAG PRGNQPPPYP PPAYYNQHYA AAAQQQQQQP
NPYGATGRVP WQPVPNQGYP STMRGGVFYP PPAAAAGPGR PSATNAGTIP ASVVVPRERK
ALVIMDKDGN VMDFTKTAPK KTTATTSATL APSAVPAKTE SSSQDAGAKL RQAALERIQA
QDKAKKEAEE KAKEQLLQEE QAKKKASDET EVQGKNTSEA QSKVAEEPTK SLADRLKQAP
KPATVKTTTS SSGRIVFSKS ALLKFKSTER CMQCPESLPD MTIVKGPARG KSGGNRGGGG
GGGDRRNDHD SGGSSWKRGN APPRRQSATN NDGGGGGGGG GSYWSRGQAP PPQQQQQNND
KHRNNRGGRG PPPPLYDGPV APLVKSENHW RPQKNASAFI IAEKQVKAIL NKMTKEKFDK
LATQMLEIPL TSSDMLKMMI NNVYDKAIDE PTFGDMYADL CKRLSKIAID FIKIIESDEE
PPTDDDTTTE PASPADDKSS HHTVYRWSND VSTTDSEIVG PFESEEECIE VALGQANEPT
PVERGEMSLE LVSATIRKGV FIKIMKQKKD KDGEEPKLYT VYFPVKEAKE CGQQLSNIFL
SKMECVSDAT KLNSFKRSLL NKCEEEFDKQ DIYVDWKKEK NDYEAKKATL TAQERAETEA
ELDFRRIRIK KQMLGNVKFI GQLYKKGLLK EKIMRYCIAS LLKLEANDAK AKNPLYRDTG
DFDIDEEDHE AICSMFTTIG LTIDTLSTSD FMSVCFEKIS KLSNESSLPA RSRFMYKDLL
ELRDNRWVPR RKEEKAKTLE EIRKDVEEEE RRQAQQSMRN NSNRGGGGGS RDFRGSGIGN
AFGGSNRSRP QKLSSETDAD GFVAVPTKTG FASLRGPAGK PRHQPSESTS TVSTKSSAFS
ALAKDRDPPS SKKSPAPLDS DTLERRIKRI RTDFMGDGGN VEELLLSWDE ICGTPKAGVT
LVQKNSDRMM DCKEDERQAI VRIVAILAEK GKLTKEDVRT GLQDAIEFID SFILDSPRAY
EYLGDMLGEM LKLKAIDVAW LCKETEKTKV DQNSEAPIRL TRATISALKA AAGVDFAKKC
FSGSSEKDLI GLIGSDSWKS MAADQFS
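
The sequences above are displayed in space-separated groups of ten characters. A minimal sketch for turning such a block back into a contiguous string and computing its length and GC fraction is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool referenced by this record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from above.
first_line = "GTGACAGTGT GTGTGTCAGC TTTCTGGTCC ATTCACTACT CTCGAAAGGC TAGAATTGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full gene and protein blocks, `len(clean_sequence(...))` should agree with the Gene Length and Protein Length fields, and the GC fraction of the gene sequence can be compared with the GC content field.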