Gene PHATRDRAFT_47188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47188 
Symbol 
ID7201965 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp752093 
End bp755112 
Gene Length3020 bp 
Protein Length916 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181438 
Protein GI219122198 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCAGGTATC CAGCGGGATG AACAGAAGTA CGAGCGACTG CGACCGTACC TCGACCGAGC 
CAAGACGGCA AACCTTCACT GAGAATGATT GTGGCCAACA ACAACAGCGG CAGCAGCAGC
AAGGGCAAAA ACTCCCAAAG AACAAATCCA AAGAGGCCGA ACCGAATGTA GATGCCCCCC
TACGAATCAC GGATGGCACC GACAGTAGCT GCGCCCATTC TTCAATGTCC TCTTCGGAGT
GTAGCTCCAA TGTAGGTCTC ACAATCAGCG CGATTGAGAA AACGACTCGG CGAATACGCT
CATCACAACA TTTTCCGCTG GCGGAAGTTT CGGTACCGAG CTCGATAGCT TCGCCGCCGT
CGGATGCTTC TGTAGCGAAC ACTTGTACTG TGACGGAGAT TTCTGAACCG GGAACTGCAT
TGAAGCGAGC AAGCGTTTTT TCGGAGGAAA GCTACCTTGC GCAATCTCTT CCGATGGCAG
ACAGTACTCC CAGCAAGCCG CAACACGAGA AAAATGCTGA GGCAGAAATC GACGATGAAT
TTGAATCAGC TCTTTTATCA CAGCGGAAGA GCAGGGTCTT GTTCTCTCGT TTGCGAGAAG
CCATACCCCA AGGTCGTCTA TCGATAGTAC GTGCTTTTCA GTAATGTCTG TGTATTTCAA
AAACTGTTTT TACTCACTAG CTCTCTTGTC GTACAGATCG ACCTGTCTCG AAGGGGTCTC
GATGTCTCTC ACGCGTTCCT CTTAAAGGAG GCAATCACTC ACAGTCCACA GCTATCTGTT
TTGAAGCTTG CTTACAATGA ATTTTGCGAC GAAGGGACTA CTATTCTCGC AATGGCATTC
TGTCAAAACG GGGTACATCA CAAGCATTTG TCGGTAGTAG ATCTGGCTTT CAACGAGATA
GGCGATGTGG GGTGCGAGGC GCTGGCAGTA CACGCTATGG CCGGGAACTA CGTATTGCGT
GCAATAGACC TTAGTGGAAA TCAGATTGGA GAGCGGGGGG CGCTTTCTAT TGCCGGTGCA
ATTTTACATG GCACTGGCTT GTCGCGATTG CACATGTCGG CGAACCGAAT TGGATCTATG
GGTGTGCAGG CCGTTGCTGG TGCAATCGCC AATCGAGATT CACGAATAGC TGAGACGGAG
GCTGCGTTGA CGGGGTCAAC TGAAATTCAC AGCATTGTTG ATTTGCAGTT AGGAACAGTT
CTGATAGCAT CTGGAGGATT CGCTGCGATA CCGGGAATGC TTGTGACAAA TACTGCCCTG
CGCTCGCTTT GCGTATCAAA TAACAATCTT GACGATCAAG ATATTCTATT GATGTCGCAA
GCTCTGACAC AAAACAAAAG GCTACCCTTG GAAGAGCTAG TACTTTCTTT CAATCAAATC
ACGTGTCAAG GTGTTGAGAA TTTAATGAAC TCTATATGGG GATCGACTAC GTTGAAGAAA
ATAAAGCTGG ACAATAACCG ACTACGAGAT CGGGGCGCAC AGCTTTGTGC GGTTGTTCTG
ACATCAATTC CACTGCAATC ACTTGATATT GGGTGTAACG CCGTGACGAG CGCCGGGATC
AAAGCTTTGA TGAAGAATGT ATCCGAGAAC AGTTCGCTAA TTTCACTCGG GCTTGCTGGA
ATACACATCG ATCAAAATTC TTCCAAGGCT GTATCGTATG CGTTGGCGTA CAACACTTCG
TTGCAAGCGG TTTACTTCGA CAATTCTCAC GCGGGGTATT CTGCTCAGCG ACACATCGTT
GCCGGAATTG TTTCCAACCA AAGTGCACCT TTACGATTGC TGACAGGCTT CCCTCTTGCG
CGTACGTTTT TGAAATCTTT TTTACTCACG AGCCGTTTTG AGTTCATCTT ACTTATGTTT
TGCTTCTGCT ACAGCCATTG CCGTCACCCT GGGAGTGCCA CGGTTGCCGG AGGTTTGGTC
GAACGACCAG GTGTTGGGTT TTTTCCGGCT TATGTGGCGC CAGTGGGCAA TTAAAGCCGG
ACGCGGAAAC GTTGGAAAAG GCGATATTCC CCGTGGACCA GCGCCTCCAG CGGCGGTTGC
GGCAGCTGCC AAAGTCGCTT TTGCTTCGCT GGGAACTGCG CTTCAAACTC TATTTCAGAC
GGAAATGTAC GAGAAACCAA TTTCGGAACG CCCCTCGGTC GATCCTTCGG ATACTGCTTT
GTTGGAACGA AGTCTATCAG GAACCCTTGA GATCCCAAAA TGTTCCTTCG TGAACGAGGA
CGAATTGAGC GAATGGGAAG GAGGAAAGTC GAAGATGGAC GGCACCGATA CACTGCCTTC
TTCGGCGCAT ACACTATCCG TTCAGGAAAC TTATGAGAAT TCCGAGCGAC GTAGTCGCAA
TTTACGCTGG CTAAGGTTGC ACTTTCGTTC ACTATCAGAG GTTGGGCGAA TTCCTTTCAA
CAACGCCGAA CTTTGGCATC TACATCAGTA CTATTTCTCG CCACCGAATG TCGTGCTTCA
TGACTGCGAT GGTCTGCATC ACGAGGTAAC GTCGACGCCT GCCCGCGGTA TGGCAGCACC
TTCAACACCA AATCAGCAGC CCAGCGCACC TGGAATGGGT CGCGCGATTT CCTTTCAATC
GTTGGGAAAT GCATTCTCTG TCTCCCGCTC CCTGTCACAC GCTGGAGGGC ACAAACGGCG
ATCTGAAAAG CAATGCCAAT CAGAGGAACA ACCCGCACTG AAACGCCCGA AAAATTCAAA
GCCTAGGATC GCCTACTATC CAAGAATCAT GGTGAGTTGC AATATTCGAG TAACTGGGTC
TCTTGTGAAA GTTTTTTTGC TCACAAGCCG TTTCAGACCA AGCTACAGGC TCTTGGAAGT
AGTCAAGCAG ATCAAATACT AGCGTTGCTT CGACAGTTAA AATTCGCAGA AAGCTTGTTG
TTTGCTGGAA AGAGTCTCTA CTGTGATGAA GCTTCGATTG CTGATAACGA AGCTCATTAC
AGTGACGTCG AAATGATCCT GTTAGATCTT CTGTAGCGAG TTACTGCGAT CTGTCAATAA
TTCTATAACT TCACATGATA
 
Protein sequence
MNRSTSDCDR TSTEPRRQTF TENDCGQQQQ RQQQQGQKLP KNKSKEAEPN VDAPLRITDG 
TDSSCAHSSM SSSECSSNVG LTISAIEKTT RRIRSSQHFP LAEVSVPSSI ASPPSDASVA
NTCTVTEISE PGTALKRASV FSEESYLAQS LPMADSTPSK PQHEKNAEAE IDDEFESALL
SQRKSRVLFS RLREAIPQGR LSIIDLSRRG LDVSHAFLLK EAITHSPQLS VLKLAYNEFC
DEGTTILAMA FCQNGVHHKH LSVVDLAFNE IGDVGCEALA VHAMAGNYVL RAIDLSGNQI
GERGALSIAG AILHGTGLSR LHMSANRIGS MGVQAVAGAI ANRDSRIAET EAALTGSTEI
HSIVDLQLGT VLIASGGFAA IPGMLVTNTA LRSLCVSNNN LDDQDILLMS QALTQNKRLP
LEELVLSFNQ ITCQGVENLM NSIWGSTTLK KIKLDNNRLR DRGAQLCAVV LTSIPLQSLD
IGCNAVTSAG IKALMKNVSE NSSLISLGLA GIHIDQNSSK AVSYALAYNT SLQAVYFDNS
HAGYSAQRHI VAGIVSNQSA PLRLLTGFPL APIAVTLGVP RLPEVWSNDQ VLGFFRLMWR
QWAIKAGRGN VGKGDIPRGP APPAAVAAAA KVAFASLGTA LQTLFQTEMY EKPISERPSV
DPSDTALLER SLSGTLEIPK CSFVNEDELS EWEGGKSKMD GTDTLPSSAH TLSVQETYEN
SERRSRNLRW LRLHFRSLSE VGRIPFNNAE LWHLHQYYFS PPNVVLHDCD GLHHEVTSTP
ARGMAAPSTP NQQPSAPGMG RAISFQSLGN AFSVSRSLSH AGGHKRRSEK QCQSEEQPAL
KRPKNSKPRI AYYPRIMTKL QALGSSQADQ ILALLRQLKF AESLLFAGKS LYCDEASIAD
NEAHYSDVEM ILLDLL