Gene PHATRDRAFT_47845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47845 
Symbol 
ID7202979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp221358 
End bp223475 
Gene Length2118 bp 
Protein Length620 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182181 
Protein GI219123749 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0593792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAATCGCGTC CACGCTCGTA GCATGGAGTC ACCGTAAAGA AGTAAGGTTT ATGATACCAT 
CTTTCAATTG TCTTCCGTGC GTGTTAGAGC TCAGCTTAAT TTCCTTGGTA CATAATAGTC
CTCTCGCGTG CGCTGCATTG GCAGGCAGAA ACTTCGTTGA CGATAGAGTC TGCCTGCCCT
GGATCTAAAC CACAGCCAAA AATATGACAT TGACGTCAAG TATTTCAACC GATTCAACAA
ACGGTAAAGT TGGACAACTC GAAACGCCAA ACCAGCGTAG TATGGCAACA GAAAGAAGTT
TTCTTTTCGG TTCTCGTCGG GTGTTTACGT TCGCCATGCT TGCTGGAGTT TTAATTTTTC
TGATGGGGAG TCTCTCCGTG TCGATGCAAT CGCAGCAATA CATCGAAGCG CTCTATTCGG
GGTTGTCCGC GGTCGCTAGT CTGAATAACA GCACCACCCT AAAAAGCGTT GAGGAATTGA
CGTATGTTGC ACAGCAACAT TCTCCTGGAC TTACGGGACG AACATTACCG GAGCATCCCC
ACGAAACACG AATACGACAA GCGATACATT CCCACTATCC TCATTTTGAA CATCATTCGG
TTCGAGGTAA AGTTGTGGAC GTGACGCAGA ATGTTAACCG GATCGAAGAG CTGTCCTTAA
CGAGGAAGAA AAATGTCACC TACATGGAAG TGAAGGATGA GGACCACGGA CCGTTGAATG
TCGTCCTGTT TTACGCTGAC GACTGGACTT TGAAAGTGCT TGGAGCCTTA AACCCTCACG
TAAAAACACC AAATATTGAT CAAATGGCGA AGAATGGAAT GCTCTTTCCC TATAATTGTG
TCACCACGAG TATTTGCTGG ATTTCACGTG CCACGTTGGT AACGGGTGTT TATGCGGCGG
TCCACCAACA ACTAAAAATT GCGCACAACA GTCTATTTAA TAACCTGACC ATTCCGTGGA
CAGAAACTCT ATTTCCACAG CTGAAGAAGC ACGGCTATTA CACGGGATTA GTTGGAAAAT
GGCACGCCCC GTCGCCAGGG AAAGAGATGA AATTGGCCTT TGATGTGATG AATATCTATT
ACGGGCGGCA TTGGGAACTG CGAAACGGTC AACGTCGTCA CGTGACCGAT CTAAACGGCG
AAGATGCGCT CAATTTTCTG AGAAGTCGCC CCAAAGACCA AAAGTTTGCA TTGAAGGTCT
CCTTCTTCGC CACACACGCT CAGGACTACA CAATTCCAGC CTACTCGCCC ATGAACGAGA
GTATGTCTTT GTACGAAGAC GACGACATTC CTTGGGTGCA AACGAATACA GAGCAGCACT
GGAAAGATCT GCCTTGGTTT TTTGACAACC GCAACGAAGG TCGGCGGCGG TATATTGGTC
GCTTCGATAC TCCCGATAAT TATCAATACA ACATCAAGTG CTTGTACCGT ATGGCGACCG
AAGTTGATTC GGTTGTTGGC GAAGTGATTG ATGAACTCAA AAGGCAAGGT GTTTACGACA
AAACGCTTTT GATCTTTACA ACAGACAACG GAAATTTGCA TGGCGAGCAC GGTCTTGCGG
AAAAGTGGTA TCCTTGGGAG GAATCAATTC GAGTCCCACT GGTCATCCAA GATCCACGCA
TGCCAGCAAC AGAACGTGGC AAAGTCAATG ATGAATTCAC GTTGTCGGTG GACCTTGCAC
CGACGATTTT GTCGGCGGCA AAGATTCCGA TACCATCTCA TATGCAAGGT CGGGATATTG
CCGAACTGTA CTTTGATCCA CACCAGGCAA CGGTATCATG GCGTAAGGAT TTCTTTTACG
AATGGAGTCA AGGCGAGCCG GTAGAAGCCG TAGGCCATAA CGAGTACTAC CATATTCCAG
CGGTCTTTGC GCTGATTCGC AAGGACTGGA AGTATTTTTA CTGGCCGCAG GTCAAAGTTG
AGCAGCTATT CCAGATTGAG AACGATCCGT ACGAGCAGCG TGATGTGCTG AACTCGACGG
CTCAAACAAC ACAAGAAGCA CTGGATTTTA TGAGGGCAAG ATATTTTTTT CTAAAGAACT
ACTCCCAAAT GGGCAACCCA GTCTGATACT TCTCAGAAAA ATTCTTTTTC TTCTAATAGT
ATGAAGCATT TTCTCTTA
 
Protein sequence
MTLTSSISTD STNGKVGQLE TPNQRSMATE RSFLFGSRRV FTFAMLAGVL IFLMGSLSVS 
MQSQQYIEAL YSGLSAVASL NNSTTLKSVE ELTYVAQQHS PGLTGRTLPE HPHETRIRQA
IHSHYPHFEH HSVRGKVVDV TQNVNRIEEL SLTRKKNVTY MEVKDEDHGP LNVVLFYADD
WTLKVLGALN PHVKTPNIDQ MAKNGMLFPY NCVTTSICWI SRATLVTGVY AAVHQQLKIA
HNSLFNNLTI PWTETLFPQL KKHGYYTGLV GKWHAPSPGK EMKLAFDVMN IYYGRHWELR
NGQRRHVTDL NGEDALNFLR SRPKDQKFAL KVSFFATHAQ DYTIPAYSPM NESMSLYEDD
DIPWVQTNTE QHWKDLPWFF DNRNEGRRRY IGRFDTPDNY QYNIKCLYRM ATEVDSVVGE
VIDELKRQGV YDKTLLIFTT DNGNLHGEHG LAEKWYPWEE SIRVPLVIQD PRMPATERGK
VNDEFTLSVD LAPTILSAAK IPIPSHMQGR DIAELYFDPH QATVSWRKDF FYEWSQGEPV
EAVGHNEYYH IPAVFALIRK DWKYFYWPQV KVEQLFQIEN DPYEQRDVLN STAQTTQEAL
DFMRARYFFL KNYSQMGNPV