Gene PHATRDRAFT_47058 details


Gene Information

Locus tag: PHATRDRAFT_47058
Symbol:
ID: 7202132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 299208
End bp: 301144
Gene Length: 1937 bp
Protein Length: 606 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181347
Protein GI: 219122008
COG category:
COG ID:
TIGRFAM ID:
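
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are both inclusive, and that the protein is encoded by three bases per residue plus a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 299208
end_bp = 301144
protein_length_aa = 606


def inclusive_span(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both counted."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def coding_bases(protein_length: int, stop_codons: int = 1) -> int:
    """Bases needed to encode the protein: three per residue plus the stop codon(s).

    Assumption: one stop codon and no frameshifts.
    """
    return 3 * (protein_length + stop_codons)


gene_length_bp = inclusive_span(start_bp, end_bp)
cds_bp = coding_bases(protein_length_aa)

print(gene_length_bp)           # 1937, matching the reported gene length
print(cds_bp)                   # 1821
print(gene_length_bp - cds_bp)  # 116 bases not accounted for by the coding region
```

Under these assumptions, the reported gene length of 1937 bp is consistent with the coordinates, and it exceeds the 1821 bases needed to encode a 606 aa protein by 116 bases.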


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGTAAAATA CGAAATCTGC ATTGATAGGG CCCGGCCTTT ATTTAGGAAT TGAGTTCCTA 
TCTTTACGAT CCGTCATGGC GATCAAGAGC TCTTCTAGTA ACGTGACAAG CAAGCGCAAG
GATGGGGACA GCAAGTTTTC CTCGCAACCC TCGAAGAAAA GTAAAGGAAG TGGTGCCAAA
CACTCTACAG TGCCGCCTTC TACAGTGGGT TCGAAACGTG CTGTTAAGCA GGAGCGACAG
TCCCAGCGCA AGCACGCCGA CGTTGTGAAC GACGCCAAGC GGATTTGGAA TCAACTGCGT
CTCAAAACAA ACACGTCCGG ACAGAACCGT CAATATATGG ACACGCTCAT GCCCTTGATT
ACGGGCAAGG CCAATGAAAT CGCGTTACAG CACGACGCGG CGCGTGTCGT TCAAGCCGCG
ATTCAGTTCG GAACGGTTGA AGAACGGCGT CTAATATTAC AAGAACTGTG CGCGAAGCAA
AACAACTTTG CGGAACTCTG TAAATCTCAA TACGCACACT TTTGCGCTTT GAAAGCGATC
AAGTATTGTC ACAGCGACCC AGCCTCCGTG AAATTGATCA ACAAAGCACT CAAAGGACAT
ATGCCTAGAT TGGCTGTGCA CGCCGTCGGA TCTCGGGTTG TACAGTCAAT TTTCAGCACA
ATGACTCCAA AACAAAGTGC GGTGCTCAAG CAAGAGTTTT ATGGGCCACA TTTTGCTTTG
TTTGCGCTGG ACCTGCCACG AAATGATGCT GTGCCGACGT TGGCGACGAA CATAGCTGAG
GCGCCGGAAA AGAAAGAAGC AACACTCATA TTTGTACGCA ACTTGATCAA TAAAGGCATG
GAAAAGACTC TCTACGGTTT CACTTACTTT CAGGATCTGT TTGCCGAGTA TTGCGAGGTG
GCCGACCCGC GAGAGATTCG CATTTTGGCG GGCACGGCGG CCGACAACTC AATTCATCTC
TTATCTGGTC GCGCTGGGAC TCGCGTGGTT GCTTCCTTGA TTTCCTATGG CACTGCTAAA
GATCGGAAGC GCATCATGAA AAGCTTGAAA GGCTACACCA AGTCAGGACT ACTGCATCAC
GATGCATACC TAGCAATCAT CCGTTTGGTT CAGTTGACGG ATGATACAGT GTCTATTCAC
AAAAACATTT TCAACGAACT GCTCTTACCG AGCGATAAAT CCGACGAAGA ACTATCTTGT
CCACTTTTGG AGCTGGCGCT TTCCGATACC GGCTCCAAAC TTCTTTTAAT GCTTCTTGTT
GCGGATCCGG AAACATTGAA GAAGTTTTTT GATCCCTACG AGCTTTCGGT ACTTTTCGAA
AATCCGACTG TGATAGACGA TGGTCAAGAA GTTCTGACAA GCAAGAAGGA GCCAGAAATA
CGACGAAAAG AACTCATCAA GTATCTTCGA GAGCCGTTAA TTGAGATGTG CGCCAAAAGT
GCGAATGAGT TGATCAGGTC TCGGCCGGGA GCTCTTGTAC TTCGAGAAGT GTACCACTCG
TATCGACCGA TCTCGGTAGT TGAAGCGATT GTTGGAACGT GCCAAGCAGC ATTGAATCAG
GATTCCTCGA AAGGCGACGA AAAGGATGTC CAGAATTATC GCCTCTTTGA AGATAGAGAT
GGACATTTGG CAGTGAAGAA TCTTTTATTG GCTGATTCCG CGAAAGAATC AGAGGCCAAG
CTGGCGTCAG CTTTTTTCGA AACCTTCCAA GATCGACTGA TGGAGATTGC CCAATCCAAT
CGCGGGGCTT TCGTTATTAC AGCTCTTTGT AAAGTTTTGG CTGTAAGAAA AGGCGCTATT
TCCAAGTTGA ATCAAGCACA ACTGAAAAAA TTGGCCGACG GCAAAGGTGC TACTGCGGGG
TTTAAAGCAC TAGTGAATGA GATGGATGGC AAATAGCTTT TCTGCGTAAT TTCTGCATCG
TAACGATTAC ATATCAT
 
Protein sequence
MAIKSSSSNV TSKRKDGDSK FSSQPSKKSK GSGAKHSTVP PSTVGSKRAV KQERQSQRKH 
ADVVNDAKRI WNQLRLKTNT SGQNRQYMDT LMPLITGKAN EIALQHDAAR VVQAAIQFGT
VEERRLILQE LCAKQNNFAE LCKSQYAHFC ALKAIKYCHS DPASVKLINK ALKGHMPRLA
VHAVGSRVVQ SIFSTMTPKQ SAVLKQEFYG PHFALFALDL PRNDAVPTLA TNIAEAPEKK
EATLIFVRNL INKGMEKTLY GFTYFQDLFA EYCEVADPRE IRILAGTAAD NSIHLLSGRA
GTRVVASLIS YGTAKDRKRI MKSLKGYTKS GLLHHDAYLA IIRLVQLTDD TVSIHKNIFN
ELLLPSDKSD EELSCPLLEL ALSDTGSKLL LMLLVADPET LKKFFDPYEL SVLFENPTVI
DDGQEVLTSK KEPEIRRKEL IKYLREPLIE MCAKSANELI RSRPGALVLR EVYHSYRPIS
VVEAIVGTCQ AALNQDSSKG DEKDVQNYRL FEDRDGHLAV KNLLLADSAK ESEAKLASAF
FETFQDRLME IAQSNRGAFV ITALCKVLAV RKGAISKLNQ AQLKKLADGK GATAGFKALV
NEMDGK
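
Both sequences above are printed in blocks of ten residues, so the blocks have to be joined before measuring length or base composition. The following is a minimal sketch of that step. The helper names are hypothetical, and the way the database rounds its reported GC content is not stated in this record, so the percentage here is computed simply as (G + C) / total bases.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = join_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence and last two lines of the protein sequence,
# copied from the record as short excerpts.
gene_first_line = "CGGTAAAATA CGAAATCTGC ATTGATAGGG CCCGGCCTTT ATTTAGGAAT TGAGTTCCTA"
protein_tail = """FETFQDRLME IAQSNRGAFV ITALCKVLAV RKGAISKLNQ AQLKKLADGK GATAGFKALV
NEMDGK"""

print(len(join_blocks(gene_first_line)))  # 60 bases on a full line
print(len(join_blocks(protein_tail)))     # 66 residues in the excerpt
print(round(gc_percent(gene_first_line), 1))
```

Applying `join_blocks` to the complete gene and protein sequences and taking `len` of each result is one way to compare them against the reported 1937 bp and 606 aa.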