Gene PHATRDRAFT_43359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43359 
Symbol 
ID7197108 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp238185 
End bp240358 
Gene Length2174 bp 
Protein Length689 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177891 
Protein GI219112279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCAGG GATTGCCACG TACCGGCATG AACACCGGAG GGAACGTCCA AAGTGCACCG 
TCCAAGCGTT GGGTTGATTG GGGGACACCA GCAACAACAA TGGTGGTTAC GTTGCCGTTT
CGCTTTGCTT CCAATCGACT GTTCTTTCCT GTTGTGACCA TGTTCGACCG ATCCAGTCGT
TGCAACTATT GGTTCACGAA TTTTTGGTCG TCATTGGCTT TTCGGTCGAG TGAGTCGAAT
CACGGGACAC CCAGGACAAC CCGGGACCGT TGCCCCGTTG GGGTGCACGA CAGACTGTGC
CAAAGCAATC GGGACGCTTG ACGGTATCGA CCGAGTTCGT CCCGATGTTT CGGACACCAT
GGGGAATCAC AGATCCTGAG ATCGTTTGGC GGTCCCCCAA TTACGGATTA CATTCGTATT
CTAGGATCGT CATTGACCAT GACTGACGGG TCCAGGGAAA TAGCGCCGTC TAGTGCAGAG
GCAATGTTGG ACACACTATA TCTATATCAA AACCCGACGA CTGGGGAAGT CTCAACGACG
GCCCGTTGGA CGTCGCGACA ACTCTGTCGA CTACTGTGCC CCTCTACGAG TACCGCTATT
CTACCGCAAC ATTTGACCCT CGATACACAA ATCTTGCGTT TGAATACGGA TGGCTCGTAC
GCCAATACAG GATGGCAGGC TGCCAAAACG GCACCGATCG TACGGCAAGC CGTCGAGATT
TGGTACTACG AACAGGACGG CGCCGTACAA GGTCCGGTTT CGAGTCGACA GCTGGCTACC
CTCTACTACG ATTGCCCCGT AGTCTTGTAC CCCACCTCAC GTGTCTACTC GGAAAGTACA
CCTTCCTGGA CGCCGATTCA GTCTCTTCCG TTACTGCAGC TTGCTCTGGA AGCTCTCCGG
CCGAACGGGG TGAACTCGTT GGGGACAACC CAGGATACGC CTACCTACGA TCCTGGCTTT
TTGGCATTTC CGTCAAACAC AAAAGTTTCT GAAAAGGAAT ACGATGAAAT TCCCAAAGAA
GCCAAGGACG AGCTTGAAGT TTTCTTGCAA TCCACCGCCA TCATTGGGGG ATCCCGGATT
ACCGAAGATG AAGAAGATGA AACTTACGAA AGTGACAATG GTACACGATA TGTGAAAGAT
CCGCGCACAG GAAACTGGAT TCACGAAGCA CTCGCCCCGA AGCAGCCGCA CAAGAAAGAG
AGCAACGAAG CAAAATCCTC TTCCCATCTC CAAACGGCAT CCGCACATCC ACCCAAAAAA
CGCAAAAAGG CAAAATTTGC GGCCAAAAAT TCCAAGTGCT GGATTTACGT CACGGGTCTG
CCACCCGACT GTACCGAAGA AGAGATAGCT TCGATCTTTT GCAAAGCTGG AATCATTGAC
TTGGATCCGG AAACACAGCA ACCAAAAATA AAAATATACC TCGACCAAGC ATCAGGCTTA
CCAAAGGGTG ATGCTTCCAT ATGTTACGCT CGCGCAGAGT CGGTAGACCT CGCTGTCACG
CTGTTGGACG AAGCGCCCTT TCGTCCGTCG GTTCGGTCGG ATGCCTGCGT ACAATATGTC
CTGCACGTTG AACGAGCTAA ATTTGAACAG CGTGGTCGGG TGTTTGACGA CGGTCGGCAG
CGTGTTTCAC TCGCCAAACG CAAGGTCGCC AAACTAGCGG CGGTGCAGGC CACGGACTGG
GACGAAGGGG AATTTAACGG CCGTCTGACG GGTGGGCGGA AGGGCTTGCG CATCGTTGTT
CTTAAGCATT TGTTCGATCC TTCTGTACTA TCCGCAAACG AGGAAGATGG TATGCTAGCC
GTATTGGAGC GTGATTTACG AAAGGAATGC GAGCAATGGG GTGTAGTGGA AAAGATCACC
ATATTTTCGA AAAATTTGCA GGGCGTCGTG GTGGTCAAGT TTGCTCAGCC GGGGTCTGCT
AGCGACGCAA TTAAGCACTT GGACGGGCTA GAATGGCCTA CTGGCTCGTC CAAGCGTCGT
GTACATGCCA CTTTTTGGGA CGGCGTCACC GACTTTACTG TACGAAATGA AATTAAGGAG
CAAGAAGAAG CCGAAAAACG TCAAAAAGAG TTTGGCAACT GGCTAGAAAA GCAGGAGCTA
CCCGAAGAGC TGCGTCTAAG GATAACTGAT TAAACGAAGT ATCAGTTTTC ATTGAAATGA
TATTGCTTTT GTCA
 
Protein sequence
MGQGLPRTGM NTGGNVQSAP SKRWVDWGTP ATTMVVTLPF RFASNRLFFP VVTMFDRSSR 
CNYWFTNFWS SLAFRSRQPG TVAPLGCTTD CAKAIGTLDG IDRILRSFGG PPITDYIRIL
GSSLTMTDGS REIAPSSAEA MLDTLYLYQN PTTGEVSTTA RWTSRQLCRL LCPSTSTAIL
PQHLTLDTQI LRLNTDGSYA NTGWQAAKTA PIVRQAVEIW YYEQDGAVQG PVSSRQLATL
YYDCPVVLYP TSRVYSESTP SWTPIQSLPL LQLALEALRP NGVNSLGTTQ DTPTYDPGFL
AFPSNTKVSE KEYDEIPKEA KDELEVFLQS TAIIGGSRIT EDEEDETYES DNGTRYVKDP
RTGNWIHEAL APKQPHKKES NEAKSSSHLQ TASAHPPKKR KKAKFAAKNS KCWIYVTGLP
PDCTEEEIAS IFCKAGIIDL DPETQQPKIK IYLDQASGLP KGDASICYAR AESVDLAVTL
LDEAPFRPSV RSDACVQYVL HVERAKFEQR GRVFDDGRQR VSLAKRKVAK LAAVQATDWD
EGEFNGRLTG GRKGLRIVVL KHLFDPSVLS ANEEDGMLAV LERDLRKECE QWGVVEKITI
FSKNLQGVVV VKFAQPGSAS DAIKHLDGLE WPTGSSKRRV HATFWDGVTD FTVRNEIKEQ
EEAEKRQKEF GNWLEKQELP EELRLRITD