Gene PHATRDRAFT_43447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43447 
Symbol 
ID7197161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp476673 
End bp478688 
Gene Length2016 bp 
Protein Length671 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177627 
Protein GI219111751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCCG CTGCCAAGAA TACGTCCTCA AGGAGTCACC ATCCCAAAAA GCAACGCGTC 
TCGGCCAAGC ACACGGCACA TGATCAACCC CCACGAATGG CACCCACCGA AAAATTGGAA
CAAGGACGTC ATCAACACCA ACACCACCCT ATCGTCGAAC ACGACGATGA CCTATCCCAC
GCAACGTCTG ATCGAAACAC TGGGACGTCC TTGTCCGAGA CTACCAAAGC GACCCGTACA
GTGTCGGAAA CGTGTGATTA TATTGCCGAG TTGAGTGAAG CCATACTGGA ACAACCGGAC
AAGGCCTTTA TCAGCAGCGA AATTCCCAAT CCGGCCAATC CGCGGTATCC CAAACAGGGG
CCGTCCAAAA TGAAGCAGTT GCTCGTTCTC GCCAACGCAT CCGTAGTGCC TCGGCACAAC
AACCACAACA CCGATAACGA CGATCCGTCG TCCCACAGCG CGTATACCTC ACAGCTAGCG
ACAATGTCGT TGCTCGCCAT TTTTCGAGAC ATTCTGCCCT CCTACCGGAT CAAGCTCCCG
ACCACGCAAC AAGCGGCCGT CAAAGTTTCG AAAGAAACCA AAGTACTTTG GGATTACGAA
CGTGCACTCC TGCAATCCTA CCAGGAATAC CTACAAATCC TCGAACACTG TTGGGATGCC
ACCCGCACTG CTCCGCATCC GTCCCAACTA GGGGTCACGA GTATCCTCAG TCTCTGCGAA
CTCCTCAAAT CGGCGTTTCA TTTCAATTTC CGCTCCAACC TACTCACGGT CGTGAGTCGC
CATACGAATC ATCCCAGTAC CGTGGTCGGC GATGCGTGCT GCGCGGCCAT AGCCTACGTC
TTTGCGCACG ATGCACAGGG CGAAGTTGCG CTCGAAGCTA CCCGGCTGCT GGCCAAGTTC
GTCAAAGATC GGGCCTTTAA AATTCGACCC TCCGTTCTCC GGACCTTTAC CAGTCTACCC
CTCCGCGTGC ACGTGGACGA AGCCCAAGCG GCGAAACTGG CGGCCGCCGC CAACGCCAAG
AAACGCAAAA AAGACAAGGA ACTCGCCGAA ATTGACGCCG AACTCAAAGA AAGTGACGCC
AAGGTGGACA AGATTATACT CGCACGGTGC CAATCGGAAA CGCTTCAACA CGTTACGCTT
ACGTACTTTC GGATTCTGAA GCACGATAAT TTGCAAGCGG CACACGTCGA GACTCTGTTG
CCGGCCGCGC TGGAGGGTTT GGCCAAGTTT GCTCATCTCA TCAATATTGA TACCGTCATG
GATTTACTCG GCGTTTTGAA GGATTTGCTG AAAAAGATGA ACGCACTACC TCTGGAGGCC
GCGCTCAATT GCATTTTGAC AGCGTTTCAA ACCTTGCAGG GGCCGGGGAA GGAAATGAAC
ATTGACGTCA AGGAATACAT TGTTCCGCTC TATACTCAAT TACCGCGTCT GGTGGGGGAC
GTTAATTGTC GTCGGCACTT GCCCACGGTA CTGCTCTGCT TGAATGCCGC CTTTATCAAA
CGCCGTGAAT ACTCAACGAT TCGAGTTTCT GCCTTTTGGA AACAAATCCT GACCGTTTCC
TTGCACGTAC CTCCGCACAC GGCGGTTCCG TTGATAGCCT TTGGACGGCA ACTTCTCCAA
CGATATCCCG TCACACACCA GATGCTGGAA AATGAACAAG ACGTGATTAC GTCGGGAGAG
TATACACCCG ACGTGGAGGA TCCCGAGCAC AGCAATCCTT TGGCCACGTC GGCCTGGGAA
TTAGCCTTGG CCAAATTCCA CGTGCACCTT TCGGTTGTTC AGCAAGCACA GAGTACCGCA
ACGTTAAGGC TACCCAATCT CCCGACCGAG AGTCCCGAAC GCTTGTACCA GGAACTGTTT
CGTGCGGAGG ACGAGCTCTT TTTCTCCTTC CAGCGTGTGC GTAAAAAGCA TCCGTTGACA
CCGCCGAAGC AGGATGGTAG CAAGAAACGG AAGCAGTACC GTTTCCTCAC GCCGCGGGCG
ACGGAATCAT TCTTGTTGAA AGCGAACGCA TTGTAG
 
Protein sequence
MGAAAKNTSS RSHHPKKQRV SAKHTAHDQP PRMAPTEKLE QGRHQHQHHP IVEHDDDLSH 
ATSDRNTGTS LSETTKATRT VSETCDYIAE LSEAILEQPD KAFISSEIPN PANPRYPKQG
PSKMKQLLVL ANASVVPRHN NHNTDNDDPS SHSAYTSQLA TMSLLAIFRD ILPSYRIKLP
TTQQAAVKVS KETKVLWDYE RALLQSYQEY LQILEHCWDA TRTAPHPSQL GVTSILSLCE
LLKSAFHFNF RSNLLTVVSR HTNHPSTVVG DACCAAIAYV FAHDAQGEVA LEATRLLAKF
VKDRAFKIRP SVLRTFTSLP LRVHVDEAQA AKLAAAANAK KRKKDKELAE IDAELKESDA
KVDKIILARC QSETLQHVTL TYFRILKHDN LQAAHVETLL PAALEGLAKF AHLINIDTVM
DLLGVLKDLL KKMNALPLEA ALNCILTAFQ TLQGPGKEMN IDVKEYIVPL YTQLPRLVGD
VNCRRHLPTV LLCLNAAFIK RREYSTIRVS AFWKQILTVS LHVPPHTAVP LIAFGRQLLQ
RYPVTHQMLE NEQDVITSGE YTPDVEDPEH SNPLATSAWE LALAKFHVHL SVVQQAQSTA
TLRLPNLPTE SPERLYQELF RAEDELFFSF QRVRKKHPLT PPKQDGSKKR KQYRFLTPRA
TESFLLKANA L