Gene PHATRDRAFT_40634 details

Gene Information

Locus tag: PHATRDRAFT_40634
Symbol:
ID: 7198563
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 22062
End bp: 25263
Gene Length: 3202 bp
Protein Length: 1038 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184717
Protein GI: 219129062
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0207387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGACG CTGTGTTCTT CTCTATCGTA GCTCTCGCCA AGGCGGGCAT TGAGAGCTGT 
CGAAACGCTC AAATATGCCA GGATGAGGCA GGCCGCATCG GCAAGCGTCT GACGATAGTG
GTCGCGCGAG CGCATGAATG GGGAGCGGTT TGTGCAAGTG CGCGCCTGAT TCATTTCCAC
GAAGTCGTGG AGAATGTCTT CTTGTGCTTA CAAGCCGTTA CATCGCCTAG AAGCAAGCGG
TCCTCATGGA ACAAAATGTT CAAGTCTACG CTACAATCCC AAACTTTGCT CGACAAAATC
CTCGAAGCAG AGAGTCAGTT GAATACTGCT ATCAATGATC TACAGATGGA GCAATCCAAT
GCCATCTTTT CACAGCTGGT TGACGTCTCA AAAGGAGTTG CAGAATTGCT CGACCAGTTT
GGCACTCTTG CAATGAGCAA ATCGAATCCT TCCGTGACAG TACAGCAGCA ATTTGATAAG
GTCCTGGCGG ATGCACAGAC ACAAGCCCCC GAAGTCGCCG TTTCTATCCC CAGTGACCGA
ATCCACCATC CCGTACAAGA GTACAACTCT CTCGACTGTG TGGGAGATGA GGTTGCCTTA
TCTCCACAGC AACAAAAGAT AGCGTTTCGT CCATCCAAAG AGGATGTGCT TGCTATCTCG
CTCAAGGCAT CATTGCTGGA GTTTAGCGAT GACCAAAAGA ATCTTCTGGG CGGTGGAGGA
TTTGCGGAAG TCTTTCGAGG GACCTACAAC CACCGGCCAG TTGCGGTCAA GCGCCTCAAG
GCGTACCATG GAGATGTAGC GTCTCTCTCT CTCTCTCAAA TCGCCCGTGA TGTGGAACGA
CTCGCCGCCG AAGCCATTTT GACGCACAAG TGCAGCAAGC ACTCCAATAT TATCCACGTT
ATTGGATGCA TCACCGTATT GAGTGAAGTC GAGAGACCTC TCATTGTCAT GGAGCTAATG
CATACAACAT TATTTGATGC TCTCCATGAT CGAAACCAAA AGGATGCTAT GGGATTTTCT
CGTCGGCTCT TTCTGTTAAA AGGTATTGCT GGAGCCTTAG AGTTTCTTCA TCTGCAAGGT
ATTGTTCACC ATGATATTAA GTCTCTGAAT ATCTTGCTGA ACAAACAATT GACAATTGCC
AAGTTGGCTG ACTTTGGAGA GTCGAAAGTA AAAGGCCTCC ACACCACGAA ACTCCGTCTC
AGTACAATTT TGGCCACAAC AAGCCACCAG GGCAACCAGA TAGCAGGTAC AGCTGCATAT
CAAGCACCAG AAATCCTCTC GGAAGAAGTG CTTGACATAT CACGCGTTTG TGAGATGTTT
TCGTTTGGGG TGACAGTATG GGAGTGCATG ACAAGCAAGA TTCCACATGG AGGGAAGAAA
GAATCATCCA TAGCACTTCT GGCAGCGACA AAGAAGCACT TGCCCATGCT CGTAGTCCCA
TCCAAACCAA AGGATCTCCC AGAGATAGAG ATGGTGTCCT GGAAAGCGCT CAAAATGGTT
GCCACATCAT GCCTCTCTCG TGATCGCTTG GTGAGACCTA CTGCTTCTGT GGTGGTGGCA
CTTTGGCATA AAGTAAAGTC TCCAGGGAAT GTTGAACCAC TGTCTTTCTC TCTCCAAAAC
CCACCTTCAG CAAAAAGTGG TGGCATTGGC CAGACTTGGC TGCCAACAAG TACTCAGGGA
TCAACAAGTC AAGACATTGT CTTTGATACG AAAGGCTATG AAGACGAGTC CAAAGCAGGA
TATACTACTT CCTCAAAGAA ACGTTGCTAC ATTAGACTGT CTGTCATTGC AAGTATTGTG
GTGTTGCTAG GAGTCATAGT GCTATTGGCC GTTATCCTGG TGCCCAGAAG TTCTCCTGAT
CCCTCGTCAC CGTTGCCGGC TCACCTGTCT TTTCAAACCA CACAGGAGTT GTATGATGCT
GTTGATGCTT ACGTTGGTAC AACCAGCCCC GTAGACTCCA CCGCAGCGAC TGTGTATGGA
TATCCTATTG GATCATGGGA TGTGTCACGA ATCTCCAACT TTTCTCAAGT TTTTGATGGA
TCAGCTCGGA ACAGCGCCAT TGGAATGTTT GATGAAGATC TGAGTAACTG GGATGTTTCG
GCCGCGTCAA CAATGCATTC GATGTTCAAT GGTGCTTATG CGTTCAATAG CAATCTGTCA
GCTTGGAATG TGAGTCGGGT AGCAGACATG AGTTTCATGT TTTGGGGCGC ATCGGCGTTT
AATGGGGATC TTTCATCATG GAGGGTTGAC CGGGTTGCAA GTATGGAGTC TATGTTTGAG
GGTGCAAGCT CTTTCAATGG TGATCTTGCA TTGTGGAATG TGAGTCAGGT AACAGACATG
AGTTTCATGT TTTGGGGCGC ATCCTCCTTT AATGGAGACC TTTCAACATG GAGGGTAGAT
CAGGTTTCAA ATATGGAGTC AATGTTCTAC AATGCAAGTG CTTTCAACAG TGATCTTGCT
ATGTGGAATG TAGGACAGGT AACAGACATG AGTTTCATGT TTTGGGGTGC ATCTTCCTTC
AACGGGGATC TTTCATCATG GAGGGTAGAC AACGTTGCAA ATATGGAGTC TATGTTTTAC
AATACAAAGA CTTTCAATAG CGATGTCTCA GCATGGAATG TAGATCAGGT GATCAACATG
TCAAGTATGT TCCAGGCTGC ATCTGTCTTC AATGCTGACC TCTCATCCTG GAATGTAATG
CGGGTTACAA ACATGAGAGC TATGTTTGAG GAGGCAGGTG CCTTCAATGG CGATGTCTCA
ACATGGAATG TGGGCCAGGT AACAGACATG AGTTTTATGT TTTGGCATGC ATCCTCCTTT
AATGGGAACC TATTTTCATG GAGAGTAGAT CAGGTTGCAA GTATGGAGTC TATGTTTCAG
TTTGCAGCTG CCTTCAACGG CGACCTGTCA AGCTGGAATG TCAGCAAGGT AACTACCATG
CAAGAAATGT TTAATGGCGC ATCTTCATTT GAGGGCAACC TTTGTCCCTG GCTGGCTTGG
CTTCCTTTGG ATTGTAATGT TGATGGAATG TTCCTTGCTG CACAGTCCTG CACAGACACA
GCAGACCCTA TACTACCAGA TGGGCCAATG TGCAATACCT GCGCAACGTA AGGTAATTAT
ATTCAAATGC ATGAACCTAA GGGAGGTGAA TTTCCACAGA ATTATGACTA TACATGACTA
TATAAAAATT AGTCCCAAGT GA
 
Protein sequence
MADAVFFSIV ALAKAGIESC RNAQICQDEA GRIGKRLTIV VARAHEWGAV CASARLIHFH 
EVVENVFLCL QAVTSPRSKR SSWNKMFKST LQSQTLLDKI LEAESQLNTA INDLQMEQSN
AIFSQLVDVS KGVAELLDQF GTLAMSKSNP SVTVQQQFDK VLADAQTQAP EVAVSIPSDR
IHHPVQEYNS LDCVGDEVAL SPQQQKIAFR PSKEDVLAIS LKASLLEFSD DQKNLLGGGG
FAEVFRGTYN HRPVAVKRLK AYHGDVASLS LSQIARDVER LAAEAILTHK CSKHSNIIHV
IGCITVLSEV ERPLIVMELM HTTLFDALHD RNQKDAMGFS RRLFLLKGIA GALEFLHLQG
IVHHDIKSLN ILLNKQLTIA KLADFGESKV KGLHTTKLRL STILATTSHQ GNQIAGTAAY
QAPEILSEEV LDISRVCEMF SFGVTVWECM TSKIPHGGKK ESSIALLAAT KKHLPMLVVP
SKPKDLPEIE MVSWKALKMV ATSCLSRDRL VRPTASVVVA LWHKVKSPGN VEPLSFSLQN
PPSAKSGGIG QTWLPTSTQG STSQDIVFDT KGYEDESKAG YTTSSKKRCY IRLSVIASIV
VLLGVIVLLA VILVPRSSPD PSSPLPAHLS FQTTQELYDA VDAYVGTTSP VDSTAATVYG
YPIGSWDVSR ISNFSQVFDG SARNSAIGMF DEDLSNWDVS AASTMHSMFN GAYAFNSNLS
AWNVSRVADM SFMFWGASAF NGDLSSWRVD RVASMESMFE GASSFNGDLA LWNVSQVTDM
SFMFWGASSF NGDLSTWRVD QVSNMESMFY NASAFNSDLA MWNVGQVTDM SFMFWGASSF
NGDLSSWRVD NVANMESMFY NTKTFNSDVS AWNVDQVINM SSMFQAASVF NADLSSWNVM
RVTNMRAMFE EAGAFNGDVS TWNVGQVTDM SFMFWHASSF NGNLFSWRVD QVASMESMFQ
FAAAFNGDLS SWNVSKVTTM QEMFNGASSF EGNLCPWLAW LPLDCNVDGM FLAAQSCTDT
ADPILPDGPM CNTCATPK
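
The sequences above are printed in 10-character blocks separated by spaces, so they need to be joined before any analysis. The following is a minimal Python sketch, not part of the original record, of how the listed values could be checked: the helper names (clean_sequence, inclusive_length, gc_fraction) are hypothetical, and the sketch assumes Start bp and End bp are 1-based inclusive coordinates, which is consistent with the stated Gene Length (25263 - 22062 + 1 = 3202).

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(raw.split()).upper()


def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First printed line of the gene sequence, copied from the record above.
    first_line = "ATGGCGGACG CTGTGTTCTT CTCTATCGTA GCTCTCGCCA AGGCGGGCAT TGAGAGCTGT"
    print(len(clean_sequence(first_line)))   # number of bases on that line
    print(inclusive_length(22062, 25263))    # 3202, matching Gene Length
    print(round(gc_fraction(first_line), 2)) # GC fraction of that line only
```

Passing the full gene sequence to gc_fraction should give a value comparable to the GC content field, although how the source database rounds that figure is not stated in the record.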