Gene PHATRDRAFT_47275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47275 
Symbol 
ID7202360 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp156891 
End bp160633 
Gene Length3743 bp 
Protein Length1055 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181494 
Protein GI219122318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.328258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAATC CGTTCGATGA CTCGTTTCAA GGAGACAATG GACCGACTGT TGGGCCGCCG 
AACCCTTTTG ACGCGGACGA CGCGTTGGCC AGCTCCTCGG ACGTTGTCTC CACAAACGAT
CTCGACGAAG ATGAATATGC AGCAGTCGAA TCAACTTGGA AGTATCTGAA AGACCTTCCT
TATCGACAAA TTCCCATTTA CTCGAACGTG CTTTGGGAGT TAGACCCCGA GGACGAGGAC
TGGTTTGCGT ATGGTTTGGA ACGCTACCCT TCTTCGGCTT TGAATCCTTC CTGGCCTCGG
AGTGAACGTA TGACCTTGCT CCGCAAAACC ACGACGACGA AAGTTTCTGG CTGTCCCTAC
GGAGGACCTT TGGCATCGAT TACGACACCG GTCATGTCTA CGCCTACCTT TGCAAAGACT
CAAATTACGA TCTGGACGAA TGCTGGCAAA GTCTTGACAC GGATTCCGTT CCCTCCCCAA
TCGCATGCCA ACTACTCGCC CTCACTAATC ACGACCATTG GTTTTACTTC CCGCGCGCAG
TTGGTTGTTG TCTTACAGGA TTCCCTGTGT CTGACGTACA ATCTTCGAGG CGAACCAATT
TTAGCGCCCT TCTTTATTCT CCAGCAACCA TCACAAGGGA AGGCAATTTC AGTTACCCAA
GCGGTCGTCT TTGCTGGAGG TGTCGCCGTG TTGGCACAAA ACCAATCCTG CGCTTTAGTA
GAGCTCTTGG ATGAACACGA TAATTTGTCC TACTCAGCCT CGGCACCGCT CTCTGCTCGT
AAGGTAACAT TCGACACCAA CCAACACACC GACGTAACCT CTTCTGCCGA CGGTATTTTT
GCTCTCGTCA CGCCGCTCGA AACGGCCGAA TTTAGTCGAG CACACGGGCT TTCCTACTTG
ACCTTGGCCG TTCTACCTCG GCATTGCACA AGCCATGGGC ATCCTGAAGT GTTTATTTCC
ACTATATGCC ATTCCGTGGT GGTGGTGGAA GCTCGCGACG GCTCTATGAC AGACCTAGAT
TGCCGAGCCC GTATGGTAGC GCCGTTAACA CACATGGCGT TCGCACCGAA TGGTAGATTT
TTGGCCTGTT TCACGACCTC CGGGGTGTTG ACAGTAGTGT CTACGGACTT TGATGTTATG
GTATTAGACT TCGACGCTTC ACACAGTCGC GAAACCTCCT CTTCGAGTTC CCAGCCACCT
TTGGATATGC AATGGTGTGG AGAAGATAGG TATGGTACAT TTGTGCTCTC GCTCATTGTT
GGAATAAGAA TTTACATTTT TTACACTGAC TGAATTGCTC TTGCGCTTCC TTAAAAGTGT
TGTGTTGCAC TTTAAAAACT TGGGGGTGCT GATGGTGGGC CCTTACGGAG ACTGGCTACG
CTTCCCATAC AACGATACCT CTGATCAAGT ATACCTCGTT CCGGAAGCAG ATGGCTGCCG
TGTTGTGACT GAGAGCCGTG TCGAGATGCT GCAGCGTGTC CCCCCGGCGA CGGCATTGAT
CCTACGGATG GGATCGATTG AACCAGCCGC GATTTTATTA GACGCCGCAG ATGCGTTCTA
TTCCAAGTCC ACGGTCACAG TCCTGAATAG TGACGAGATG GTACAAGGCA TGGTCGAACA
AGGGACTTTA AATGCGGCAA TTACCAGCTG TTTTGAGGCT GCCACGAATG AGTTTGATAT
TTTTACACAA AAACGTTTGC TGAGAGCGGC ATCTTTCGGT ATGCATATTT CAGATAAGAA
ACAGGTCAAC GAAGAACGTA TGATTGTTGG GGGATCGACT ACAATTTCGG AAGCAGAACA
AGACGGAGAA GACTGCGAAA ACCAGGATCC TTACAGTTTG CCATCGAGAG TCACACGTCG
TTTTGTGGAA AGCTCCCGTA AGCTTCGTGT CTTGAACGCA CTCCGACATC CGCTGGTTGG
CGTTGTGATG ACATTTCCAC AGTGGCAGAG CATCGGGGCA ATTGGTGTCG TTGCTCGCTT
GGTAGCAATG AATCGCCCGG AGCTGGCCAC TTCAATCTGT GATTACCTAG CTTTACCCAA
ATCAATTCAG CTATTTGCGC GAGCGTCCAA GGCGTCTTCG TTTGTGGAGC AAAAGGCACA
GGCTGATGAG CACTTATCAG ATTCAGAGAT CGCCCAGGGC GCCATTATGA TAATTACAAA
AGAGGTGGTT TCATCGGCTG TATCGCCCGG GGCTTCAGCG AGTATGTTTC GAGGAGCCTA
CGCGACAGTG GCTCTTGCCG CCAACAAAGT CAACAAGCCA GGCGTTGCAA ATCTCTTGCT
CATGTTGGAG TCCAGTGTTG CCGATAAAGT ACCAGCTCTC ATTGCGGGTG GCTCTTACGC
TGACGCTATT GCAGTTGCTA CCACCGCCAG GTACGAGCTA ACTTATGCCT AATTATTTTC
TTCGCTTTCT GATGCTCTCA CAACTCACAT CCCGTTTATT CTGAAGGGAT GCGGACTTCA
TTTTTTCCAC TCTCATGGAT TTCGAAAGAA ACTGTATGAT AGCTGCGTCG CCGACAGATC
TCTCGCAGGC TCAATCAGCA TTCTTGTCCA CCGTTGTCGG TAAATTTACA CTTGAGGCAT
TTCATACGCT CCGGCGCTAT CTACGTAGTA CATCAGATAT ACAAAGGGTA CTCAACCTTT
TGCTCAGAGG GCAAAAGTTC TCCTGGGCTG GTCGGGAAAT GGCGCAAAAG GCGCTCGTCG
AGGTTGATGT TCGAGAGAAG CAAGGAATGT TAGCAGTAAG TATGAACTTT GTACGGTGTC
GATGGTAAGC AATAGCTTGA AAGTCAACAA ATTCCCTCAT AGTTTTCTCG TTACAGGAAG
CTTCCCGCAT TTTTGGTATA AGCAAAGAAA CTGCTTTCCA AAAATCGTGC ACAGATGATT
ATTTGGATTT GAGAAAGGAT CAAGAAGTTT TACGTAATAA GTATGGCTCT GTCGACGTTG
CCCCGGAAAG CTCGTCGGTG ACGGCGACAA TTTCGTCTAT TGTAAAGTTT GCTGCAAGTA
ACATACGAGA ACAGCACCGT CTACTGGCAG ATGCGGACAA GGTGGCGAAG AAATTTCGAG
TTGCTGAGAA GCGTTTGTGG CATATCAAAG TAATTGCTTT TGCGGCCAGT GAGCAATGGA
GTAATTTACG TATTCTTGCA GATTCCCGGG CGAAACCACC AATTGGATAC AAACCGTTTG
CACGAGCCGT TATTGATGGA AACCAAAATA GCAGCGAGAT TCTTCGGTAT ACTGAAAGAA
TTTCTGATCT CGAGGAGCGG TACGACATGC TCTGCTATGG TCAGCTTTGG AGCAATGCAT
TGGACGAAGC TTTTAAGATG AAGGACACCC GGCGCATTTT GAATGTGAAG AATCTGTGCA
ATTCTGCCGA CATCCAAATT AAGGCAGACC AATTAATGGG CCGTCTTGCC TAATTGAAAA
CGGTCAAGAT TCCCTAATCT GTCACAACTG CTTCTTTCCA GTTGAACCAT TCCTTGACCC
CCAAAACGCA CAGGCTCATG CTACTGTATA TCGGTTCTGT GTCTCCGAGA ACAAAGTGAT
ACTATCTGTA ATGTAAAACA GATTGTACGG ACACCATCGA TTAGTTTACA GTTATCAACA
ACTGTGAGCA AAGAACCGTC TCCCTTCCGA CCATTCCCTG ATTGCGGAAC TTACCTTGTA
ACGAAACACC TACTTGTAAA TCCCTCTCTG GCTAGTAGTA TTGTTTGCCT TACTGTTAGA
GGAGCCATTT TGGAAAACCG AAC
 
Protein sequence
MSNPFDDSFQ GDNGPTVGPP NPFDADDALA SSSDVVSTND LDEDEYAAVE STWKYLKDLP 
YRQIPIYSNV LWELDPEDED WFAYGLERYP SSALNPSWPR SERMTLLRKT TTTKVSGCPY
GGPLASITTP VMSTPTFAKT QITIWTNAGK VLTRIPFPPQ SHANYSPSLI TTIGFTSRAQ
LVVVLQDSLC LTYNLRGEPI LAPFFILQQP SQGKAISVTQ AVVFAGGVAV LAQNQSCALV
ELLDEHDNLS YSASAPLSAR KVTFDTNQHT DVTSSADGIF ALVTPLETAE FSRAHGLSYL
TLAVLPRHCT SHGHPEVFIS TICHSVVVVE ARDGSMTDLD CRARMVAPLT HMAFAPNGRF
LACFTTSGVL TVVSTDFDVM VLDFDASHSR ETSSSSSQPP LDMQWCGEDS VVLHFKNLGV
LMVGPYGDWL RFPYNDTSDQ VYLVPEADGC RVVTESRVEM LQRVPPATAL ILRMGSIEPA
AILLDAADAF YSKSTVTVLN SDEMVQGMVE QGTLNAAITS CFEAATNEFD IFTQKRLLRA
ASFGMHISDK KQVNEERMIV GGSTTISEAE QDGEDCENQD PYSLPSRVTR RFVESSRKLR
VLNALRHPLV GVVMTFPQWQ SIGAIGVVAR LVAMNRPELA TSICDYLALP KSIQLFARAS
KASSFVEQKA QADEHLSDSE IAQGAIMIIT KEVVSSAVSP GASASMFRGA YATVALAANK
VNKPGVANLL LMLESSVADK VPALIAGGSY ADAIAVATTA RDADFIFSTL MDFERNCMIA
ASPTDLSQAQ SAFLSTVVGK FTLEAFHTLR RYLRSTSDIQ RVLNLLLRGQ KFSWAGREMA
QKALVEVDVR EKQGMLAEAS RIFGISKETA FQKSCTDDYL DLRKDQEVLR NKYGSVDVAP
ESSSVTATIS SIVKFAASNI REQHRLLADA DKVAKKFRVA EKRLWHIKVI AFAASEQWSN
LRILADSRAK PPIGYKPFAR AVIDGNQNSS EILRYTERIS DLEERYDMLC YGQLWSNALD
EAFKMKDTRR ILNVKNLCNS ADIQIKADQL MGRLA