Gene PHATRDRAFT_47412 details


Gene Information

Locus tag: PHATRDRAFT_47412
Symbol:
ID: 7202449
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 562545
End bp: 566608
Gene Length: 4064 bp
Protein Length: 1256 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181753
Protein GI: 219122855
COG category:
COG ID:
TIGRFAM ID:
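
As a quick consistency check, the gene length can be recomputed from the start and end positions listed in this record. The Python sketch below assumes the coordinates are 1-based and inclusive, which the record itself does not state; `span_length` is a hypothetical helper name used only for illustration.

```python
# Coordinates copied from the record. Treating them as 1-based and
# inclusive is an assumption, not something the record states.
start_bp = 562545
end_bp = 566608


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 4064
```

Under that assumption the result agrees with the listed gene length of 4064 bp.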


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACAATCGAG TGATCTACGG CTTGTTCCTT TTACAGTAAC AGCTTCTGAT CCATTCTACT 
TTGCAAGCAA TTCCATTGCT CTCTACGAAT CAAAATAAGA TAGTTTCTGC AGTTAGTGCA
TGTCTTTTGT TCCCAACCCG ACGCAGACTA ATGCGTCTTC TACCCCCAAC AACAATGGTA
CTAGTACCAG CGCCAACCAG GTAGTTGCCG TTGCCGCTGG GGCACCCGCC CCAGAGGTCT
CCGACCATGA TCTCATCTTC AAGATCCGTA CTTTCGATGT CTTTTGGGTT GACAAGACGG
CGCTCATGTA CAAGGATGGT ATTGCCAAGG TACGTTCGGG CTTTGGCTAC AGGGGGGCGT
TCTCGTTTCC TACGATTGCC CGTGCTAATC TAGCCGCAAC GAATTTGTAT TTCCGCTTTT
AGACGTTCGA CAAGGATACG CGCAAGGCAT TTATTACCAT CAAGCAGCGT TCTTCCATTC
CTATCATTCT CCGCGATATG GCGCCACCGG AGGGTTCCAT CTTTGACCCT GCCAAAAAGT
GGGATGAGGA AAAGTATCGC CGTCTCCGTG CGACCCAGGA AGCGGCGGAA GAGGAAGCTG
CCGCTGCGGC GGCCGAAAAG GACAAGAAGA AAAAGAAGGG CGGCGAGAAG AAAAAGTCGA
CCGCTGATTC CATCAAAGAA AAGAACGCGG TGGAGAAGCT CAAGAAAGAC CACGAACGCG
ATTTGCAAAA GCTCTCCAAC CTCGGTACCC TACGGGCTCT CCAAGATACA TCGTGCGAAA
CAACCAACGG AAAGATTCGC CGGATGATGA AAATGCTTCA CATCGCCGTC TCCGACCTCC
GTAAAGGTGA AGTTCATTCG AGCGAGGCCG AAGTTCTCGA CATTCTCTGG GCATTGGAAG
AAATGTCATC CTTTCAGGCG GCGATGAACG AGATCTCGGA CGAAAAGTTT GCCAAGAAGG
AAGCGAAAGA ATCCGACAAG TCAGACAAAA AGTCTTCTAC AAAAAAAGAG ATTAAGAAGG
ACAAAAAACC GACAAAGAAG GATCCTAAAA CCGAAAAGAC GGACAAGAAA GTCGAGAAGG
TTGTTCTTAG TACCGATGCA CAGACATTGA AGAATCTACT GAAGGAAGGC TACAAGGACT
CGCTAAAATA CGCCCGTGGC CTCCTGAACG GGAAAGAAAG CTTGATTTCC TTTCAGTTGA
TGGAAATGCC GGATCGTCTT CCCCCTTTGT CTCGGTACAA CCGTCAATTC AAACTTGAAG
ATTGGCAATG TGATATTCTC CAGGCCATTG ATGATCGCCA GTCTGCCGTT GTTTGTGCGC
CAACATCATC CGGAAAGACT TTGCTGAGCA CGTATACATG TAAGAACGCC AAAGGTACAG
TCCTTTTTGT TCTTCCCAGT GAAGTCCTGG TGTGGCAAGT TGCTTCCACG TATTATCAGT
TCTTCAAAGG AAATGTCACG GTATGCACGG ATCAGATAAC CTTCCAAGAA GTCACCGGTG
ATGCACAGGT GTATATTGGA ACTCCCAAAG CTTTAGAACG AAGCATCAGC AAAGTTCGAG
GTGTTGCTGG AGAGGAAATG ACAAAAGGTG AACGAGAGTT CATGGTATTG GATGGTGGCT
ATCATCAATT TGATTATTTA GTTCTTGACG AGGTCCATAC CTTGAATGGG CCTGAAGGTG
ATGCTCTTCA GCGCATCATT AAGGCTTGTA ACTGCCCGGT GCTAGCCTTG AGTGCCACTA
TTGGAAATGC GAATCAGCTT CGAAACTGGT TTAGTACCGT TCGCAACGAT CATATCGACG
CTTTGGCTGT TGATGTTCCC GAAAAGCCCG AAGAAGTCAT TCTAAAGGAG CACTTCGCCC
GCTTCATAAA CTTACAGCGC TATGTTATTG CCGAAGCCGA AGGGAAAGAT GGGAAACTTA
AGCAAAAGCT TGTCAAGCTG CATCCGGTTG CGGCCATGAC TCCTGATCGT CTCAAGAATC
AGCCCGAATT GATTGGCGGT CTTTCCATGA CTCCGTCTGA TCTCATGGAT TTATGGAAGC
GCATGCGGGC GATCTTCCCC TCCACTGTCC TCGAAGATCT TGACGACCCG GACAAATTCT
TTGCTCAATA CGTTGATGAA AGCAAACGTG TCACGTTGAA TCAGACAAAG GAATACGAGA
CTCGTCTAAA GACACGCCTT GCGGTGCTGG CTGAATCCCA TCCCGAACTC TTTGAACGCC
TCCGTAAGGC TCAGCTCCCC CCACCATTGA CTGCGAAGGA GGATGTTAGT GACACTTTGT
ACGGTATTCT CGAACAGCTG AAACAGAGCG AATTGCTTCC TGCTGTTGCT TTTCAACTGA
ATACGTATGG AGCATTTAAC ATGTTCAAGA CTCTCTTACA TCGCTTAGAG AGTGAGCAGG
TAGCAGAATT TCCGAGCTAT CGCAGGGACT TGATTGAACG AGCTAAGAAA AATGCGCAGA
TGCGAAAAGT TGCTGCTGGA AAGGCCAGCC GCGAAAACGC AGCCGAGGCA GAGGAAGAAG
CGAAACAAGG GTTTGTGGAT GAGTCGCTTC TCCAAGAGGA TACAACGAAA CCACACGACA
AATACGTTCT AGCTGCTGCA GGAAAACGTC TCGGTTACAA CGAGGTCGAA GATATCATTG
CCGACATGAA GAAATCTGGC GAACGCGTCG ATATCAACCA CGCTTTGATT CGCGGTCTTC
GTCGTGGAAT TGCCATCTAT ACAAACGAAG TTGGATTCTC ATGCTATCGC CGCCAAGTGC
AGATGCTTGC TCAAAAGGGC CGGTTAGCGG TTGTATTCTC TGACGATGCC CTCGCTTACG
GTGTAAACAT GCCGTTTCGG ACTTGCATTT TCTGTGGTGA TATGGGGGAC GCTCTTACGC
CTCTTATTGC GCAGCAGATG CAAGGTCGAG CTGGTCGACG TGGCTTGGAC GTGCAGGGTA
ATATCTTATA CCTCGGAATG GACTGGCCAT ACATTGAAAA CCTGATGCTG GGACAGATCA
GTCAAGTTAC CGGTAAGGAA CCTCGTTATC CTTTGATTGT TCTCCAGCAA GCTCTCGCGG
CTTCCAACGA TCCTGATGAT ACAAAGCACT TCATCCATGA CGACGGAGTA TCTCCTTTTG
CCAATGCAGT ACGTCGCATT CAGAGGAGCC AGCATTGTTT CCCGACGGTG ACGGAAGAGC
AGATGGATTG GATGTGTAAG ACGAGCCTCG AAGACTTCTG CAAAGGGGCT GAGACAGATC
ACTATCGCGA TATGTCGACG ACTGTCGTGA AAGGGCTTGG ATATGTGCAC GATGACCTTA
CGTTGGCAAT GGATCACAAC GTACTTTGCA TGGTTTGGGA GCTTCACGAA TATTTGCCTG
AGGCTGTACA TCTGTGTGCC GTACTCGAAC AAATGTACAT TCGTTTCTGT TACAACAAGA
CGAAAAGCTA CAAGGAAAGT GACTCGACCC AGAATGAATT TTTGTCTGTT CTTCTTCATG
TCGTGGACCG TGTTCCAGCG AAAGAAGGGG AAGAATCCCT CCAGCAGATC CTACGTGTTT
CACAATCTGA TGATGGTAAA GCGCTGAATG AAGAAGCCCG AGCAATGTGG CTTGAAACTG
AGAAGATCCT ATGTGAGCAA CAGGCTTTGA TAGACAGTCT GAATGTTGAT GAAAGTGAGA
AAAACAAGCT TGGCCTCATC ATTCCTGCAG CAAATGAAGT AGACAATGCT GGTGTTCCTT
TGGATAAAGG CGTCTACGAA ATGCTCGTTC TGAAGCAGAA GGGTTTCTCT GAAGAACAAA
GTACATCGCG CAGAAACGAA TTGAAGAATC GTATTGTGCG TCTGGGTCAA ATTTGCCAGA
TTACTCACAA CTCTATTCAG CAACCACACG GAAAATATGA CGCTCTTGAG GTGCACTTTC
GTCGTATGTT TTCTAACATC AAGTACAGCG TTGCGGATAT GATGAATCAA ATCTCGAACC
AGGAAGATTT GACGGAAGTC TAATCGGAAA GCTTGCTTCC CTAGTAGATA GCATTAGTGA
CCCTATTGAC TGCAAATGTA TATAATATTC TCTTTAGTAC TTGC
 
Protein sequence
MSFVPNPTQT NASSTPNNNG TSTSANQVVA VAAGAPAPEV SDHDLIFKIR TFDVFWVDKT 
ALMYKDGIAK TFDKDTRKAF ITIKQRSSIP IILRDMAPPE GSIFDPAKKW DEEKYRRLRA
TQEAAEEEAA AAAAEKDKKK KKGGEKKKST ADSIKEKNAV EKLKKDHERD LQKLSNLGTL
RALQDTSCET TNGKIRRMMK MLHIAVSDLR KGEVHSSEAE VLDILWALEE MSSFQAAMNE
ISDEKFAKKE AKESDKSDKK SSTKKEIKKD KKPTKKDPKT EKTDKKVEKV VLSTDAQTLK
NLLKEGYKDS LKYARGLLNG KESLISFQLM EMPDRLPPLS RYNRQFKLED WQCDILQAID
DRQSAVVCAP TSSGKTLLST YTCKNAKGTV LFVLPSEVLV WQVASTYYQF FKGNVTVCTD
QITFQEVTGD AQVYIGTPKA LERSISKVRG VAGEEMTKGE REFMVLDGGY HQFDYLVLDE
VHTLNGPEGD ALQRIIKACN CPVLALSATI GNANQLRNWF STVRNDHIDA LAVDVPEKPE
EVILKEHFAR FINLQRYVIA EAEGKDGKLK QKLVKLHPVA AMTPDRLKNQ PELIGGLSMT
PSDLMDLWKR MRAIFPSTVL EDLDDPDKFF AQYVDESKRV TLNQTKEYET RLKTRLAVLA
ESHPELFERL RKAQLPPPLT AKEDVSDTLY GILEQLKQSE LLPAVAFQLN TYGAFNMFKT
LLHRLESEQV AEFPSYRRDL IERAKKNAQM RKVAAGKASR ENAAEAEEEA KQGFVDESLL
QEDTTKPHDK YVLAAAGKRL GYNEVEDIIA DMKKSGERVD INHALIRGLR RGIAIYTNEV
GFSCYRRQVQ MLAQKGRLAV VFSDDALAYG VNMPFRTCIF CGDMGDALTP LIAQQMQGRA
GRRGLDVQGN ILYLGMDWPY IENLMLGQIS QVTGKEPRYP LIVLQQALAA SNDPDDTKHF
IHDDGVSPFA NAVRRIQRSQ HCFPTVTEEQ MDWMCKTSLE DFCKGAETDH YRDMSTTVVK
GLGYVHDDLT LAMDHNVLCM VWELHEYLPE AVHLCAVLEQ MYIRFCYNKT KSYKESDSTQ
NEFLSVLLHV VDRVPAKEGE ESLQQILRVS QSDDGKALNE EARAMWLETE KILCEQQALI
DSLNVDESEK NKLGLIIPAA NEVDNAGVPL DKGVYEMLVL KQKGFSEEQS TSRRNELKNR
IVRLGQICQI THNSIQQPHG KYDALEVHFR RMFSNIKYSV ADMMNQISNQ EDLTEV
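
The sequences in this record are printed in space-separated blocks of ten residues. The Python sketch below shows one way to turn such a listing back into a contiguous string and to estimate GC content for a nucleotide sequence. The function names `join_blocks` and `gc_fraction` are illustrative only and do not belong to any database tool.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence listing."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "CACAATCGAG TGATCTACGG CTTGTTCCTT TTACAGTAAC AGCTTCTGAT CCATTCTACT"
seq = join_blocks(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence text through `join_blocks` and then `gc_fraction` would give a value to compare with the GC content listed in the record.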