Gene PHATRDRAFT_21548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21548 
Symbol 
ID7202417 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp405522 
End bp410298 
Gene Length4777 bp 
Protein Length1360 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181721 
Protein GI219122788 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.50837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCATTTGGAT TCTCGGGAAC ACATCCTCAC ATTCAGAATC CGTTTAGTAA GCATTAGATC 
CTGCCTTTCG CATGCAAAAC GAATGTCACG ACGGCAGTCT CACCGACCTT TATTGTTTGT
TCTCCACGCA TGGCTAAAGG CAAGTTGTCG CAAAAGACTC CAATAGAGGC GTACTACGTC
CATATCGCGT TTCATCATGC CGGCGGCCGA CGACTTGGAA GAAACCAATC ACTCGACGTC
CGCGGGAGGC AAAGCGAAGA AACCCAAGGC GAAGCAAGCT TCTGCTGTCG AAACGCTCCA
CTTCGCTTTC GACTGCGGGG TCCGTGTCAA GGTCCTCTTT TTTTTCGGCG TAATTGCAGG
AATTGCTAAC GGACTCGTTT ACCCGATCCT GGCGTGGCTC TTTTCTTCCT CCTTCTCCGA
CATTTCCGCC GCTTCTACCA ACGGTCTCAG TCAGATTCGC GAGCTCGCGT TTACGTTTCT
GATCGTGGGA GTCTACGCGC TGGTCTGTGC AACCATTCAA TCCTTCTGTT TTGAACTGGT
GGCGTACCAC GCGTCGCAGA ACTTTCGTCT CCAATGGTTC GGCGCGTTGC TGCGCCAGGA
CGCGGCATTT TTCGACGTTT ACGACGTCGG CGGTATCGCT GCTCAAGTGG GACCCAACGC
CAATAAGTAT CGACGCGGTA TGGGCCGTAA GTTTGGGGAG GGCGTTCAGT TTTTAACCAC
CGGCATCGGT GGAATTGGTT TTGCCTTTTT CGTTTCGTGG CGAATCGCGT TCGTGGTGCT
GTGCGTTATT CCCTTTGTGT CAGTCGCGGC ACTCATGGTG GTACAGCTAA ACCAGCAAAA
AGGCGCACGG GCGAGCAAAA GTTACAAACG CGCTGGTAGT GTTGCGTATT CCAGCGTTTC
CGCTATCAAA ACGGTGCTTT CCTTGAATGC TGTTCCGACA ATGCTCAAGC AATACTCCCA
GGCAACACAA GAGGCTTTTG CCGATGCCGT CAGTATTCTT CTCAAACAGG GTCTCGCAAA
CGGTACGTCC AGGAAATCGC GCTGCTATCA CATTTTTGCA GCCAGGGCCT GACTAACGGC
TTGATCCTTG AATTAGGTTC CATGTTGGGC GCTTTCTTGA TGCTGTACGC CATCTTGGCC
TTGTACGGTA GCGCGCTCCT GTACCGTGAT GTAGAGGATA CAGGATGCGA TCCGTCTGGC
GGAGTGAACG ACAACGCGAC CTGCCCCAAT AGTGGAAGCG ACGTATTCGG AGCGATGCTG
GGTGTCGCTT TTGCTGGTCA AGGTGTTTCC CAAGTCGGCA ACTTTTTCGA AGCCTTTGCG
GCTGCCCGAA TTGCTGCTTT TGAAGCCTAT TCGGCTATTC GACGTACCGC CGGGGCCCCG
GCCGAAACCA TTTACAAGGA AGACGATGTG GAAGATCTGA ACAGTACCGT GCACTCCCGT
AAATCAAAGA AGAGCGAGCC CGATGTGGAA TCTGCCGAAA GACCGATCAA GGCGATTCTA
CCGAAGTACG AAATCGATTC GACATCCGAT AAGGGAAAGA AACCGTCCGA CATTGCGGGT
ACGCTTGCGT TCAATGACGT ACGCTTCAAC TACCCCACCC GTCCCACGGA AGCTATTCTG
AAAGGTCTTT CTGTAGAAAT TGAAGCCGGC AAAATATCTG CTTTCTGTGG TCCCTCCGGC
GGTGGCAAGA GCACAGTTAT GTCTTTGATC GAACGCTTTT ACGATCCTCT GTCCGGTAGT
GTTTCTTTGG ATGGAGTGAA CTTGCGGGAT ATCAACGTGT CGCACCTTCG CAGCATGATT
GGGTACGTCG GGCAGGAGCC TACCTTGTTT GCGACTAGTA TTCGTGGAAA CATTCGTTTC
GGAAATCCCG ACGCGACCGA CGAGATGATT GAGAGCGCGG CTCGTATGGC GAATGCGCAC
GATTTCATCA TGTCGTTTTC GGATGGCTAC GACACGCAGG TAGGAGACCG AGGGAGCCAA
TTGTCCGGAG GTCAAAAGCA GCGTATCGCC ATTGGTAAGT TCTCCTGTTT CCACTCCGCA
AAATGGCATT CAAAACTTTT TCTCACAGGG TTTTGTTCTT GTTGTATCTC GTAGCGCGTG
TTCTGGTGCA CAATCCAAAA ATTCTATTGC TGGACGAAGC TACCAGTGCT TTGGATGCAG
AGTCAGAACT TGTGGTACAG GATGCTTTGG ACAAGATCTT GGAACAAAAG AATATCACCA
CCGTGATCAT TGCGCATCGG TTGTCGACCA TCCGCAACGC TGATGTGATC AATGTGGTTG
TCGGTGGAGT TGTTGCCGAG AAAGGTACTC ACGACGAGTT GATGGCGGGA GACACGTACT
ATCGTAAGCT AGTTGAGAAA CAGGAAGGCC AGGATAGAGC AGACACTGAC AGCTCCCCTG
GTACGTCTCG GAATAGTAGC TCGGTGGATT TGGTTCAGCT CGCAGAAACA TCCAAGGAGA
ACATGCGTGC TTCGATAGAT GCGAAGCACG AAACTCCGTT ATTGCAATTT CGAGATGTTC
GTTTTGCGTA TCCGACGCGG CCGAAAAAGA AGGTTTTCGA CGATTTTAAC CTTACCATCA
TGAAGGGTGA AACAGTAGCT TTGGTAGGAC CTAGCGGTGG TGGTAAGAGC ACGACGGTTG
GCTTAATGGA ACGGTTCTAC GATCCGACCG AGGGCACGCT TGAGTATTTA GGGATGGACG
TGAAGTCTTT AAATGTGCCT TGGTATCGCG ACCAGATTGG CTATGTGGGG CAGGAACCAA
CTCTTTTCAA CGATACTATA TCTCGAAATA TTGCGTACGG TGCGCCGGGT GCGTCGCAGT
TCGAGATTGA GGAGGCCTGC AAGCGCGCGA ATGCTCACGA CTTTATCATG GAGTTTCCGG
ACGGTTACAA CACACCCTTG GGCGAGTCGT CTCAGCTGTC GGGTGGCCAG AAGCAGCGTA
TAGCCATTGG TAAGTTTACA ATGGGTTTTT AGATGTACGG CAATCGACGT AATACTCACG
ATCTCCTCTT CTTTCGGTAG CCCGCGCCTT GGTGAAACGA CCTAATATCT TGATCTTGGA
CGAAGCCACG AGCGCCCTAG ACAACGAGAG CGAGGCAGTT GTGCAGGCTG CCATTGACAA
GCTGATGAGC TCAAGTGAGC ACACGGTTGT ATTGATCGCG CATCGTTTGT CTACGATACG
AAACGCTGAC AAGATAGCGT TCGTGGCCGA CGGCAAAGTT TTAGAATATG GCAGTCACGA
AACTCTAATG GAGCGCCCTC ATGGCCGCTA TAAGCGTCTT TTTGAATCCT CTCGACGGGA
TGCCACTCTG TCAGCTCTCA ACAGCCAATC GAAGAAAGCT TCCGGTAAAG ACGTAGATCG
GGAAGAAGAC GAAGAGATTG ACTGGGAGGG AAAGATCCAG GCAGAAGAGG CCGCTGCATT
CAATGCCAAA CGCGCTCGAG ACATGGCCAA ACCAGATTCT TCATACATGC TTATTGGTGC
CATTGGAGCA GTGATGGCTG GAGGTGTATT CCCGATGTGG GGCGTTCTTT TTTCTGAAAC
AATTGACTTA CTTTTCCAGC CTGTACTCCT TTGTCCCGCT GAGGATGGAA GCATTCCGAA
CAATTTCCCA ACTTGTGAGG ATTACTGGAA AGGTATCGCC AACGATATGC AGGACCGCTC
CTTTGCGTTA GCTGGCTATT GGGCTTGTGT AATGTTTGGG TGTCTTGTTG GCAATGTGCT
GACCTTTTAT GGCTTTGGCA CTGCAAGTGA GCGTCTGAAC AAACGGGTTC GTGACATGTC
CTTTACCTCT TTGTTGCGCC AGGAAGTCGC TTTTTTTGAC ATGCGAAGTG TCGGAAGTAT
TACCTCGCAG TTACAGGACG ACGCAGCCCG TATTCATGCG TTTTCTGGTG AACCGGTTCG
ATCGTTTATC ACAGCACTTT CCTCCATCGT TACAGGTGTA GTACTATCTT TTATTGTAAG
TGGCTTACCG TTTTCAAATG GGTGGCCATA GCTCAAGAAA TATCTGACGC GTATCTTTCT
TGCACGAAAT ACAGTTCATG TGGCCTTTCG CTCTTTTGGC AATTGGTTGC GTTCCTCTGA
TGGGATTTGC TACATCGCTG GAGATGAAGC AGATGCTTGG AGAGGACGAA GGCGATGTGG
ATAACGTTGT TGAAGCACTG AACACCCCGG GTGGCCTTAT TGTGGAAACG TTGTTGAATA
TACGCACTGT GTCGGCTTTG ACGCTTGAGA ACAAGCGTTT TACGGACTAT CAAGATTCTT
TACTGAAAAC GGAGCCAGAC TTTAAATTTG ACGCTTTTAT GACTGGTTTT GTCAGCGGAA
TTTCTATGTT TATTCAGCAA TGGATCAATG GATTGCAGCT TTGGTTTGGT GGATATATTC
TTTCCAAGTT TCCGGATGAC TACGACTTCA ACGACTTCCT CATTGCCAAC TTTGCTGTTC
TATTCGCCTT GTTTGGTCTC GGTGCGGCGT TTCAGGACAT TTCTGACCGA AAGGAAGTGG
AGAAGAGCGC GGGGCGTATT TTCTACTTGC TGGATCGTGC CTCTTCGATT GATCCTCTTT
CCACGGAAGG AAAAAAATTG TGATTATTCC TTCCATTAAA GTAGTCTGAA CTTTGGCTCA
CATGTCAACT CTGTCTCTGT AATTATGGAT CGTTGACCAA TTTACAATCT CTTTGCGTTG
GTGAATGGTC AAACTCTGCA AAATTCGCTT GAGAAGATTG TAAATTCCAG TAGCCTTACA
GTTACAGTTA ATAATAGTAT TCTACGGAAA ACAAGTT
 
Protein sequence
MPAADDLEET NHSTSAGGKA KKPKAKQASA VETLHFAFDC GVRVKVLFFF GVIAGIANGL 
VYPILAWLFS SSFSDISAAS TNGLSQIREL AFTFLIVGVY ALVCATIQSF CFELVAYHAS
QNFRLQWFGA LLRQDAAFFD VYDVGGIAAQ VGPNANKYRR GMGRKFGEGV QFLTTGIGGI
GFAFFVSWRI AFVVLCVIPF VSVAALMVVQ LNQQKGARAS KSYKRAGSVA YSSVSAIKTV
LSLNAVPTML KQYSQATQEA FADAVSILLK QGLANGSMLG AFLMLYAILA LYGSALLYRD
VEDTGCDPSG GVNDNATCPN SGSDVFGAML GVAFAGQGVS QVGNFFEAFA AARIAAFEAY
SAIRRTAGAP AETIYKEDDV EDLNSTVHSR KSKKSEPDVE SAERPIKAIL PKYEIDSTSD
KGKKPSDIAG TLAFNDVRFN YPTRPTEAIL KGLSVEIEAG KISAFCGPSG GGKSTVMSLI
ERFYDPLSGS VSLDGVNLRD INVSHLRSMI GYVGQEPTLF ATSIRGNIRF GNPDATDEMI
ESAARMANAH DFIMSFSDGY DTQVGDRGSQ LSGGQKQRIA IARVLVHNPK ILLLDEATSA
LDAESELVVQ DALDKILEQK NITTVIIAHR LSTIRNADVI NVVVGGVVAE KGTHDELMAG
DTYYRKLVEK QEGQDRADTD SSPGTSRNSS SVDLVQLAET SKENMRASID AKHETPLLQF
RDVRFAYPTR PKKKVFDDFN LTIMKGETVA LVGPSGGGKS TTVGLMERFY DPTEGTLEYL
GMDVKSLNVP WYRDQIGYVG QEPTLFNDTI SRNIAYGAPG ASQFEIEEAC KRANAHDFIM
EFPDGYNTPL GESSQLSGGQ KQRIAIARAL VKRPNILILD EATSALDNES EAVVQAAIDK
LMSSSEHTVV LIAHRLSTIR NADKIAFVAD GKVLEYGSHE TLMERPHGRY KRLFESSRRD
ATLSALNSQS KKASGKDVDR EEDEEIDWEG KIQAEEAAAF NAKRARDMAK PDSSYMLIGA
IGAVMAGGVF PMWGVLFSET IDLLFQPVLL CPAEDGSIPN NFPTCEDYWK GIANDMQDRS
FALAGYWACV MFGCLVGNVL TFYGFGTASE RLNKRVRDMS FTSLLRQEVA FFDMRSVGSI
TSQLQDDAAR IHAFSGEPVR SFITALSSIV TGVVLSFIFM WPFALLAIGC VPLMGFATSL
EMKQMLGEDE GDVDNVVEAL NTPGGLIVET LLNIRTVSAL TLENKRFTDY QDSLLKTEPD
FKFDAFMTGF VSGISMFIQQ WINGLQLWFG GYILSKFPDD YDFNDFLIAN FAVLFALFGL
GAAFQDISDR KEVEKSAGRI FYLLDRASSI DPLSTEGKKL