Gene PHATRDRAFT_38388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38388 
Symbol 
ID7203250 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp776472 
End bp781289 
Gene Length4818 bp 
Protein Length1277 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182298 
Protein GI219123992 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACC 
CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA
ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ATGGCCAACA CGGTCACCTT
TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCATTCC ATGCATTCCT
CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCCCC TCAAATTGTC
GAAGCGAACC GCCAGAACGA CAAACGACAA AAGCTGTTTG ACCTCTATCA CAACGCCATT
AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATCGA ATCTCTCGGT
CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTTTCTCA TCTCTGGGAA
ACTTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCCAACG ACGAACGCAT GAAAGCCGCC
TGGCATCCAC CAACGCCTAT CCAGCAACTC TTCCAGCAGC TTGAAAAAGG CAATCAGTTT
ATCATCGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC
ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCGTTATAA AGATGAAGCC
GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC
ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA
AAATCGTATT GCTGGACCCA CGGCATCGTT CACAACACAA AGCACACCAG TGCGACATGT
GAAAAACAGG CCCCGGGGCA CAAAACCGGC GCTACATTGC ACGACAAACA AGGCGGGTCG
ACTAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG
TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT
CCTCCATTAG CTTCCTCCCC GCCATTTTTC CCTCCCGACG CCATTGCAGA CACTGGCTGT
ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACATT GCCAACCGAC GGTCCCCGGC
ATCAACGTGG TCCTCCCTGA TGGTCACACA ATCACTTCGA GTCATATCAC CGAACTCAAC
ATTCCCTCGC TTCCTCCGGC AGCTCGTACC GCCCATATCT TTCCCGGTCT CTCGAATGGA
TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCTGAC
ACAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC
CGATTGTGGA CCCTCGACTC CCCTGTAACG CCCAATCCGC CCGCCACTGA ATTGCATGCG
CCTGTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC
CGCATTGCCT TTGTTCATGC ATCCTTATTC TCACCACAAC TTTCAACATG GTGCAAGGCC
ATTGACGAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC
CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC
TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC
ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC
GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GCTTTGTTAC CACTTCAAGC
TCCGGCAATG CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCGAA
GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG
CTATCCTCCA AAGGTTTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCCACTGCG
TTACAACAAT TCATGTCCTC TGTTGATATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC
CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG
TGCAGCACCG ACAAGAACTT TCCGCTTCAC CTTTGGGATC GCTTACTCCC ACAAGCCATC
ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA
CTCCATGGCT CGTTCGACTA CAATCGTACC CCTTTGGCTC CCCCGGGCAT CCGCGTACTT
GTACACGAAA AACCGACAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGCTGGTAC
GTTGGTCCCG CCATGAACCA TTACCGATGT TATCGCGTCT GGATCAAGGA GACCACCAGC
GAACGCATTT CTGACACTCT GACATGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG
TCTCGCGACA CAATTGTCGC CGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCATCT
CCCGCGTCCC CCTTGTCGCC TCTTTCGGTC CACGAACGCG AAGCCCTCTC GCAACTTTCA
GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTCGC TCCCACGGCA
ACCCTAAGTC CGCCAACTGC ATCGACTTCT TCACCTCGTC AAGTCCGCTT CCGAGACCCG
GTCACTGCAT CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCCTCCGCA GTCACTTCCG
AGGGTGCCTC CCCCAAACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC
CAAGCACGTC GTAGGGCCGC TCGAAAACTG AAAGAAAAAA TTTCCGCTTC AGCATCCGTT
GTTCCTACCC AAGCAACACC TGCACCCGTC GTACCTTCTC CCAAGGTCCC CACACCTCCG
CACAGTCACG GCACTCGCTT ACAAGCCGCT CGATACCCAG GACACTCGTT CGACAGCGCC
AACGCCGTCG TCGACCCCAA TTCCGGAGCC ACTCTCGAGT ATTCAAAACT CAAAAATTCT
GAACAAGGCC CCGAATGGAT TCAAGCCGCC GCCAATGAGA TGGGCCGCCT GTCTCAAGGC
GTCAAACCCA ACATGCCCAC CGGCACCGAC ACGATGCATT TTATTCCGCA TACCGCAAAG
CCGCACGACC GCAAGGCCAC TTACCTGAAG ATTGTAGCGG CTATCAAGCC ACACAAGGCC
GAAAAATACC GCATCCGTTT CACTGTCGGC GGCGACCGTA TCGAGTACAA CGGACCCACA
AGTACCCCTA CAGCTGCATT ACCAGCCATC AAGATCCTCG TTAACAGTGT CATTTCCACC
AAAGGCGCAC GCTTTATGAC CTGTGACCTC AAGGATTTTT ATTTGGGCAC TCCTCTCCCT
GTGTACGAGT ACATGCGCAT TCCTGCAGTC CATATACCAG ACTGCATTAT GGAACAGTAC
AAGCTTGCCC CGCTAGTTCA CAAAGGCAAT GTTCTAGTGG AAATTCGAAA AGGAATGTAC
GGTCTCCCAC ATGCAGGCCG CATTGCGAAC GACCGCCTCA TTGATCATTT AGCTCTCGAC
GGATACCATC AACTGACCCT ACCGGCCGCT TTGTTACCAC TTCAAGCTCC GGCAATGCAT
ACATGCTAGT GGTTTATGAC TACGATAGCA ATTTTATTCA TGTCGAAGCC ATGAAGAACC
GCACCGGTCC CGAGATTTTG AGCGCCTACA AGCGTGCTCA CGCCATGCTA TCCTCCAAAG
GTTTGCGCCC CCAACTCCAA CGCTTAGACA ACGAAGCCTC CACTGCGTTA CAACAATTCA
TGTCCTCTGT TGACATTGAT TTTCAATTAG CTCCTCCGCA CGTGCACCGT CGGAACGCCG
CCGAACGGGC AATCCGCACG TTCAAAAACC ACTTCATTGC AGGTTTGTGC AGCACCGACA
AGAACTTTCC GCTTCACCTT TGGGATTGCT TACTCCCACA AGCCATCATG ACTCTCAACC
TTCTTCGAGG GTCTCGTATC AACCCAAATC TGTCGTCCTG GGCCCAACTC CATGGCTCGT
TCGACTACAA TCGTACCCCT TTGGCTCCCC CGGGCATCCG CGTACTTGTA CACGAAAAAC
CGACAATTCG CAGAACTTGG GCCCCCCACG CAGCCGACGG CTGGTACGTT GGTCCCGCCA
TGAACCATTA CCGATGTTAT CGCGTCTGGA TCAAGGAGAC CACCAGCGAA CGCATTTCTG
ACACTCTGAC ATGGTTTCCC AGCCAAGTCA AAATGCCCAG CACCTCGTCT CGCGACACAA
TTGTCGCCGC TGCTCACGAT CTTGCCCATG CTCTGGCACA TCCATCTCCC GCGTCCCCCT
TGTCGCCTCT TTCGGTCCAC GAACGCGAAG CCCTCTCGCA ACTTTCAGAT ATTTTTTCGA
AAGCCGCTAA CCCAGTTGAC TCGTCCCTCC CAGTTGCTCC CACGGCAACC CTAAGTCCGC
CAACTGCATC GACTTCTTCA CCTCGTCAAG TCCGCTTCCG AGACCCGGTC ACTGAATCAC
TTCCGAGGGT GCCGACCGCC ACAGCCGCCC CTCCGCAGTC ACTTCCGAGG GTGCCTCCCC
CAAACTCCGA GGCCGAGACA TACAAGCTTG TCACCTGCAA CCCTCGCCAA GCACGTCGTA
GGGCCGCTCG AAAACTGA
 
Protein sequence
MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL 
FLVISETEYL EMTDGIPCIP PVQPPFDPVH AANATAPQIV EANRQNDKRQ KLFDLYHNAI
KAFRNQLLEA IPIEYIESLG HPTRGFNKVS PLEILSHLWE TFGKIQASDL IANDERMKAA
WHPPTPIQQL FQQLEKGNQF IIASGQVMDE RIIARIGYQI IEKTGLFDLA SRDWRYKDEA
DKTLANFKKH FQKANKDLAL TATSSSAGYH TANQSTVTKG KSYCWTHGIV HNTKHTSATC
EKQAPGHKTG ATLHDKQGGS TKTYQYTPPS SVAPNTPPLA SSPPFFPPDA IADTGCTGHF
LSTNIAHIHC QPTVPGINVV LPDGHTITSS HITELNIPSL PPAARTAHIF PGLSNGSLIS
IGQLCDHGCT ATFTSDTVRI ELNNTVVLRG GRSPYTRLWT LDSPVTPNPP ATELHAPVHD
KNFANHLGDH SGTLADRIAF VHASLFSPQL STWCKAIDEG RLTTFPDITS AQVKRHPPQS
VPMVKGHLDQ QRSNLRSTKP KVTLSASVDP DDINFDTNPV VQDPPAARTQ FLYADFAEVT
GKIFTDPTGR FVTTSSSGNA YMLVVYDYDS NFIHVEAMKN RTGPEILSAY KRAHAMLSSK
GLRPQLQRLD NEASTALQQF MSSVDIDFQL APPHVHRRNA AERAIRTFKN HFIAGLCSTD
KNFPLHLWDR LLPQAIMTLN LLRGSRINPN LSSWAQLHGS FDYNRTPLAP PGIRVLVHEK
PTIRRTWAPH AADGWYVGPA MNHYRCYRVW IKETTSERIS DTLTWFPSQV KMPSTSSRDT
IVAAAHDLAH ALAHPSPASP LSPLSVHERE ALSQLSDIFS KAANPVDSSL PVAPTATLSP
PTASTSSPRQ VRFRDPVTAS LPRVPTATAA PPQSLPRVPP PNSEAETYKL VTCNPRQARR
RAARKLKEKI SASASVVPTQ ATPAPVVPSP KVPTPPHMVY DYDSNFIHVE AMKNRTGPEI
LSAYKRAHAM LSSKGLRPQL QRLDNEASTA LQQFMSSVDI DFQLAPPHVH RRNAAERAIR
TFKNHFIAGL CSTDKNFPLH LWDCLLPQAI MTLNLLRGSR INPNLSSWAQ LHGSFDYNRT
PLAPPGIRVL VHEKPTIRRT WAPHAADGWY VGPAMNHYRC YRVWIKETTS ERISDTLTWF
PSQVKMPSTS SRDTIVAAAH DLAHALAHPS PASPLSPLSV HEREALSQLS DIFSKAANPV
DSSLPVAPTA TLRPLEN