Gene PHATRDRAFT_43214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43214 
Symbol 
ID7196578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2338220 
End bp2343557 
Gene Length5338 bp 
Protein Length1657 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176961 
Protein GI219110419 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAGAATTGA ATCTTCCAAT GAATCTTTGA GATCAGCGAG GTCCGCCCAG AAACACGGGA 
CTTGATGCGA ATGCTTTCCG TTTCGGGTCG TTGAAGACTT GTTTCTTGAT AAGAGTGAAT
ACGCAATCTT CTACCACAAA TCGGTTGGCT CACTTTTCGC CAATGGCTGT ACTGAACGAG
AACAAGAAAA CAGGTGCGTA TGTGTAAGGC GAATCTATCA CTTTTTCTGC ATAATTGTGG
CGTCTTCATC CTAACAGTGC TTTCCTTCTC GGTCTCTTGG TTCTCAAGAG AGTTTGTTTG
ACCGAAACTA GAGCGAAATG GCACCGCAGC AACGAATCGA GGCCGTGCTC GACAAGGAAG
GTACAACTCC GGTCCTTCCT CTCCTTCAAG CAGCTCAGCA TCTTCTGAAG CCTAACGGTT
CGATCGTTGG CCGCGATGTG GCCGACACGA TCCGCCCTTT ATTTCGAACT TTTTGTTCGG
CTATTCCGAC TTTCTACGAG GAAGCAAAAA CTAAGCCGGC TCTCGCGAGT ATTGATGGAC
GCAGCCCCAT TTCACATTCT CGCATTCACG AATTTATTGT CAACGAATAC GGCCCCACCC
TTCATGCTCT CGGGTTCGGA CGAGGCCATC GAATCGCATT GGTCTTGCCG AACGGGCCGG
AATTAGCCTT GGCGATCGTC GCTACCGCTC AATACGCATC CTGTGTGCCC TTGTCGGCCA
ACGGCGCCGC TAGCGAGTTG AAAGCCGATC TGCTGCGTGC CGGGGTCGAT CTCGTCGTCG
GTCCATACTC AGGTCGTATT CATAACGATT TTATCCAAAA CTCGCAAGCA ACCGACGAGC
GCTTCCACGT CATGCCATGG ACGGAAACAG ACTGGAGCAT GTTTGCAGAG ATCGAACAAA
CCGCTACAGG TTTGGGAATC CCTTTTGCGG GGTTGGTTCC GAGTCCCTAC GAGTCGGGTA
TTTTCAAAAT TGTTACCAAC GCAGCGACGG ACACTCTGCG ATTCTCCGAT CGTCACGAGA
TTATTTGTCC TATAAATCGA AGCACCAAGG CTTTTTTGAC CAAAGAAAAC ACCTTGAATG
ATGAGGTACT CGTCTTGTTC ACATCAGGAA CTACCGGTAA CAAAAAGCTT GTCCCACACC
AACTCGGTGA CATCTTAACT GCAGCAACGA CGATTGCTTT GAGCTGGGCT CTAACTCCCG
ACGACGTCAA CTGCAACCTC ATGCCCCTCT TTCACGTCGG AGGCATTGTC CGTCAGGTCT
TTTCACCCCT CGTCTCCGGC GGTTGTGTAA TTTGCTGTCC TAGCTTTGAT CCAAGCATCT
TTTGGCTGCT ACACGTAACA AAACAAGCCT TTACTTGGTA CTACGCCGCG CCCACCATGC
ATCAATTGAT TTTGCAAACG GGACAAGCAG ACGGGTTCTT GGTAGAAGGC AAACATTGTC
CACCTTTGCG AATGATTGCC AACGCTGCTG GGGGGTTGTT ACCCTCGTTG GCACTGCAGC
TACGGGATAC ATTTGTCGGT GCCACCGTGC TGCCATCCTA CGGAATGACC GAGTGTATGC
CAATCAGTTC TCCGCCGGCA ACTTATCAGT TGGAGAAACC TGGAACGTCG GGAGTGGCTG
TGGGACCGGA AATTGCTATT CTCAATACAA CGACAATGAA ATCGTTACCG ATCGGAGAAG
ATGGTCCAAT TTGCGTTCGT GGTGATCCTT GTTTCCGTGG TTACGGAAGG ATTGCCAATG
ATTTTTCGGA GGCCGTGAAT GACACATTTA TGGCTGACGG TTGGTTCAAT ACCGGTGATC
TAGGACACTT GGACAAGGAC GGCTATCTCT TTATTACTGG CCGTTCGAAG GAAGTCATCA
ATCGCGGGGG TGAAATCATA AGTCCCATGG AGGTCGAAGA AGCGGTCCTT AGCCATTCCG
ATATTTCTTT GGCAGCTGCC TTTAGCGCTC CTCATGATGT CCTTCAAGAA GTTGTCGGTA
TTGTTGTGGT TATGATGGCA GACCGTCCCC GTCTTGATCT AGCCTCTCTT CATGAGTTTC
TGGGAGAGCG ATTGGCAGCC CCTAAATGGC CGCAGTGCTT GATCTTTATG GATGGTTTGC
CCAAGAGCCA CACTAACAAG CTTCTCCGTG TCAAGCTTGG CAGCCGACTG GGGCTGCCTG
AACTCAGGGA TGATATGCTC GCAATAGATC GTACATGGGA AGGCAAGTGT CCACCGCAGG
GGACGCCGTT GGATGTTGCC ATACCAGTTT TTCCTGTTTC CGTTTCGGCC GAGGAAATCG
AAGAAAAGCT TGCCAGTATG TTGGTGACCA CAAAAAATCA AAATTTGCGG GTAATTCCAC
ATAGCACCCG GACCGGGTCC CTCGTTTGCT ATGTCTACAA CCTTGATCGA ATGGATGCAA
TTTTGCTAGC CCGCAAGGCT TTACCAAGAT ATGCGGTACC CAGTCACTTT GTTTCGCTTG
ACACAATCGA GCTTTTGTCG GGAAAAGTTC TGCCTTCTCC GAGTATGAAG GATGCCGTGG
CTTCTCTTCT CCAACGCTCG TCAAGCGCCA ATACAATTGA TCCTGTAGTC GATAACTTGC
AGAGCTTGTT TGCAGAGCTT TTGTCTTTGG ATTATCTCCC TGGGCCTGAA GCTAACTTTT
TCCACATCGG AGGAAGTTCA ATGTTGGCGT CACAGTTGGC TAGCAAACTT CGAAAGCAAT
TTGGGATAGC CTGTAGTGGG GCTGAAATTT TCCACAGTAC AAACTGTAAC GACCTTGCCA
AGCTTATTTA CCAACGAAGC GACGACTTTG CGACGATTTC ACCAATAGAT TCGAAATTGA
ATGATCAATC AGGCCCAAGC GGACGTGTTG TTGACGACCA TGGCGCACCC TTCCCCTCAA
AACGTCTAGC TATGGATGGT TCATTCCTTC GCTCACTATT TCAGCTTGTG CCTATTCTAA
TTATATTTCC AATTTGGCAG ATATCGCGCT ATATCCTCTT TTTCTGCTTC CTACTCTGGT
CGATTGATGT TGTTCCTGGT ACCCGCGATA TCGGAACCTT TATTGCTGCC TATTTGGCAT
TTCATTTGTG CTGGATTACT ATAACGCCGC TCGTGTTTGT TGCCATCAAA TGGAGCGTGA
TTGGTCGCTA CAAAGCAGGG CGCTACCCAA TTTGGGGTAG CTACTACTTA CGGTGGTGGT
TTGTTGACAT CTGCCGCAAA CTTTTTCTTC GTGGCATCTG GGGTTCCAAC GAGGTGTACC
TGAACATCTA CTACCGCCTA CTGGGGGCCA AGATTGGCAA GGGTGCCCGC ATTAGTCTAG
AGGCTGACTT GGCCGAATTT GACTTGGTAA ACGTTGGGGA AAACGCCGCT GTAGAAGATT
GCACGCTAAG GGCCTTTGGA GTAGACAATG GTGCCATGAT TCTTGGACCG GTGCATGTCG
GAAATAACGG CAGTGTTGGA GCGAAGTCAG TGGTCGCTCC CTTTACTTCA GTCCCTGACG
ATGGTCACCT CGGACCAGTG ACTTCGAGCT ATGAAGTAGG AAAAGCTCTG GACTTGAAGA
ACAGACAGTT TAATCGTCGA TGCCTGGCTG AGCCCAGCAT TTGGCTCCAG GTTTGTCTGG
GTTCGCCGAT CACCTTTGCT GTAAACTGCT TTGCTCAAAT TCCCTCGTTG CTGATCCTTA
TATGGATGCT GAGATACAAA GGCCAGCGTG GCGAGGAGTT CTTGACTCTG AATGATTTAA
TGGAGTGGCT TTGCGATCCT AAGCGTATCC CGTTCTACAT TGGAATTCGT GTTGCTCGCA
ATATTGTGTC TCCTTTTTTT TACATGGCGG CTGCCATTGT TGCCAAAAAG ACTGTGATTG
GCAAGTTTAC AGCCGGTCCG CGTGACACTT GGTCCAGCTG GTCTCTGTTT CGTCACTGGC
TAGCTGCAAC TTTATTCTCT CGTAAAAAAA TTCAGGCCGT CACGGATTTG ATCGGCCGTC
ACTACGAACT TGTTAGTGTC TTGTACCGTC TCCTCGGTGC CAAAGTTGGA AAACGTGTTT
TTTGGCCGGG CTCACAACCT GTGTTCACCG GTGAATTTGA TTTGTTGGAA ATTGGCGATG
ACGTTGTCTT TGGCTCGCGT TCTGGTATTT TCATGACGAC AGACACTTCA TGCGAAAAGG
TTGTTTTGTG TGCCGGTGCC AATGTTGCTG ACAATTGTGT TGTCCTTCCT GGAAGCGTTG
TCGGCAAAAA CGCTGTGCTG GGGTCAAACT CTGTTTGTCC ACTCGGATGG TACTTGCCCG
AGGGTAGCGT CTGGTTTGGA TCTAAGGGCT GCGAGCCCGA TTGCCTCGAC AAGGGTGTAG
TCACAGATTT CGATGGTCCG ATTCTAGTCA CTGATATTGA CGTAAAGACG GTTCCGATGG
TTGGAGATGC TACGACTCTT CGCCCTTTCG GTAAAGCTTT TTACAGAGGA GAAGCGTCTT
ACAGCGTTTG GCCCTTACAT GTGATTATCG CAGCGACTCT CTTGATCAGA AGTATCCTGA
GTGTGTTTCA CACGCTTCCG TTGCTAGGTG CAATTCAAGG CGGTGGCGCC ATTCTATATG
GTCTTCCTCT TATTGACCGG GTCTACGATA GTCATGAGTA CAACTTTTTC CACGTGTATT
TTGCGATTTT GTTTGTCTTT TTCTTCACGC ACGCGCTTCG CGTCGCACTT TGGCTTGTGA
TTGAACTGAC GGCCAAATGG ACTTTGATGG GGCGTCGAGA AGAAGGCCGT TACAACTATG
ATACGAGCTC GTACGCACAG CGATGGGAGC TCTATCAGCT TATTTCCAAA GTTCGAAAAT
TCAATCGTCT CAACTTTTTG GACTTCTTGT CCGGTACCCC TTTTATGGCT GCATACTTTC
GGCTGAACGG AGGCAGAATT GGTCGGGATT GTTGCCTATT TCCTGCTGGG GCCGACCCTT
TCATGCCCGA ACCCGATTTG GTCACGATGG GCGACCGCTG TGTTGTGGAT TGCGCTTCTA
TTGTCTGCCA TTTGAACACA CGCGGCAACT TTGAACTAGC CAGAATCACT CTAGAAAATG
AATGTACGCT TCGTACACGT TCTCGTTTGC AGCAAGGCTG TTACATGGAA CATGGCTCGC
AACTGTTAGA GAAAAGTTTG GCAATGACAG GAGAAGTTAT CGAAGCCAAC AGCGTTTGGC
AAGGTGGCCC AGCGTCATGG TGGTTCCAGT ACTCTCAACG TAGTCTGTAC ATGGCGGATG
AAGAGGAAAC TGCCGATGAA AAAACCAATC TTTTGAAAGC TAAAGTGTCT TCGTACAACG
TTCAGCTTTA GCGTTGGCGA TGAGCTCGTA ATCATTTACA TTTTGACCCT TTACTGTT
 
Protein sequence
MAPQQRIEAV LDKEGTTPVL PLLQAAQHLL KPNGSIVGRD VADTIRPLFR TFCSAIPTFY 
EEAKTKPALA SIDGRSPISH SRIHEFIVNE YGPTLHALGF GRGHRIALVL PNGPELALAI
VATAQYASCV PLSANGAASE LKADLLRAGV DLVVGPYSGR IHNDFIQNSQ ATDERFHVMP
WTETDWSMFA EIEQTATGLG IPFAGLVPSP YESGIFKIVT NAATDTLRFS DRHEIICPIN
RSTKAFLTKE NTLNDEVLVL FTSGTTGNKK LVPHQLGDIL TAATTIALSW ALTPDDVNCN
LMPLFHVGGI VRQVFSPLVS GGCVICCPSF DPSIFWLLHV TKQAFTWYYA APTMHQLILQ
TGQADGFLVE GKHCPPLRMI ANAAGGLLPS LALQLRDTFV GATVLPSYGM TECMPISSPP
ATYQLEKPGT SGVAVGPEIA ILNTTTMKSL PIGEDGPICV RGDPCFRGYG RIANDFSEAV
NDTFMADGWF NTGDLGHLDK DGYLFITGRS KEVINRGGEI ISPMEVEEAV LSHSDISLAA
AFSAPHDVLQ EVVGIVVVMM ADRPRLDLAS LHEFLGERLA APKWPQCLIF MDGLPKSHTN
KLLRVKLGSR LGLPELRDDM LAIDRTWEGK CPPQGTPLDV AIPVFPVSVS AEEIEEKLAS
MLVTTKNQNL RVIPHSTRTG SLVCYVYNLD RMDAILLARK ALPRYAVPSH FVSLDTIELL
SGKVLPSPSM KDAVASLLQR SSSANTIDPV VDNLQSLFAE LLSLDYLPGP EANFFHIGGS
SMLASQLASK LRKQFGIACS GAEIFHSTNC NDLAKLIYQR SDDFATISPI DSKLNDQSGP
SGRVVDDHGA PFPSKRLAMD GSFLRSLFQL VPILIIFPIW QISRYILFFC FLLWSIDVVP
GTRDIGTFIA AYLAFHLCWI TITPLVFVAI KWSVIGRYKA GRYPIWGSYY LRWWFVDICR
KLFLRGIWGS NEVYLNIYYR LLGAKIGKGA RISLEADLAE FDLVNVGENA AVEDCTLRAF
GVDNGAMILG PVHVGNNGSV GAKSVVAPFT SVPDDGHLGP VTSSYEVGKA LDLKNRQFNR
RCLAEPSIWL QVCLGSPITF AVNCFAQIPS LLILIWMLRY KGQRGEEFLT LNDLMEWLCD
PKRIPFYIGI RVARNIVSPF FYMAAAIVAK KTVIGKFTAG PRDTWSSWSL FRHWLAATLF
SRKKIQAVTD LIGRHYELVS VLYRLLGAKV GKRVFWPGSQ PVFTGEFDLL EIGDDVVFGS
RSGIFMTTDT SCEKVVLCAG ANVADNCVVL PGSVVGKNAV LGSNSVCPLG WYLPEGSVWF
GSKGCEPDCL DKGVVTDFDG PILVTDIDVK TVPMVGDATT LRPFGKAFYR GEASYSVWPL
HVIIAATLLI RSILSVFHTL PLLGAIQGGG AILYGLPLID RVYDSHEYNF FHVYFAILFV
FFFTHALRVA LWLVIELTAK WTLMGRREEG RYNYDTSSYA QRWELYQLIS KVRKFNRLNF
LDFLSGTPFM AAYFRLNGGR IGRDCCLFPA GADPFMPEPD LVTMGDRCVV DCASIVCHLN
TRGNFELARI TLENECTLRT RSRLQQGCYM EHGSQLLEKS LAMTGEVIEA NSVWQGGPAS
WWFQYSQRSL YMADEEETAD EKTNLLKAKV SSYNVQL