Gene PHATRDRAFT_45913 details


Gene Information

Locus tag: PHATRDRAFT_45913
Symbol:
ID: 7201002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 649885
End bp: 653015
Gene Length: 3131 bp
Protein Length: 1012 aa
Translation table:
GC content: 44%
IMG OID:
Product: predicted protein
Protein accession: XP_002180287
Protein GI: 219119041
COG category:
COG ID:
TIGRFAM ID:
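
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes 1-based inclusive coordinates and a single stop codon after the last amino acid; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 649885
END_BP = 653015
GENE_LENGTH_BP = 3131
PROTEIN_LENGTH_AA = 1012


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length = span_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

print(length)        # 3131, matches the listed Gene Length
print(cds)           # 3039
print(length - cds)  # 92 bases of the listed span lie outside the reading frame
```

Under these assumptions the listed gene length agrees with End bp - Start bp + 1, and the span is 92 bp longer than the 1012 codons plus one stop codon would require.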


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AATGATTGGC ATGATGGATA GCGAGGAAAC GGAGACTGCT GAAAGCGAAA CAATTGTAAT 
AAATGAGGCT GGTTCAATCA CCCCCAATGT GAACGGCACA TTGACTGACG AAGATGCAGG
CGCGCAAGAA AATGCAAAAG GGTTGCCACA GACAGCTTCT ACACAGTGCT CCGGTGATCC
CCAGTCGAGA GCGCCGGTGA TGTTGGACGC GAAAGGCAAA AGCAATACTC TACCAGTCCA
AACGCAAATA TATCGCACGC GTATACCCGC GATCGCTTCA GAGGCTGACG TATTTGGCTC
GCAACGTGAT ACAATCGACG CACGAGAGAA AGCACCGCGC CCTTTTGAAT ATGAAATTGG
AATGGAAGAA GGGAGTGCTG CCTTTGCCTT GCCATCTCCA TTACTTTGTG AAGGAGCATC
TAGACCAGGC GCATTTCCAA TGGGCCTTAC TCGTCCCTTA GATGACAACG AAGATTTAAG
CGCACACTCG AATCTTTTGC AGTCTGAATC ATTTTTACAT ACGTCGTCCT TGTTGAGCAT
GGATGAGTCC AGCGGGATAT TGGTGGAAGC AAGCTTGGTT CCTGATGATA GTTTTATTGA
GACAAGTACA ACAGACAATA AACCAACTGC CCTTGTCCAG GCAAACCCTC TGCCAGACAC
GGCTTTTTTC TTTCGCCCGA AGATAATTTG CACTGTTCTT GGCACACTCA TAATAATTAT
TCTTGCGTTG GCGTTGGGTA TAGTCCTTTC AGCAAGATCC ATTGATGAAG GAGTCAGTGC
GCTAAACGAG GGAAACTTAT CAGCATCATC AAGCGACGCA CCAACCTCAG CAAATACATT
AATTGAGCTG TTTCCTATTG AAGCTTTGCC TACCTACACA CAGATGTCAC TGCAGGATCC
CTTGTCTGCT CAGTCAAGAG CCGTAGCTTG GCTTAAGGAT GACCCGCTTT TGGCAAGCTA
TTCCAATTCT CGTCGACTAC AGCGATTCGC ACTAGCAACG CTTTACTATT CGACGAGAGG
AGAAATGTGG AGTACAAATG ATGGCTGGCT GTCGGAAACG GATGAGTGCA CCTGGTTTTC
TGACCTCGGG AGTGCGTCTT CAATTTGCGA CGCAGGCATC TACAGGGCCA TTTCGTTGGT
AAAGAACAAA CTGCGTGGGA CTATCCCGCA GGAGATGGAA TTGCTGGACT CACTTGTACA
CTTGCGAATG AATTTCAACT TTGTTTTCGG CCCCTTGGTA CCTCAAATTG CAAACCTTAT
CGCATTGGAA GACCTTCAAT TAGCAAATGC TGGTTTACAG GGTCCATTTC CAATGGAACT
AGCCTCCCTT ACAAAATGTC GAAAAGGCTT TTTCCATTCC AACGATTTTA CAGGAGGGTT
ACCTTCTGAT TTATTTGCGA GCTGGACTGC TGTCAAGGAT TTAGATTTTG GGCGAAATAT
GTTTGAAGGC TCCATTCCAG TGGAAATTGG TGCCATGGTG AACATTGTTT CATTGGGATT
TGATGACAAT GTATTTTTGA GTGGAGAGCT CCCCTCAGAG TTTGGGCTCT TAACTGCTTT
GGAGTTTCTC TGGCTTCAAG GAAACTCGCT CACTGGAACA GTTCCTTCCA GCTTGGGAGC
TCTCACTCGT TTACGAGAGT TGCAAATACA CAACAACTTT TTCACGGGGG CTTTGCCGGA
AGAACTGTGT GAGCTGGTCA AGAAAAACAA CCTGATTGTA GTTGTGGACT GTTTTCAAGT
CAACTGTGAT TGTGAGTGTG AATGTGTGGA AAGTTCACGA CCTCTTCCAT CAGGACTGTT
GTCAATGCCG ACATTAAATC CAACTATGAT TCCATCAAGT GTTAACGTAA TCTCGCCAAC
TATCACATCA GCCAATCGCC TCCAAGAACT TCGGGACAAT AGCTTACCTG AATTTACACG
CTTAGCTCTT GGCGATCCAA CCAGCCCACA AGCGCGTGCT TTTTCTTGGT TAGAGCAAGA
TCCCAATGTT GATGATTTCT CCGAAAAACG TTTATTTCAT CGTTTTGTAC TGGCAACCTT
GTATGAATCA ACAAATGGAG AGGCTTGGAT CAGAAACGAC GGGTGGATGA CATACACTCC
TGAATGTGAC TGGTATTTTG ACACAACATG GACTGCAAGT CCGACATGTC TGGGAGACAC
ATTTGAGTAC CTGGTACTGG AAGACAACAA TTTACAAGGA TCTCTTCCTT TAGAATTAGG
CTTGTTGACG GGATTGAAAG CTATTGTGCT GTCGCAGAAT TTTCTTTCCG GTGAAATTCC
ATCAACTCTG GGATCAATTT CTGGCTTGAT AGAGCTTGAG CTGAGTGAAA ACAATTTGGA
GTGGTTCATT CCTACAGAAT TGGGTCTTTT GACCTCACTG ACTGTACTTA ACTTGCAGTC
TAACAATCTG AGCGGCTCAA TACCAAGAGA AATTGGCAAT ATGTTAAAGC TGGAGTACTT
ATTCTTGGAC ACAAATATCT TGACGGGAAC ACTACCAATG GAACTTGGAA ATCTCGTCAA
CCTACTTTCA ATATGGATCT TTCGCAATGA CTTAGATGGT AGCATTCCAT CTTCTCTTGC
TGACATCTCT CGACTAGAAG ATTTGCAGAT CGACCGAAAC TTATTGAATT ACAGTCTTCC
GTCACCATTA TGGAGGGCTC TGAGTCGGGC TGCATTTATT AATGTTGCTG ACAATCTGCT
ATCAGGAACA ATTCCATCTC AAGTCGGTTT GCTACGTCAA GTAGTCATGA TCGACTTCTT
TGATAATTTG TTTTCTGGCA CAATTCCAAC AGAGTTTGGC TTGCTCACCA ACTTGGAGGA
GCTCAGTTTT GTGGACAATA TCTTCTCAGG GACAATACCT ACTGAGCTTG GTCTTCTTAG
CAATATGAGG ACTTTATTTC TGCATGACAA TTACTTTCAT GGAAGTGTTC CCAGTGAACT
TTGCAATTTA GTCCACTCAC AATCCTTGGA TCTGTCAGTT GATTGTAATA TGGTGATCTG
TACATGTAAC TGTGAATGCG ACTTATCCTT CAGGCTTTGA GTTCTGAGGA TTCATCAAAG
TCTGTTGGAA GTCAGTAGAG TTAGCAGCGC ATAATTTACT TTTAACTGGA TATGGCGAGA
AACAGTTCTT T
 
Protein sequence
MIGMMDSEET ETAESETIVI NEAGSITPNV NGTLTDEDAG AQENAKGLPQ TASTQCSGDP 
QSRAPVMLDA KGKSNTLPVQ TQIYRTRIPA IASEADVFGS QRDTIDAREK APRPFEYEIG
MEEGSAAFAL PSPLLCEGAS RPGAFPMGLT RPLDDNEDLS AHSNLLQSES FLHTSSLLSM
DESSGILVEA SLVPDDSFIE TSTTDNKPTA LVQANPLPDT AFFFRPKIIC TVLGTLIIII
LALALGIVLS ARSIDEGVSA LNEGNLSASS SDAPTSANTL IELFPIEALP TYTQMSLQDP
LSAQSRAVAW LKDDPLLASY SNSRRLQRFA LATLYYSTRG EMWSTNDGWL SETDECTWFS
DLGSASSICD AGIYRAISLV KNKLRGTIPQ EMELLDSLVH LRMNFNFVFG PLVPQIANLI
ALEDLQLANA GLQGPFPMEL ASLTKCRKGF FHSNDFTGGL PSDLFASWTA VKDLDFGRNM
FEGSIPVEIG AMVNIVSLGF DDNVFLSGEL PSEFGLLTAL EFLWLQGNSL TGTVPSSLGA
LTRLRELQIH NNFFTGALPE ELCELVKKNN LIVVVDCFQV NCDCECECVE SSRPLPSGLL
SMPTLNPTMI PSSVNVISPT ITSANRLQEL RDNSLPEFTR LALGDPTSPQ ARAFSWLEQD
PNVDDFSEKR LFHRFVLATL YESTNGEAWI RNDGWMTYTP ECDWYFDTTW TASPTCLGDT
FEYLVLEDNN LQGSLPLELG LLTGLKAIVL SQNFLSGEIP STLGSISGLI ELELSENNLE
WFIPTELGLL TSLTVLNLQS NNLSGSIPRE IGNMLKLEYL FLDTNILTGT LPMELGNLVN
LLSIWIFRND LDGSIPSSLA DISRLEDLQI DRNLLNYSLP SPLWRALSRA AFINVADNLL
SGTIPSQVGL LRQVVMIDFF DNLFSGTIPT EFGLLTNLEE LSFVDNIFSG TIPTELGLLS
NMRTLFLHDN YFHGSVPSEL CNLVHSQSLD LSVDCNMVIC TCNCECDLSF RL
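
Both sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and how the database derived its listed 44% GC content is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First printed line of the gene sequence, used here only as sample input.
first_line = (
    "AATGATTGGC ATGATGGATA GCGAGGAAAC "
    "GGAGACTGCT GAAAGCGAAA CAATTGTAAT"
)

dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence text to clean_sequence should give a string whose length can be compared with the listed Gene Length of 3131 bp, and the same helper works for checking the protein sequence against the listed 1012 aa.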