Gene PHATRDRAFT_47974 details

Gene Information

Locus tag: PHATRDRAFT_47974
Symbol:
ID: 7203223
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 598961
End bp: 602043
Gene Length: 3083 bp
Protein Length: 897 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002182428
Protein GI: 219124264
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0286581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATCTCCATC GTCGTTTTGT GTTCTTCAAT AGAACTACGA AGCAACCCCT CCGTCTCTCT 
GCTCCCAGCC CTGCCCTACC GTCCATCCGC CTTATCCTCA GTCCCAATGT CGACCTCCGC
TCAATTCAAA CTGAGCGACT TTCCTCACAA AGTCCTCGAA CCCATTGCCA CCCTCACCGC
CCCACCGACT TACGCCACTC TCAAAGTGGC CCAACGTCAA CTCAGTACCA ACGCCGCCGC
CATCCCTACG CTCAACGGCG GCGGCGCGCA CGGCCACATG GCCCTCACAC TTACTGCCCG
CGCCTACGCC GACATCAGCG ACGTCCCGTT CGACATCCCC GTCGCCCCTC CGGCCAACCC
TCCCGTCGGC ACCACGCAAC CGCAAATCAC CGAGTTCAAC CGCATCCACC AACGCGATGC
CGACATTTAC AACCTTTATG TCGCCGTCAA TAACGCCCTC CGTCAGCAAC TCCTCGACGC
CATCCCGAAA ATCTACGTAC GCGCCCTCGG GCATCCCATT TTCGAATTCA GCACCGTCAC
CTGCCTCGAT CTGCTCTCTC ACCTTTGGAC CAAATACGGC ACCATCAAGC CCGCCGACCT
TCAGAACAAT TTGCAGTCCA TGTATACCCC GTGGAACACC GCTGAGCCCC TTGAGACTGT
TTTCCTTCAG CTCGACAACG CCATTGCGTT CTCTATTGAC GGCAACGACC CCATCTCTGA
GGCCGCCGCA GTTCGTGCCG GCTACGATGT CCTTGCCCAC TCCGGCCTTT TTCCCCAAGA
CTGCAAGGAG TGGCGGAAAT TACCCCTTGT TTCTCACACC CTTGCCAACT TCCAGGCCCA
CTTCACTCTT GCCGACAAAG ACCGGCGCCT CACGGCCACT ACTGGATCCC TCGGCTACGC
GAATGTGCTC ACGGCCACTC CCTCGCTTGC TCCCACCATT AGCTTGGACC CTCTCAGCCT
TCCTTTCTCA GCTCTCTCTA TGTCGGATTC CTCTGTTCAC TCGCCTGCCA TGACCTACTG
CTGGACTCAC GGCACCAGCA AAAACCGGCG CCATACCAGC ACCACTTGCA AGAACAAGGC
ACCGGGCCAT CGCGACGACG CGACGGCGAC CAACACGCTT GGCGGCTCCA CCAAAATTTG
GACAGCCCCC AAGCCCCCTG AATAGGAAGG AGGGACGGCT ACGCCGATGA CTAACTCTAG
TAATATCGAT CATTTAAATC ATATTACTAG TCTTAATTCA TCTGTAGTCC CCTCCCCGCC
TAGTCCACAC ACCTCAGCAA TTGCCGACAC AGGCTGCACC GGTCATTACA TCACGGTTGA
CTGCCCCCAC CAAAACAAAC AACCAGCAAA CCCAAGCCTC TCCGTCTGTG TCCCCAACGG
CTCCGTCCTC CGCTCAAGCC ACGTTGCCAC CCTGGCCCTT CCTGGTTTCT CCCCTACCGC
CTGCCAAGCC CACATATTTC CTGGGCTCGC TTCACATCCG CTCCTCTCCA TTGGCCAACT
ATGCGACGAC AGCTGCACAG CCACCTTCTC AGCCACTCGC CTCGACATTC ATCGCGACCA
AACGCTGCTG CTCTCCGGCA CCCGCTCCCG CCACACCGGC CTCTGGCACC TCGATCTCGC
CCCATCCCCT CCTCCCGCCA CAGCCCATGC TCTCATCCCT CATTCCTCCC TGACCGACCG
CATTGCCTTT ATCCATGCCT CCCTCTTCTC CCCGGCTCTC TCTACTTGGT GCAACGCCAT
CGACTCCGGT CATCTAACCA CCTTCCCGGA CATCACCGCC CGCCAAGTAT GCAAATACCC
ACCGGCCTCC CCTGCCATGA TTAAAGGCCA CCTCGATCAA CAACGGGCCA ACCTCCGGTC
CACCAAGTCT CCTCCCATTG GTCCCCTGGC GGCCCCCATT GCCCCTACTG CCACCTCAGC
TCACGACCGC CCACCTGTTG CTCGCACGCA CCACGTCTTT GCCACCCATC AGCGCGTCAC
TGGACAGATC TACACTGACC AACCAGGCCG TTTTCTCACT CCTTCCAGCG CTGGACACAC
CAACATGCTC GTCCTCTACG ATTACGACAG CAATGCCATT CACGTCGAGC TGATGAAGAG
CAAATCCGGT CCCGAGATCC TTGCGGCGTA CAAACGTGCA CACACGCTTT TCACCGAACG
CGGCCTCCGA CCTCAACTCC AACGCCTGGA CAACGAAGCC TCTGCAGCCC TCCAAACCTT
CATGACTTCC GAACATATCG ACTTTCAGCT TGCTCCCCCG CATCTGCACC GTCGCAATGC
AGCCGAACGG GCCATCCGCA CCTTCAAGAA CCACTTCATT GCTGGCCTCT GCAGCACTAA
CCCGGATTTC CCGCTCCACC TGTGGGACCG ACTCATTCCC CATGCCCTCC TTACCCTCAA
CTTACTCCGT AGCTCCCGCC TCAATCCCAA GTTATCGGCC CACGCTCAGC TCCACGGTGC
CTTCGATTAC AATCGCACTC CACTCGCTCC CCCTGGCACT CGCGTCCTCG TTCACGTTAA
GCCCGCCGTT CGCGAAACAT GGGCACCCCA TGCAGTCGAA GGTTGGTATC TCGGCCCAGC
TCTGCACCAT TATCGCTGCC ATCGCGTCTG GATCACCGAA ACACGGGCAG AACGTGTCGC
CAACACCCTT TCTTGGTTAC CTAGCCAGAT CCCCATGCCT ACCGCCTCAT CCAACGACCG
TGCCCTGGCC GCCGCCCGCG ATCTGGTCCA TGCGCTCCAA AATCCCTCCC CTGCTTCCCC
GTTTGCACCT CTCGACGCAC ACCAGCACCA GGCCCTCACC CACCTTGCCG ATCTCTTTGC
CACCATTGCC GCACCTGCCT CTGCCGCCCA GACACCTGCT CCCGTCCCCA CGGTCCGTCC
CCCTGACCTA CCCGCCACCC CACCTCAGGT CCGCTTTGCC GTCCCGCTGG TCACCGCTGC
ACACGCCCCT GCCCTTCCGA GGGTGCCAAC ACCCTCGCCC GCACTTCCGA GGGTGCCCAC
CATGGCCACC TATTGCTCTC GCACAGGTAA CCCCGGCCGC CGACGCCGCA CAGCACGCAA
ACAGCCACCA ACCCCAACCC TAG
 
Protein sequence
MSTSAQFKLS DFPHKVLEPI ATLTAPPTYA TLKVAQRQLS TNAAAIPTLN GGGAHGHMAL 
TLTARAYADI SDVPFDIPVA PPANPPVGTT QPQITEFNRI HQRDADIYNL YVAVNNALRQ
QLLDAIPKIY VRALGHPIFE FSTVTCLDLL SHLWTKYGTI KPADLQNNLQ SMYTPWNTAE
PLETVFLQLD NAIAFSIDGN DPISEAAAVR AGYDVLAHSG LFPQDCKEWR KLPLVSHTLA
NFQAHFTLAD KDRRLTATTG SLGYANVLTA TPSLAPTISL DPLSLPFSAL SILNSSVVPS
PPSPHTSAIA DTGCTGHYIT VDCPHQNKQP ANPSLSVCVP NGSVLRSSHV ATLALPGFSP
TACQAHIFPG LASHPLLSIG QLCDDSCTAT FSATRLDIHR DQTLLLSGTR SRHTGLWHLD
LAPSPPPATA HALIPHSSLT DRIAFIHASL FSPALSTWCN AIDSGHLTTF PDITARQVCK
YPPASPAMIK GHLDQQRANL RSTKSPPIGP LAAPIAPTAT SAHDRPPVAR THHVFATHQR
VTGQIYTDQP GRFLTPSSAG HTNMLVLYDY DSNAIHVELM KSKSGPEILA AYKRAHTLFT
ERGLRPQLQR LDNEASAALQ TFMTSEHIDF QLAPPHLHRR NAAERAIRTF KNHFIAGLCS
TNPDFPLHLW DRLIPHALLT LNLLRSSRLN PKLSAHAQLH GAFDYNRTPL APPGTRVLVH
VKPAVRETWA PHAVEGWYLG PALHHYRCHR VWITETRAER VANTLSWLPS QIPMPTASSN
DRALAAARDL VHALQNPSPA SPFAPLDAHQ HQALTHLADL FATIAAPASA AQTPAPVPTV
RPPDLPATPP QVRFAVPLVT AAHAPALPRV PTPSPALPRV TPAADAAQHA NSHQPQP
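
The coordinate and composition fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the helper names (gene_length, clean_sequence, gc_percent) are hypothetical, and it assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length of 3083 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end_bp - start_bp + 1


def clean_sequence(block: str) -> str:
    """Strip the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Start bp and End bp as listed in the Gene Information section.
length = gene_length(598961, 602043)

# First line of the gene sequence as displayed in the record.
first_line = clean_sequence(
    "GATCTCCATC GTCGTTTTGT GTTCTTCAAT AGAACTACGA AGCAACCCCT CCGTCTCTCT"
)

print(length)                  # compare with the Gene Length field
print(len(first_line))         # bases on one displayed sequence line
print(gc_percent(first_line))  # GC percentage of that line only
```

Passing the full gene sequence through clean_sequence and gc_percent gives a figure that can be compared with the GC content field.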